CN102436497B - Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling - Google Patents

Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling Download PDF

Info

Publication number
CN102436497B
CN102436497B CN201110359688.9A CN201110359688A CN102436497B CN 102436497 B CN102436497 B CN 102436497B CN 201110359688 A CN201110359688 A CN 201110359688A CN 102436497 B CN102436497 B CN 102436497B
Authority
CN
China
Prior art keywords
owl
model
ontology
hot
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201110359688.9A
Other languages
Chinese (zh)
Other versions
CN102436497A (en
Inventor
王楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu United Industrial Limited by Share Ltd
Original Assignee
JIANGSU LIANZHU INDUSTRIAL CO LTD
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JIANGSU LIANZHU INDUSTRIAL CO LTD filed Critical JIANGSU LIANZHU INDUSTRIAL CO LTD
Priority to CN201110359688.9A priority Critical patent/CN102436497B/en
Publication of CN102436497A publication Critical patent/CN102436497A/en
Application granted granted Critical
Publication of CN102436497B publication Critical patent/CN102436497B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses a mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling. The system is characterized by comprising the following functional modules: 1), an OWL ontology instance transforming module; 2), an invert index module; 3), an ontology element frequency counting module, which is used for performing the ontology element frequency counting on the OWL ontology invert index table and sequencing according to the size of the frequency; 4), an OWL seed model generating module; 5), a knowledge ontology model polymerizing module; 6), an OWL hot model counting module; 7), a news hot frequency counting module. For utilizing the method of the OWL ontology model analysis, the invention promotes the internet news hot-spot analysis to the concept hierarchy; some related hot-spot concepts are polymerized by using the relationship between the concepts to form the news hot, thereby overcoming the shortcoming due to partial hot-spot analyzing in traditional method.

Description

Based on mainstream media's social focus analytic system of learning type OWL modeling
Technical field
The present invention relates to a kind of mainstream media social focus analytic system, belong to network computer technology field.
Background technology
Traditional internet news analysis of central issue is quoted (reprinting) with crucial phrase or article by other websites, estimated by the frequency that user inquires about.Recently, utilize some short essays to be forwarded in microblogging, be concerned and also can be found out by the frequency statistics commented on " temperature " of this short essay.Effective to the estimation of the temperature of the news report being carrier with these crucial phrases, short essay, article, such as: Guo Mei good job part, two years old child are hit to many people by car and pass by and do not sue and labour, etc.But this analysis of central issue method cannot accomplish extension and the association analysis of concept, such as: Guo Meimei and the Red Cross Society of China? Ka Zhafei's is dead favourable to whom? Europe debt and world financial crisis, the economy of the U.S. on the impact of China, etc.Although some article does not directly occur certain keyword, what it was still said is the same same thing, uses traditional analysis of central issue method will omit such article.
Summary of the invention
Technical matters to be solved by this invention is mainstream media's social focus analytic system, by relevant focus concept polymerization composition hot news, thus makes retrieved relevant focus article more comprehensive.
For solving the problems of the technologies described above, the invention provides a kind of mainstream media's social focus analytic system based on learning type OWL modeling, it is characterized in that, comprise following functional module:
1) OWL instances of ontology modular converter: the conversion of OWL instances of ontology is done to the magnanimity news information text that search engine obtains;
2) inverted index module: this volume elements inverted index is done to OWL instances of ontology storehouse, described volume elements is undecomposable body minimum in OWL instances of ontology storehouse;
3) this volume elements frequency statistics module: this volume elements frequency statistics is done to this volume elements of OWL inverted index table, and sort according to the size of the frequency;
4) OWL Seed model generation module: using this volume elements of specified quantity of coming above as focus Seed model candidate, and in the mode of knowledge model stored in internet news OWL model bank; Seed model be the most original, be often also minimum knowledge model;
5) Ontology Model aggregation module: each Seed model in internet news OWL model bank is taken out respectively, do concept with other this volume elements in inverted index table to be polymerized, and amended hot news knowledge model is deposited back internet news OWL model bank, move in circles, until all Seed models are all complete by polymerization;
6) OWL hot spot model statistical module: traversal internet news OWL model bank, the knowledge model that statistical correlation concept is many also sorts, and the knowledge model coming specified quantity is above taken out as hot news knowledge model;
7) hot news frequency statistics module: find the instances of ontology corresponding with hot news knowledge model by inverted index table, the original article corresponding with these instances of ontology is found again by original document management system, and the frequency that original article is propagated, pays close attention to and commented on is added up, form hot news analysis result and export.
The beneficial effect that the present invention reaches:
The method that the present invention adopts OWL ontology model to analyze, internet news analysis of central issue is risen to concept hierarchy, utilizes the relation between concept and concept, by some relevant focus concept polymerizations, and form hot news, incomplete deficiency in traditional analysis of central issue method can be made up with this.
Accompanying drawing explanation
Fig. 1 is the structural representation of the report focal point analysis system of the mainstream media based on learning type OWL modeling in the present invention.
Embodiment
The present invention is the same with traditional news media analysis of central issue method, first hot news analytic system based on OWL (network ontology language Ontology of Web Language) knowledge model also will obtain the news information of magnanimity by search engine, be also propagate, pay close attention to, Main Basis that the frequency statistics commented on is estimated for temperature.Difference is, traditional is crucial phrase, short essay or article by objects of statistics, and this patent adopts based on OWL (being correlated with) conceptual combinations, that is: OWL knowledge model.
In order to obtain OWL knowledge model accurately, first the conversion of OWL instances of ontology to be carried out to the magnanimity news information text that search engine obtains, see patent " internet text this analysis and OWL converter based on OWL ", then, this volume elements inverted index table is done to the OWL instances of ontology storehouse representing magnanimity news information, see patent " a kind of inverted index method based on OWL ", just lie in this this volume elements inverted index table about the knowledge model of hot news.
The key of this patent is, does this volume elements frequency statistics to this volume elements inverted index table, and sets one group of OWL focus seed knowledge model according to statistics.Here basic assumption is, this volume elements (that is: key concept) that frequency of occurrence is higher in magnanimity news, and it is that the possibility of hot news carrier is larger.
Use this group seed knowledge model to do concept polymerization to whole volume elements inverted index table, that is: this relevant volume elements is aggregated in a knowledge model, each Seed model just " expansion ".Finally, the OWL knowledge model that frequency of occurrence is maximum, related notion is maximum is just defined as hot news model.
Pass through inverted index table, find out corresponding to the instances of ontology of each this volume elements and the original document of binding thereof in hot news model, re-use the frequency statistics method of traditional propagation, concern, comment, obtained original article is added up, the net result that its result is analyzed as hot news.As shown in Figure 1, each functional module and main flow are concrete function logic:
1) the magnanimity news information text obtained search engine does the conversion of OWL instances of ontology, and be 2011102707850 see patent " internet text this analysis and OWL converter based on OWL " application number, the applying date is on September 14th, 2011;
2) this volume elements inverted index is done to OWL instances of ontology storehouse;
3) this volume elements frequency statistics module does this volume elements frequency statistics to this volume elements of OWL inverted index table, and sorts according to the size of the frequency;
4) OWL Seed model generation module, using this volume elements of specified quantity of coming above as focus Seed model candidate, them with the form of knowledge model stored in internet news OWL model bank;
5) Ontology Model aggregation module, each Seed model in internet news OWL model bank is taken out respectively, do concept with other this volume elements in inverted index table to be polymerized, and amended hot news knowledge model is deposited back internet news OWL model bank, move in circles, until all Seed models are all complete by polymerization;
6) OWL hot spot model statistical module, traversal internet news OWL model bank, the knowledge model that statistical correlation concept is many also sorts, and the knowledge model coming specified quantity is above taken out as hot news knowledge model;
7) hot news frequency statistics module, the instances of ontology corresponding with hot news knowledge model is found by inverted index table, the original article corresponding with these instances of ontology is found again by original document management system, call traditional statistical method, to the frequency statistics that the article at the concept place in these hot news knowledge models is propagated, pays close attention to and commented on.Finally, form hot news analysis result to export.
Below disclose the present invention with preferred embodiment, so it is not intended to limiting the invention, and all employings are equal to replacement or the technical scheme that obtains of equivalent transformation mode, all drop within protection scope of the present invention.

Claims (1)

1., based on mainstream media's social focus analytic system of learning type OWL modeling, it is characterized in that, comprise following functional module:
1) OWL instances of ontology modular converter: the conversion of OWL instances of ontology is done to the magnanimity news information text that search engine obtains; Described OWL is network ontology language;
2) inverted index module: this volume elements inverted index is done to OWL instances of ontology storehouse, described volume elements is undecomposable body minimum in OWL instances of ontology storehouse;
3) this volume elements frequency statistics module: this volume elements frequency statistics is done to this volume elements of OWL inverted index table, and sort according to the size of the frequency;
4) OWL focus Seed model generation module: using this volume elements of specified quantity of coming above as focus Seed model candidate, and by focus Seed model in the mode of knowledge model stored in internet news OWL model bank;
5) Ontology Model aggregation module: each the focus Seed model in internet news OWL model bank is taken out respectively, do concept with other this volume elements in inverted index table to be polymerized, and amended hot news knowledge model is deposited back internet news OWL model bank, move in circles, until all focus Seed models are all complete by polymerization;
6) OWL hot spot model statistical module: traversal internet news OWL model bank, the knowledge model that statistical correlation concept is many also sorts, and the knowledge model coming specified quantity is above taken out as hot news knowledge model;
7) hot news frequency statistics module: find the instances of ontology corresponding with hot news knowledge model by inverted index table, the original article corresponding with instances of ontology is found again by original document management system, and the frequency that original article is propagated, pays close attention to and commented on is added up, form hot news analysis result and export.
CN201110359688.9A 2011-11-14 2011-11-14 Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling Active CN102436497B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110359688.9A CN102436497B (en) 2011-11-14 2011-11-14 Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110359688.9A CN102436497B (en) 2011-11-14 2011-11-14 Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling

Publications (2)

Publication Number Publication Date
CN102436497A CN102436497A (en) 2012-05-02
CN102436497B true CN102436497B (en) 2014-12-31

Family

ID=45984559

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110359688.9A Active CN102436497B (en) 2011-11-14 2011-11-14 Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling

Country Status (1)

Country Link
CN (1) CN102436497B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103455758A (en) * 2013-08-22 2013-12-18 北京奇虎科技有限公司 Method and device for identifying malicious website
CN105335888A (en) * 2014-07-17 2016-02-17 南方科技大学 Market monitoring system and method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101923544A (en) * 2009-06-15 2010-12-22 北京百分通联传媒技术有限公司 Method for monitoring and displaying Internet hot spots
CN102004792A (en) * 2010-12-07 2011-04-06 百度在线网络技术(北京)有限公司 Method and system for generating hot-searching word

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101441561B (en) * 2007-11-23 2012-05-23 国际商业机器公司 Method and device for generating service-oriented architecture strategy based on context model

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101923544A (en) * 2009-06-15 2010-12-22 北京百分通联传媒技术有限公司 Method for monitoring and displaying Internet hot spots
CN102004792A (en) * 2010-12-07 2011-04-06 百度在线网络技术(北京)有限公司 Method and system for generating hot-searching word

Also Published As

Publication number Publication date
CN102436497A (en) 2012-05-02

Similar Documents

Publication Publication Date Title
Zuo et al. Green building evaluation from a life-cycle perspective in Australia: A critical review
CN105260474B (en) A kind of microblog users influence power computational methods based on information exchange network
CN103678670B (en) Micro-blog hot word and hot topic mining system and method
CN103489045B (en) Demand response load optimization potential evaluation method based on multi-scene design
CN106372072A (en) Location-based recognition method for user relations in mobile social network
Zhang et al. A system for tender price evaluation of construction project based on big data
CN105721279A (en) Relationship circle excavation method and system of telecommunication network users
CN106570080A (en) Multilevel semantic matching method for cloud manufacturing resource services
Hong et al. Structural changes and growth factors of the ICT industry in Korea: 1995–2009
Wang et al. Integrated development of digital and energy industries: Paving the way for carbon emission reduction
CN107944755A (en) A kind of business model design method and system calculated based on city
Kolosok et al. Machine analysis of the UK electrical energy initiatives based on the e-petitions to the UK government and parliament
Tao et al. Coupling coordination analysis and Spatiotemporal heterogeneity between data elements and green development in China
CN102436497B (en) Mainstream media report hot-spot analyzing system based on studying type web ontology language (OWL) modeling
Li et al. Digital development influencing mechanism on green innovation performance: a perspective of green innovation network
Tian et al. The effect of resource abundance on Chinese urban green economic growth: A regional heterogeneity perspective
Pan et al. Research on the status of e-commerce development based on big data and Internet technology
Fallah et al. Forward patent citations as predictive measures for diffusion of emerging technologies
CN102004951B (en) Role group dividing method based on role correlation
Li et al. Mapping the scientific structure and evolution of renewable energy for sustainable development
Yoon et al. Ontological functional modeling of technology for reusability
Szczepańczyk Transformation towards circular economy in comparison with eco-innovation development on the example of EU member states
Liang Allocation of multi-dimensional distance learning resource based on MOOC data
Wang et al. Analysis of international competitive situation of key core technology in strategic emerging industries: New generation of information technology industry as an example
Liu et al. Classification of China's county administrative units based on carbon emissions from energy consumption and economic indicators

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C56 Change in the name or address of the patentee
CP01 Change in the name or title of a patent holder

Address after: 210006, 12 floor, Tong Tong Building, 501 South Zhongshan Road, Nanjing, Jiangsu

Patentee after: Jiangsu United Industrial Limited by Share Ltd

Address before: 210006, 12 floor, Tong Tong Building, 501 South Zhongshan Road, Nanjing, Jiangsu

Patentee before: Jiangsu Lianzhu Industrial Co.,Ltd.