CN110674447A - Information importance judging method, device, computer terminal and storage medium - Google Patents

Information importance judging method, device, computer terminal and storage medium Download PDF

Info

Publication number
CN110674447A
CN110674447A CN201910915004.5A CN201910915004A CN110674447A CN 110674447 A CN110674447 A CN 110674447A CN 201910915004 A CN201910915004 A CN 201910915004A CN 110674447 A CN110674447 A CN 110674447A
Authority
CN
China
Prior art keywords
information
analyzed
value
importance
calculating
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910915004.5A
Other languages
Chinese (zh)
Other versions
CN110674447B (en
Inventor
陈烨
谭悦
陈澈
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Yirui Information Technology Co Ltd
Original Assignee
Shanghai Yirui Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Yirui Information Technology Co Ltd filed Critical Shanghai Yirui Information Technology Co Ltd
Priority to CN201910915004.5A priority Critical patent/CN110674447B/en
Publication of CN110674447A publication Critical patent/CN110674447A/en
Application granted granted Critical
Publication of CN110674447B publication Critical patent/CN110674447B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9532Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention discloses a method and a device for judging information importance, a computer terminal and a storage medium, wherein the method comprises the following steps: acquiring related information of information to be analyzed, wherein the related information comprises an information source of the information to be analyzed and an authority value of the information source; calculating the heat value of the information to be analyzed; and calculating the importance value of the information to be analyzed according to the heat value and the authority value. The method for judging the importance of the information can accurately and quickly obtain the calculation result by calculating the importance value of the information to be analyzed through the information source of the information to be analyzed and the authority value of the information source.

Description

Information importance judging method, device, computer terminal and storage medium
Technical Field
The invention relates to the field of big data analysis, in particular to a method and a device for judging information importance, a computer terminal and a storage medium.
Background
With the rapid development of the internet, the information situation on the internet has been deficient from the previous information to the current information flooding, and in the case of such information flooding, various false news events frequently occur. Currently, a manual operation mode is mostly adopted, and the importance of information is judged and maintained subjectively. This method requires expensive operation cost, and when the amount of information increases, it is easy to cause situations such as insufficient coverage and error judgment.
Disclosure of Invention
The present invention is directed to a method, an apparatus, a computer terminal and a storage medium for determining importance of information.
Specifically, the present invention provides a method for determining the importance of information, comprising:
acquiring related information of information to be analyzed, wherein the related information comprises an information source of the information to be analyzed and an authority value of the information source;
calculating the heat value of the information to be analyzed;
and calculating the importance value of the information to be analyzed according to the heat value and the authority value.
As a further improvement of the technical scheme, the heat value of the information to be analyzed is obtained by calculation according to the number of information sources, the browsing amount and the number of comments of the information to be analyzed.
As a further improvement of the technical scheme, the authority value of each information source is obtained by comparing the information generated by history of different information sources with the historical hot information.
As a further improvement of the technical scheme, a crawler crawling technology is adopted to crawl information content and information source information so as to obtain related information of the information to be analyzed.
As a further improvement of the above technical solution, the formula for calculating the importance value of the information to be analyzed is as follows:
P=N*M;
wherein, P is the importance value of the information to be analyzed, N is the heat value of the information to be analyzed, and M is the average value of authority values of all information sources of the information to be analyzed.
As a further improvement of the above technical solution, the information to be analyzed is semantically compared with the information from different sources, and if the semantic similarity exceeds a predetermined threshold, the same information is determined and the number of information sources of the information to be analyzed is accumulated to obtain the number of information sources.
As a general technical concept, the present invention also provides an information importance judging apparatus, comprising:
the information acquisition unit is used for acquiring related information of the information to be analyzed, wherein the related information comprises an information source of the information to be analyzed and an authority value of the information source;
the heat calculating unit is used for calculating the heat value of the information to be analyzed;
and the importance calculating unit is used for calculating the importance value of the information to be analyzed according to the heat value and the authority value.
As a further improvement of the technical scheme, the heat value of the information to be analyzed is obtained by calculation according to the number of information sources, the browsing amount and the number of comments of the information to be analyzed.
As a general technical concept, the present invention also provides a computer terminal, comprising:
a processor and a memory;
the memory is used for storing a computer program, and the processor runs the computer program to enable the computer terminal to execute the information importance judgment method.
As a general technical concept, the present invention also provides a computer-readable storage medium storing a computer program, which when executed by a processor implements the information importance determination method.
Compared with the prior art, the embodiment of the invention provides the information importance judging method, the importance value of the information to be analyzed is calculated through the information source of the information to be analyzed and the authority value of the information source, and the calculation result can be accurately and quickly obtained.
Drawings
In order to more clearly illustrate the technical solution of the present invention, the drawings required to be used in the embodiments will be briefly described below, and it should be understood that the following drawings only illustrate some embodiments of the present invention, and therefore should not be considered as limiting the scope of the present invention. Like components are numbered similarly in the various figures.
FIG. 1 is a flowchart illustrating a method for determining importance of information according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of an information importance determination apparatus according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments.
The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
Hereinafter, the terms "including", "having", and their derivatives, which may be used in various embodiments of the present invention, are only intended to indicate specific features, numbers, steps, operations, elements, components, or combinations of the foregoing, and should not be construed as first excluding the existence of, or adding to, one or more other features, numbers, steps, operations, elements, components, or combinations of the foregoing.
Furthermore, the terms "first," "second," "third," and the like are used solely to distinguish one from another and are not to be construed as indicating or implying relative importance.
Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which various embodiments of the present invention belong. The terms (such as those defined in commonly used dictionaries) should be interpreted as having a meaning that is consistent with their contextual meaning in the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein in various embodiments of the present invention.
As shown in fig. 1, an embodiment of the present invention provides a method for determining importance of information, including:
acquiring related information of information to be analyzed, wherein the related information comprises an information source of the information to be analyzed and an authority value of the information source;
calculating the heat value of the information to be analyzed;
and calculating the importance value of the information to be analyzed according to the heat value and the authority value.
The method for judging the importance of the information calculates the importance value of the information to be analyzed through the information source of the information to be analyzed and the authority value of the information source, can accurately and quickly obtain a calculation result, and is flexible in calculation process and high in feasibility.
Example 1
Specifically, information content and information source information are crawled by adopting a crawler crawling technology to obtain related information of the information to be analyzed, and then a semantic similarity algorithm is adopted to calculate the heat value of the information to be analyzed according to the number of the information sources, the browsing amount and the comment number of the information to be analyzed.
Further, an importance value of the information to be analyzed is calculated according to the heat value of the information and the authority value of the information source of the information. Wherein, the formula for calculating the importance value of the information to be analyzed is as follows:
P=N*M;
wherein, P is the importance value of the information to be analyzed, N is the heat value of the information to be analyzed, and M is the average value of authority values of all information sources of the information to be analyzed.
It should be noted that the authority value of the information source is obtained by comparing the information generated by history of different information sources and the hot information of the history with each other.
Preferably, the authority value is updated by a collaborative filtering algorithm according to the similarity and the heat condition of the information released in the set time, wherein the set time can be one day or half day. The invention does not limit the set time, and the specific time limit is adjusted according to specific conditions. In this embodiment, the set time is one day.
Specifically, in the actual calculation process, the authority value updating steps are exemplified by using a crawler crawling technology to crawl information sources of a civil network, a new wave network and a new Chinese network. The weight data of the acquired information source and the corresponding information are shown in table 1 below.
TABLE 1 weight data Table under information Source and corresponding information
Figure BDA0002215855630000051
Figure BDA0002215855630000061
Firstly, initializing authority values of a civil network, a Sina network and a Xinhua network to be 1, then taking the authority values of all information released on the day of the civil network as a calculation sequence, taking the authority values of all information released on the day of the Sina network as a calculation sequence, and taking the authority values of all information released on the day of the Sina network as a calculation sequence. And selecting the sequence corresponding to the civil network as a reference sequence, calculating the cosine similarity between the calculation sequence corresponding to the new wave network and the reference sequence, and subtracting the cosine similarity calculation result from the initial authority value 1 of the new wave network to obtain the updated authority value of the new wave network. Further, the calculation of the updated authority value of the xinhua network is consistent with the calculation steps, and details are not repeated herein.
It should be noted that, the authority value of the corresponding information source is obtained by the above method, and if there are a plurality of information sources of the information to be analyzed, the average value of the authority values of the information sources is taken as the authority value of the information source of the information to be analyzed.
Further, the information to be analyzed is determined as follows:
"certificate monitoring station 30 pieces of refinancing! The detailed re-financing service has definite capital investment, the same-industry competition standard, the reputation deduction value, the share pledge and the like.
The information source for acquiring the information by adopting the crawler method is the stockdealer China public number, the authority value of the stockdealer China public number is calculated by combining the calculation method of the authority value of the information source, and the importance value of the information is further calculated.
Example 2
In correspondence with the above embodiments, the present embodiment provides an information importance judging device, as shown in fig. 2, including:
the information acquisition unit is used for acquiring related information of the information to be analyzed, and the related information comprises an information source of the information to be analyzed and an authority value of the information source;
a heat calculator for calculating a heat value of the information to be analyzed;
and the importance calculating unit is used for calculating the importance value of the information to be analyzed according to the heat value and the authority value.
By the information importance judging device, the importance value of the information can be automatically calculated in real time without depending on manual operation, and the maintenance cost of the device is reduced.
Example 3
Correspondingly to the above embodiment 1, the present embodiment provides a computer terminal, including:
a processor and a memory;
the memory is used for storing a computer program, and the processor runs the computer program to enable the computer terminal to execute the information importance judging method.
Example 4
The present embodiment provides a computer-readable storage medium, which stores a computer program, wherein the computer program is executed by a processor to implement the method for determining importance of information.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method can be implemented in other ways. The apparatus embodiments described above are merely illustrative and, for example, the flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based devices that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In addition, each functional module or unit in each embodiment of the present invention may be integrated together to form an independent part, or each module may exist separately, or two or more modules may be integrated to form an independent part.
The functions, if implemented in the form of software functional modules and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention or a part of the technical solution that contributes to the prior art in essence can be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a smart phone, a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention.

Claims (10)

1. An information importance determination method, comprising:
acquiring related information of information to be analyzed, wherein the related information comprises an information source of the information to be analyzed and an authority value of the information source;
calculating the heat value of the information to be analyzed;
and calculating the importance value of the information to be analyzed according to the heat value and the authority value.
2. The method as claimed in claim 1, wherein the heat value of the information to be analyzed is calculated according to the number of information sources, the browsing amount and the number of comments of the information to be analyzed.
3. The method of claim 1, wherein the authority value of each of the information sources is obtained by comparing historical generated information and historical trending information of each of the different information sources with each other.
4. The method as claimed in claim 1, wherein the information content and the information source information are crawled by using a crawler crawling technique to obtain the related information of the information to be analyzed.
5. The method of claim 1, wherein the formula for calculating the importance value of the information to be analyzed is as follows:
P=N*M;
wherein, P is the importance value of the information to be analyzed, N is the heat value of the information to be analyzed, and M is the average value of authority values of all information sources of the information to be analyzed.
6. The method as claimed in claim 2, wherein the information to be analyzed is semantically compared with the information from different sources, and if the semantic similarity exceeds a predetermined threshold, the same information is determined and the number of information sources of the information to be analyzed is accumulated to obtain the number of information sources.
7. An information importance judging device, comprising:
the information acquisition unit is used for acquiring related information of the information to be analyzed, wherein the related information comprises an information source of the information to be analyzed and an authority value of the information source;
the heat calculating unit is used for calculating the heat value of the information to be analyzed;
and the importance calculating unit is used for calculating the importance value of the information to be analyzed according to the heat value and the authority value.
8. The apparatus according to claim 7, wherein the heat value of the information to be analyzed is calculated based on the number of information sources, the browsing amount, and the number of comments of the information to be analyzed.
9. A computer terminal, comprising:
a processor and a memory;
the memory is used for storing a computer program, and the processor runs the computer program to enable the computer terminal to execute the information importance judging method according to any one of claims 1 to 6.
10. A computer-readable storage medium storing a computer program which, when executed by a processor, implements the information importance determination method according to any one of claims 1 to 6.
CN201910915004.5A 2019-09-26 2019-09-26 Information importance judging method, device, computer terminal and storage medium Active CN110674447B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910915004.5A CN110674447B (en) 2019-09-26 2019-09-26 Information importance judging method, device, computer terminal and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910915004.5A CN110674447B (en) 2019-09-26 2019-09-26 Information importance judging method, device, computer terminal and storage medium

Publications (2)

Publication Number Publication Date
CN110674447A true CN110674447A (en) 2020-01-10
CN110674447B CN110674447B (en) 2022-07-29

Family

ID=69079042

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910915004.5A Active CN110674447B (en) 2019-09-26 2019-09-26 Information importance judging method, device, computer terminal and storage medium

Country Status (1)

Country Link
CN (1) CN110674447B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104657496A (en) * 2015-03-09 2015-05-27 杭州朗和科技有限公司 Method and equipment for calculating information hot value
CN105224608A (en) * 2015-09-06 2016-01-06 华南理工大学 The hot news Forecasting Methodology analyzed based on microblog data and system
CN106294334A (en) * 2015-05-11 2017-01-04 国家计算机网络与信息安全管理中心 The computational methods of a kind of microblogging public sentiment index system and device
CN107705005A (en) * 2017-09-27 2018-02-16 吴殿义 A kind of movie and television contents Valuation Method
CN107800888A (en) * 2017-11-23 2018-03-13 北京麒麟合盛网络技术有限公司 Method for information display and device
CN108241727A (en) * 2017-09-01 2018-07-03 新华智云科技有限公司 News reliability evaluation method and equipment
CN109299884A (en) * 2018-10-19 2019-02-01 北京网智天元大数据科技有限公司 A kind of influence power appraisal procedure and assessment device
CN110020876A (en) * 2018-01-08 2019-07-16 北京京东尚科信息技术有限公司 A kind of information generating method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104657496A (en) * 2015-03-09 2015-05-27 杭州朗和科技有限公司 Method and equipment for calculating information hot value
CN106294334A (en) * 2015-05-11 2017-01-04 国家计算机网络与信息安全管理中心 The computational methods of a kind of microblogging public sentiment index system and device
CN105224608A (en) * 2015-09-06 2016-01-06 华南理工大学 The hot news Forecasting Methodology analyzed based on microblog data and system
CN108241727A (en) * 2017-09-01 2018-07-03 新华智云科技有限公司 News reliability evaluation method and equipment
CN107705005A (en) * 2017-09-27 2018-02-16 吴殿义 A kind of movie and television contents Valuation Method
CN107800888A (en) * 2017-11-23 2018-03-13 北京麒麟合盛网络技术有限公司 Method for information display and device
CN110020876A (en) * 2018-01-08 2019-07-16 北京京东尚科信息技术有限公司 A kind of information generating method and device
CN109299884A (en) * 2018-10-19 2019-02-01 北京网智天元大数据科技有限公司 A kind of influence power appraisal procedure and assessment device

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
于广川 等: ""融合语境分析的时序推特摘要方法"", 《软件学报》 *
蒋盛益 等: ""微博信息可信度分析研究综述"", 《图书情报工作》 *
赵龙文 等: "基于意见领袖参与行为的微博话题热度预测研究", 《情报杂志》 *

Also Published As

Publication number Publication date
CN110674447B (en) 2022-07-29

Similar Documents

Publication Publication Date Title
CN111026570B (en) Method and device for determining abnormal reason of business system
CN106657057B (en) Anti-crawler system and method
CN106682906B (en) Risk identification and service processing method and equipment
CN106874165B (en) Webpage detection method and device
CN108228722B (en) Method for detecting geographic space distribution uniformity of sampling points in crushing area
US10210214B2 (en) Scalable trend detection in a personalized search context
CN106776609B (en) Statistical method and device for website reprint quantity
CN111159697B (en) Key detection method and device and electronic equipment
CN105404631B (en) Picture identification method and device
CN113507455B (en) Network security detection method and system based on big data
CN107944032B (en) Method and apparatus for generating information
CN109828780B (en) Open source software identification method and device
CN113992340B (en) User abnormal behavior identification method, device, equipment and storage medium
CN109729069B (en) Abnormal IP address detection method and device and electronic equipment
CN108366274B (en) Method and device for detecting brushing playing amount
CN105138245A (en) Deduplication processing method and device for screenshot pictures of intelligent terminal
CN111046087A (en) Data processing method, device, equipment and storage medium
CN108090364B (en) Method and system for positioning data leakage source
CN108804501B (en) Method and device for detecting effective information
CN110674447B (en) Information importance judging method, device, computer terminal and storage medium
US10984105B2 (en) Using a machine learning model in quantized steps for malware detection
CN108804917B (en) File detection method and device, electronic equipment and storage medium
CN110852893A (en) Risk identification method, system, equipment and storage medium based on mass data
CN105740666A (en) Method and device for identifying on-line operational risk
CN112580027A (en) Malicious sample determination method and device, storage medium and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 200050 7th floor, Oriental Pearl TV Tower triumph center, Lane 1522, Kaixuan Road, Changning District, Shanghai

Applicant after: Hubo network technology (Shanghai) Co.,Ltd.

Address before: 200050 Room 802, building e, No. 753, Yuyuan Road, Changning District, Shanghai

Applicant before: Shanghai Yirui Information Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant