WO2023273295A1 - Enterprise esg index determination method based on clustering technology, and related product - Google Patents
Enterprise esg index determination method based on clustering technology, and related product Download PDFInfo
- Publication number
- WO2023273295A1 WO2023273295A1 PCT/CN2022/071253 CN2022071253W WO2023273295A1 WO 2023273295 A1 WO2023273295 A1 WO 2023273295A1 CN 2022071253 W CN2022071253 W CN 2022071253W WO 2023273295 A1 WO2023273295 A1 WO 2023273295A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- news
- group
- newsgroups
- original
- enterprise
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 44
- 238000005516 engineering process Methods 0.000 title claims abstract description 15
- 238000011156 evaluation Methods 0.000 claims abstract description 25
- 239000013598 vector Substances 0.000 claims description 31
- 230000008451 emotion Effects 0.000 claims description 29
- 230000002996 emotional effect Effects 0.000 claims description 28
- 230000008909 emotion recognition Effects 0.000 claims description 22
- 238000012545 processing Methods 0.000 claims description 19
- 238000004590 computer program Methods 0.000 claims description 16
- 238000013507 mapping Methods 0.000 claims description 9
- 238000000605 extraction Methods 0.000 claims description 5
- 238000011161 development Methods 0.000 description 11
- 230000006399 behavior Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 238000013473 artificial intelligence Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 1
- 238000005265 energy consumption Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Definitions
- This application relates to the technical field of data processing, in particular to a method for determining an enterprise ESG index based on clustering technology and related products.
- a company's ESG score is a comprehensive score for the company's environment (E), society (S), and governance (G).
- International and domestic companies have accumulated some successful experience in scoring ESG performance.
- Internationally renowned rating agencies such as MSCI and FTSE have established their own scoring standards and conducted ESG evaluations on internationally renowned companies.
- ESG scores for companies are mainly based on the number of news related to the company. However, there is a lot of noise in the news obtained. The accuracy of ESG scores is low, resulting in low accuracy of investment decisions based on ESG scores.
- the embodiment of this application provides a method for determining an enterprise ESG index based on clustering technology and related products, distinguishing original news and reprinted news in multiple news articles of a news event, improving the accuracy of ESG scoring by enterprises, and formulating Accuracy of investment decisions.
- the embodiment of the present application provides a method for determining an enterprise ESG index based on clustering technology, including:
- M pieces of news about the enterprise to be evaluated within a preset time period where M is an integer greater than or equal to 1;
- each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;
- the target public opinion score of the news event corresponding to each of the first newsgroups is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
- ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
- an enterprise ESG index determination device including:
- the obtaining unit is used to obtain M pieces of news of the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;
- a processing unit configured to cluster the M pieces of news to obtain K first news groups, wherein each first news group in the K first news groups corresponds to a news event, and each of the K first news groups corresponds to a news event.
- the first news group includes one or more news articles in the M news articles, K is an integer greater than or equal to 1;
- the target public opinion score of the news event corresponding to each of the first newsgroups is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
- ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
- the sending unit is configured to send the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
- an embodiment of the present application provides an electronic device, including: a processor, the processor is connected to a memory, the memory is used to store a computer program, and the processor is used to execute the computer program stored in the memory , so that the electronic device executes the method as described in the first aspect, the method includes:
- M pieces of news about the enterprise to be evaluated within a preset time period where M is an integer greater than or equal to 1;
- each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;
- the target public opinion score of the news event corresponding to each of the first newsgroups is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
- ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
- an embodiment of the present application provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and the computer program causes a computer to execute the method as described in the first aspect, the method comprising:
- M pieces of news about the enterprise to be evaluated within a preset time period where M is an integer greater than or equal to 1;
- each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;
- the target public opinion score of the news event corresponding to each of the first newsgroups is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
- ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
- the ESG index with high precision can be sent to the target device, so as to improve the accuracy of the investment decision made.
- FIG. 1 is a schematic flowchart of a method for determining an enterprise ESG index based on clustering technology provided by an embodiment of the present application;
- Fig. 2 is a block diagram of functional units of an enterprise ESG index determination device provided by the embodiment of the present application;
- FIG. 3 is a schematic structural diagram of a device for determining an enterprise ESG index provided by an embodiment of the present application.
- AI artificial intelligence
- This application may relate to the field of artificial intelligence technology, such as acquiring and processing relevant data based on artificial intelligence technology.
- artificial intelligence is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. .
- the corporate ESG index or ESG index involved in the embodiment of this application is the corporate ESG score, which can also be called ESG score. They are essentially the same and will not be distinguished later.
- the ESG evaluation of enterprises, or the ESG scoring of enterprises are essentially the same, and they are all used to determine the ESG index of enterprises.
- Fig. 1 is a clustering technology-based ESG index method for enterprises provided by the embodiment of the present application. This method is applied to the enterprise ESG index determination device. The method includes the following steps:
- the enterprise ESG index determination device can obtain W articles of news within a preset time period from multiple news media platforms through crawler technology, and identify the W articles of news (for example, text classification can be performed on W articles of news, ) Select N news related to ESG evaluation from W news. For example, if a certain news discloses the energy consumption of a certain company, this news is regarded as news related to ESG index.
- the preset time period may be any historical time period, for example, it may be the last ten days, last month, or last year, and so on.
- This application does not limit the preset time period. That is to say, the enterprise ESG index determination method of this application can evaluate the ESG of the enterprise based on the news in the most recent time period, and obtain the ESG index of the enterprise in the most recent time period; News conducts ESG evaluations on companies and obtains the ESG index of companies in a certain historical period.
- entity recognition can be performed on each of the N news articles to obtain the entity words in each news article, and determine the company involved in each news article according to the entity words in each news article, wherein the entity words can be is the name of the enterprise (for example, Chinese name, English name), the products of the enterprise, the location of the enterprise, the ranking of the enterprise, etc.; All companies involved in this news.
- the number of news related to each company in all companies involved in N news can be obtained, that is, M news related to the company to be evaluated is obtained, and M is less than or equal to N, wherein, the enterprise to be evaluated is any enterprise among all enterprises.
- each first news group in the K first news groups corresponds to a news event, and each first news group A group includes one or more news articles in the M news articles, K is an integer greater than or equal to 1.
- the device for determining the ESG index of an enterprise extracts semantic information from each of the M pieces of news to obtain a semantic vector corresponding to each piece of news, wherein the semantic vector corresponding to each piece of news is used to represent the information contained in each piece of news Describe the news event.
- Cluster M news based on the semantic vector of each news to obtain K first news groups, that is, determine the Euclidean distance between the semantic vectors of any two news in M news, and combine the semantic vectors of any two news The Euclidean distance between them is used as the first similarity between any two news articles; then, the two news articles with the first similarity greater than the first threshold are classified into one category, that is, they are classified into a first news group, Get the K first newsgroups. Since the first similarity between the semantic vectors of the two news is greater than the first threshold, it means that the two news describe a news event, so one or more news descriptions included in each first news group It's the same news event.
- the enterprise ESG index determination device determines the second similarity between any two news articles in each first news group, and classifies the news with the second similarity greater than the second threshold into one category, that is, classifies them into one In the second newsgroup, get L second newsgroups.
- the second similarity between any two news articles in each first news group can be the similarity between the news content of the two news articles, for example, the headline and summary of the two news articles can be calculated.
- the similarity between the title and the abstract is used as the similarity between the two news, if the similarity is greater than the second threshold, it means that the two news are the same news (that is, one of the two news One article is the original news, and the other is the news that reprinted the original news; or, both news are reprinted news).
- a piece of original news and all reprinted news that reprinted this original news can be clustered into a second news group.
- the contents of the two original news articles are different, the two original news articles will be clustered into two different second newsgroups. Therefore, through the above-mentioned clustering method, all original news can be separated, and reprinted news corresponding to each original news can be clustered together to obtain L second news groups.
- each second news group it must contain an original news and H reprinted news.
- H is greater than or equal to 1
- the publication time must be the earliest, so the news with the earliest publication time in each second news group can be used as the original news in each second news group, and the rest of the news is the first news of each second news group. Reprinted news in the group.
- the target public opinion score is used to represent the influence of the news event corresponding to each first news group on the enterprise to be evaluated.
- Exemplary first determine the scaling ratio of each first news group from the news quantity, then determine the original public opinion score of each first news group from the news content; finally, the scaling ratio of each first news group and The original public opinion scores are fused to obtain the target public opinion scores of each first news group.
- the scaling ratio of each first news group represents the social attention to the news event corresponding to each first news group
- the target public opinion score is used to represent the news event corresponding to each first news group.
- Influence refers to the degree of influence on the enterprise being evaluated.
- a preset ratio of original news in each second news group is acquired, wherein the preset ratio represents the contribution of original news to social attention to news events corresponding to each first news group.
- original news is independently created by journalists and has nothing to do with troll hype. Therefore, all original news under a news event should attract the same attention to this news event, that is to say, every The original news is made in terms of the social attention received by the news event. Therefore, a ratio is preset for each original news event, so that the contribution of original news in terms of attention is not affected by the number of reprinted news; according to the preset ratio of original news in each second news group and each The number H of reposted news contained in the second newsgroup determines the scaling ratio of each second newsgroup.
- the scaling ratio of each second newsgroup can be expressed by formula (1):
- heat j is the zoom ratio of the jth second newsgroup in the L second newsgroups
- H j is the number of reprinted news contained in the jth second newsgroup
- a is each second newsgroup
- the preset ratio of original news in generally takes a value of 1.
- the scaling ratio of each second newsgroup can also be expressed by formula (2):
- heat is the zoom ratio of each first newsgroup.
- a group of second news groups in determining the public opinion score of the news event corresponding to each first news group, can be randomly selected from the L second news groups corresponding to each first news group , and carry out emotion recognition on any piece of news in the second newsgroup to obtain the emotion label corresponding to the news, and use the emotion label of the news as the emotion label corresponding to each first newsgroup; or, from A piece of news is randomly selected from all the news corresponding to each first news group for emotion recognition, and the emotional label of the news is obtained, and the emotional label of the news is used as the emotional label corresponding to each first news group. Because the news events described by all the news under each first news group are the same.
- the mode of adding emotional tags for each first newsgroup is more flexible, and does not limit the mode of adding emotional tags for each first newsgroup.
- the sentiment tag corresponding to each first news group is used to represent whether the news event corresponding to each first news group is a positive news event or a negative news event.
- the specific content of the negative news event can be determined, for example, it can be the amount of fines, business interruption, high-level criminal detention or prohibition of government Purchasing, etc.;
- the sentiment tag is used to represent that the news event described by each first news group is a positive news event, determine the specific content of the positive news event, for example, it can be effective for energy saving and emission reduction.
- the publishing media of all news in the L second newsgroups for example, carry out text recognition to the news content of each news in the L second newsgroups, identify the content of each news from each news content Publishing media, for example, the publishing media of some news is located at the end of the news content, and the publishing media of some news is located at the very beginning of the news content. Therefore, carry out text recognition to the news content of each news, can obtain the publishing media of each news; Publish the media.
- the priority relationship of publishing media is preset in the enterprise ESG index determining device. Generally speaking, the priorities are as follows: national-level publishing media > national-level publishing media > municipal-level publishing media > self-media > entertainment media, and so on. Therefore, after identifying the publishing media of each news article in the L second newsgroups, the highest-level publishing media in each first newsgroup is determined according to the preset priority relationship.
- the mapping relationship between publishing media, emotional labels and public opinion scores can be as follows: if the news event is characterized as a neutral news event (that is, there is no criticism or praise), then set the public opinion score to 0; if the news event represents When it is a negative news event, points will be deducted from 0 to -10, and the degree of deduction is related to the specific content of the negative news event and the level of the publishing media. For example, the news event corresponding to each first news group is If executives are detained and the highest-level publishing media is a national-level publishing media, 10 points will be deducted, that is, the original public opinion score corresponding to each first news group is -10 points; if the news event is characterized as a positive news event, Add from 0 to 5.
- the degree of deduction is related to the specific content of the negative news event and the level of the publishing media; 2 points are added to the publishing media of the first level, that is, the original public opinion score corresponding to the news event is 2 points. That is to say, the mapping relationship between various types of news events, various publishing levels and public opinion scores is pre-configured. After the highest-level publishing media and the type of news events of each first news group are determined, the original public opinion score corresponding to each first news group can be determined according to the mapping relationship.
- the original public opinion score of each first news group is multiplied by the scaling ratio to obtain the target public opinion score corresponding to each first news group.
- the target public opinion score corresponding to each first news group can be expressed by formula (4):
- adjscoreEvent is the target public opinion score corresponding to each first newsgroup
- adjscore is the original public opinion score of each first newsgroup.
- the scaling ratio heat plays the role of an amplifier in formula (4), that is, the original public opinion score of each news event is enlarged or reduced. For example, if the scaling ratio is 1.5, the original public opinion score is enlarged by 1.5 times. Since the scaling ratio itself reflects the social attention to the news event, through the multiplicative fusion in formula (4), negative public opinion can be deducted more points, and positive public opinion can be added more points, so that the final calculated The target public opinion score is more reasonable and accurate.
- the target public opinion score corresponding to each first news group may be used as an ESG index when ESG evaluation is performed on the enterprise to be evaluated from the news dimension.
- the decisions related to the enterprise to be evaluated are also different.
- the ESG index of this application has the following application scenarios:
- Scenario 1 When the target equipment is the equipment of an investment institution, and the ESG index of the enterprise to be evaluated is sent to the investment institution, the investment institution can make a decision related to the enterprise to be evaluated: investment related to the enterprise to be evaluated decision making. For example, since the ESG index of an enterprise reflects the value and sustainable development capability of the enterprise, when the ESG index of the enterprise to be evaluated is high, the investment decision can be to increase the investment amount and investment cycle of the enterprise to be evaluated ; When the ESG index of the enterprise to be evaluated is low, the investment can definitely divest the investment in the enterprise to be evaluated or reduce the investment in the enterprise to be evaluated, and so on. In general, sending the ESG index of the enterprise to be evaluated to the investment institution can provide direction guidance for the investment decision-making of the investment institution and reduce investment risk.
- Scenario 2 When the target equipment is the equipment of the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is sent to the enterprise to be evaluated, the enterprise to be evaluated can formulate a decision related to the enterprise to be evaluated as follows: management decisions. For example, since the ESG index of a company reflects the value and sustainable development capabilities of the company, as investors' acceptance of the ESG index deepens and they attach importance to corporate social responsibility, they are more willing to invest in higher ESG indexes. enterprise.
- the management decision makes a decision to strengthen corporate management for the enterprise to be evaluated, and continue to maintain excellent performance in ESG; when the ESG index of the enterprise to be evaluated is low, the management decision The decision is to adjust the company's development strategy, improve the company's sustainable development, and improve the ESG index.
- sending the ESG index of the companies to be evaluated to the companies to be evaluated will help promote the companies to be evaluated to improve their own ESG evaluation and guide the healthy development of the companies to be evaluated.
- Scenario 3 When the target equipment is the equipment of the government or social organization, the ESG index of the enterprise to be evaluated is sent to the government or social organization, then the government or social organization can make a decision related to the enterprise to be evaluated. Evaluate enterprise-related support decisions. Exemplarily, the ESG index of an enterprise reflects the value and sustainable development capability of the enterprise.
- the ESG index of the enterprise to be evaluated when the ESG index of the enterprise to be evaluated is high, it indicates that the development potential of the enterprise to be evaluated is relatively large, and the support decision can greatly promote the enterprise to be evaluated so as to provide more development opportunities for such enterprises;
- the ESG index of the enterprise to be evaluated is low, it indicates that the development potential of the enterprise to be evaluated is low, and the support decision can be to order such an enterprise to adjust the company's development strategy, or reduce support to guide the enterprise to be evaluated to a healthy development direction Adjustment.
- FIG. 2 is a block diagram of functional units of an enterprise ESG index determination device provided in the embodiment of the present application.
- the enterprise ESG index determination device 200 includes: an acquisition unit 201, a processing unit 202 and a sending unit 203, wherein:
- Acquisition unit 201 is used to obtain M pieces of news of the enterprise to be evaluated within the preset time period, wherein M is an integer greater than or equal to 1;
- the processing unit 202 is configured to cluster the M pieces of news to obtain K first news groups, wherein each first news group in the K first news groups corresponds to a news event, and each of the K first news groups corresponds to a news event.
- a first news group includes one or more news articles in the M news articles, K is an integer greater than or equal to 1;
- the target public opinion score of the news event corresponding to each of the first newsgroups is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
- ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
- the sending unit 203 is configured to send the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
- each of the first news groups corresponds to In terms of the target public opinion scoring of news events, the processing unit 202 is specifically used for:
- the scaling ratio of the news event corresponding to each first news group represents the degree of attention to the news event corresponding to each first news group
- the news content of an original news and the news content of H reprinted news included in each second news group in the L second news groups determine the original news event corresponding to each of the first news groups Score of public opinion;
- the processing unit 202 is specifically used for:
- the scaling ratios of the L second newsgroups are summed to obtain the scaling ratios of the news events corresponding to each of the first newsgroups.
- the processing unit 202 is specifically used for:
- mapping relationship between published media, emotional tags and public opinion scores, and the highest-level published media in each first news group and the corresponding emotional tags of each first news group determine each The original public opinion score of the news event corresponding to the first news group.
- the processing unit 202 in terms of performing emotion recognition on the news in each of the first newsgroups to obtain the emotion tags corresponding to each of the first newsgroups, is specifically configured to:
- the label is used as the sentiment label corresponding to each of the first newsgroups
- the label serves as the sentiment label corresponding to each first newsgroup.
- the processing unit 202 is specifically configured to:
- the M news are clustered to obtain K first news groups, wherein, among the K first news groups
- the first similarity between the semantic vectors of any two news articles in each first news group is greater than the first threshold.
- the processing unit 202 is specifically used for:
- one or more news articles included in each first news group are clustered to obtain L second news groups, wherein, the second similarity between any two news articles in each second news group is greater than a second threshold.
- FIG. 3 is a schematic structural diagram of an electronic device provided in an embodiment of the present application.
- an electronic device 300 includes a transceiver 301 , a processor 302 and a memory 303 . They are connected through a bus 304 .
- the memory 303 is used to store computer programs and data, and can transmit the data stored in the memory 303 to the processor 302 .
- the processor 302 is used to read the computer program in the memory 303 to perform the following operations:
- each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;
- the target public opinion score of the news event corresponding to each of the first newsgroups is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
- ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
- the control transceiver 301 sends the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device can make an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
- each of the first news groups corresponds to In terms of the target public opinion scoring of news events, the processor 302 is specifically used for:
- the scaling ratio of the news event corresponding to each first news group represents the degree of attention to the news event corresponding to each first news group
- the news content of an original news and the news content of H reprinted news included in each second news group in the L second news groups determine the original news event corresponding to each of the first news groups Score of public opinion;
- the processor 302 is specifically configured to perform the following operations:
- the scaling ratios of the L second newsgroups are summed to obtain the scaling ratios of the news events corresponding to each of the first newsgroups.
- the processor 302 is specifically configured to perform the following operations:
- mapping relationship between published media, emotional tags and public opinion scores, and the highest-level published media in each first news group and the corresponding emotional tags of each first news group determine each The original public opinion score of the news event corresponding to the first news group.
- the processor 302 in terms of performing emotion recognition on the news in each of the first newsgroups to obtain the emotion tags corresponding to each of the first newsgroups, the processor 302 is specifically configured to perform the following operate:
- the label is used as the sentiment label corresponding to each of the first newsgroups
- the label serves as the sentiment label corresponding to each first news group.
- the processor 302 is specifically configured to perform the following operations:
- the M news are clustered to obtain K first news groups, wherein, among the K first news groups
- the first similarity between the semantic vectors of any two news articles in each first news group is greater than the first threshold.
- the processor 302 is specifically configured to perform the following operations:
- one or more news articles included in each first news group are clustered to obtain L second news groups, wherein, the second similarity between any two news articles in each second news group is greater than a second threshold.
- the above-mentioned transceiver 301 can be the acquisition unit 201 and the sending unit 203 of the enterprise ESG index determination device 200 of the embodiment shown in FIG. 2
- the above-mentioned processor 302 can be the enterprise ESG index determination The processing unit 202 of the device 200 .
- the embodiment of the present application also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement any clustering-based Some or all of the steps in the corporate ESG index determination methodology for technology.
- the storage medium involved in this application such as a computer-readable storage medium, may be non-volatile or volatile.
- the embodiment of the present application also provides a computer program product, the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to enable the computer to execute the method described in the above method embodiments Part or all of the steps of any method for determining an enterprise ESG index based on clustering technology.
- the disclosed device can be implemented in other ways.
- the device embodiments described above are only illustrative.
- the division of the units is only a logical function division. In actual implementation, there may be other division methods.
- multiple units or components can be combined or can be Integrate into another system, or some features may be ignored, or not implemented.
- the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical or other forms.
- the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
- each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
- the above-mentioned integrated units can be implemented not only in the form of hardware, but also in the form of software program modules.
- the integrated units may be stored in a computer-readable memory if implemented in the form of a software program module and sold or used as an independent product.
- the technical solution of the present application is essentially or part of the contribution to the prior art, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a memory.
- Several instructions are included to make a computer device (which may be a personal computer, server or network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application.
- the aforementioned memory includes: various media that can store program codes such as U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Provided are an enterprise ESG index determination method based on a clustering technology, and a related product. The method comprises: acquiring M pieces of news, which are within a preset time period, of an enterprise to be evaluated; clustering the M pieces of news to obtain K first news groups; clustering one or more pieces of news in each first news group, so as to obtain L second news groups corresponding to each first news group; according to a piece of original news and H pieces of reprinted news, which are comprised in each second news group of the L second news groups, determining a target public opinion score of a news event corresponding to each first news group, wherein the target public opinion score is used for representing the influence of the news event corresponding to each first news group on said enterprise (104); and performing ESG evaluation on said enterprise according to the target public opinion score of the news event corresponding to each first news group, so as to obtain an ESG index of said enterprise (105).
Description
本申请要求于2021年7月2日提交中国专利局、申请号为202110748931.X,发明名称为“基于聚类技术的企业ESG指数确定方法及相关产品”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202110748931.X filed on July 2, 2021, and the title of the invention is "Method for Determining Enterprise ESG Index Based on Clustering Technology and Related Products", all of which The contents are incorporated by reference in this application.
本申请涉及数据处理技术领域,具体涉及一种基于聚类技术的企业ESG指数确定方法及相关产品。This application relates to the technical field of data processing, in particular to a method for determining an enterprise ESG index based on clustering technology and related products.
企业的ESG评分是对企业的环境(E)、社会(S)、治理(G)方面的综合评分。国际和国内在对企业ESG表现进行评分方面已经积累了一些成功的经验,国际上知名的评级机构比如MSCI、FTSE等都建立了各自的评分标准,并对国际上知名的企业进行了ESG评价。随着国际社会,比如各类投资机构和政府对企业责任的重视,特别是最近随着全球气候变化合作的进展,以及中国在国际上做出的到2030年实现碳达峰,2060年实现碳中和的承诺,中国境内投资者和政府对企业在ESG方面的评分也有大量的需求。A company's ESG score is a comprehensive score for the company's environment (E), society (S), and governance (G). International and domestic companies have accumulated some successful experience in scoring ESG performance. Internationally renowned rating agencies such as MSCI and FTSE have established their own scoring standards and conducted ESG evaluations on internationally renowned companies. As the international community, such as various investment institutions and governments, attach importance to corporate responsibility, especially with the recent progress of global climate change cooperation, and China's international commitment to achieve carbon peaking by 2030 and carbon emissions by 2060 China's domestic investors and the government also have a lot of demand for companies' ESG scores.
发明人意识到,在评价企业的ESG表现时,需要用到与企业相关的新闻来对企业做ESG评分。目前主要通过与企业相关的新闻的数量对企业进行ESG评分,然而,在获取到的新闻中,存在很多噪音,比如,某个新闻由于炒作出现了多次转载,因此单纯使用新闻数量对企业进行ESG评分的精度较低,导致基于ESG评分制定出的投资决策精度比较低。The inventor realized that when evaluating the ESG performance of an enterprise, it is necessary to use news related to the enterprise to score the enterprise on ESG. At present, ESG scores for companies are mainly based on the number of news related to the company. However, there is a lot of noise in the news obtained. The accuracy of ESG scores is low, resulting in low accuracy of investment decisions based on ESG scores.
发明内容Contents of the invention
本申请实施例提供了一种基于聚类技术的企业ESG指数确定方法及相关产品,区分一个新闻事件的多篇新闻中的原创新闻和转载新闻,提高企业进行ESG评分的精度,以及制定出的投资决策的精度。The embodiment of this application provides a method for determining an enterprise ESG index based on clustering technology and related products, distinguishing original news and reprinted news in multiple news articles of a news event, improving the accuracy of ESG scoring by enterprises, and formulating Accuracy of investment decisions.
第一方面,本申请实施例提供一种基于聚类技术的企业ESG指数确定方法,包括:In the first aspect, the embodiment of the present application provides a method for determining an enterprise ESG index based on clustering technology, including:
获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;Obtain M pieces of news about the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;
对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;Clustering the M pieces of news to obtain K first newsgroups, wherein each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;
对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。Sending the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
第二方面,本申请实施例提供一种企业ESG指数确定装置,包括:In the second aspect, the embodiment of the present application provides an enterprise ESG index determination device, including:
获取单元,用于获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;The obtaining unit is used to obtain M pieces of news of the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;
处理单元,用于对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;A processing unit, configured to cluster the M pieces of news to obtain K first news groups, wherein each first news group in the K first news groups corresponds to a news event, and each of the K first news groups corresponds to a news event. The first news group includes one or more news articles in the M news articles, K is an integer greater than or equal to 1;
对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应 的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
发送单元,用于向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。The sending unit is configured to send the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
第三方面,本申请实施例提供一种电子设备,包括:处理器,所述处理器与存储器相连,所述存储器用于存储计算机程序,所述处理器用于执行所述存储器中存储的计算机程序,以使得所述电子设备执行如第一方面所述的方法,该方法包括:In a third aspect, an embodiment of the present application provides an electronic device, including: a processor, the processor is connected to a memory, the memory is used to store a computer program, and the processor is used to execute the computer program stored in the memory , so that the electronic device executes the method as described in the first aspect, the method includes:
获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;Obtain M pieces of news about the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;
对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;Clustering the M pieces of news to obtain K first newsgroups, wherein each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;
对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。Sending the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
第四方面,本申请实施例提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序使得计算机执行如第一方面所述的方法,该方法包括:In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and the computer program causes a computer to execute the method as described in the first aspect, the method comprising:
获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;Obtain M pieces of news about the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;
对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;Clustering the M pieces of news to obtain K first newsgroups, wherein each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;
对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。Sending the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
在本申请实施方式中,可以向目标设备发送精度较高的ESG指数,提高制定出的投资决策的准确度。In the implementation manner of the present application, the ESG index with high precision can be sent to the target device, so as to improve the accuracy of the investment decision made.
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the following will briefly introduce the drawings that need to be used in the description of the embodiments. Obviously, the drawings in the following description are some embodiments of the present application. For Those of ordinary skill in the art can also obtain other drawings based on these drawings without making creative efforts.
图1为本申请实施例提供的一种基于聚类技术的企业ESG指数确定方法的流程示意图;FIG. 1 is a schematic flowchart of a method for determining an enterprise ESG index based on clustering technology provided by an embodiment of the present application;
图2为本申请实施例提供的一种企业ESG指数确定装置的功能单元组成框图;Fig. 2 is a block diagram of functional units of an enterprise ESG index determination device provided by the embodiment of the present application;
图3为本申请实施例提供的一种企业ESG指数确定装置的结构示意图。FIG. 3 is a schematic structural diagram of a device for determining an enterprise ESG index provided by an embodiment of the present application.
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.
本申请的说明书和权利要求书及所述附图中的术语“第一”、“第二”、“第三”和“第四”等是用于区别不同对象,而不是用于描述特定顺序。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second", "third" and "fourth" in the specification and claims of the present application and the drawings are used to distinguish different objects, rather than to describe a specific order . Furthermore, the terms "include" and "have", as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method, system, product or device comprising a series of steps or units is not limited to the listed steps or units, but optionally also includes unlisted steps or units, or optionally further includes For other steps or units inherent in these processes, methods, products or apparatuses.
在本文中提及“实施例”意味着,结合实施例描述的特定特征、结果或特性可以包含在本申请的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。Reference herein to an "embodiment" means that a particular feature, result, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present application. The occurrences of this phrase in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. It is understood explicitly and implicitly by those skilled in the art that the embodiments described herein can be combined with other embodiments.
本申请可涉及人工智能技术领域,如可以基于人工智能技术对相关的数据进行获取和处理。其中,人工智能(Artificial Intelligence,AI)是利用数字计算机或者数字计算机控制的机器模拟、延伸和扩展人的智能,感知环境、获取知识并使用知识获得最佳结果的理论、方法、技术及应用系统。This application may relate to the field of artificial intelligence technology, such as acquiring and processing relevant data based on artificial intelligence technology. Among them, artificial intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. .
首先说明,本申请实施例所涉及的企业ESG指数或者ESG指数,即是企业ESG评分,也可以称为ESG评分,它们在本质上是一样的,后面不用再区分。相应的,对企业进行ESG评价,或者对企业进行ESG评分,它们在本质上也都是一样的,都是用于确定企业的ESG指数。First of all, the corporate ESG index or ESG index involved in the embodiment of this application is the corporate ESG score, which can also be called ESG score. They are essentially the same and will not be distinguished later. Correspondingly, the ESG evaluation of enterprises, or the ESG scoring of enterprises, are essentially the same, and they are all used to determine the ESG index of enterprises.
参阅图1,图1为本申请实施例提供的一种基于聚类技术的对企业ESG指数方法。该方法应用于企业ESG指数确定装置。该方法包括以下步骤内容:Referring to Fig. 1, Fig. 1 is a clustering technology-based ESG index method for enterprises provided by the embodiment of the present application. This method is applied to the enterprise ESG index determination device. The method includes the following steps:
101:获取待评价企业在预设时间段内的M篇新闻,其中,所述M篇新闻用于对所述进行ESG评价,M为大于或者等于1的整数。101: Obtain M pieces of news about the enterprise to be evaluated within a preset period of time, wherein the M pieces of news are used for ESG evaluation, and M is an integer greater than or equal to 1.
示例性的,企业ESG指数确定装置可以通过爬虫技术从多个新闻媒体平台中获取预设时间段内的W篇新闻,对该W篇新闻进行识别(比如,可以对W篇新闻进行文本分类,)从W篇新闻中筛选出与ESG评价相关的N篇新闻,例如,某篇新闻中披露了某企业的能源消耗情况,则将该篇新闻作为与ESG指数相关的新闻。Exemplarily, the enterprise ESG index determination device can obtain W articles of news within a preset time period from multiple news media platforms through crawler technology, and identify the W articles of news (for example, text classification can be performed on W articles of news, ) Select N news related to ESG evaluation from W news. For example, if a certain news discloses the energy consumption of a certain company, this news is regarded as news related to ESG index.
其中,该预设时间段可以为任意一个历史时间段,比如,可以为近十天、上个月、或者去年,等等。本申请不对预设时间段进行限定。也就是说,本申请的企业ESG指数确定方法可以基于最近时间段内的新闻对企业进行ESG评价,得到企业在最近时间段内的ESG指数;也可以回溯历史,基于某个历史时间段内的新闻对企业进行ESG评价,得到企业在某个历史时间段内的ESG指数。Wherein, the preset time period may be any historical time period, for example, it may be the last ten days, last month, or last year, and so on. This application does not limit the preset time period. That is to say, the enterprise ESG index determination method of this application can evaluate the ESG of the enterprise based on the news in the most recent time period, and obtain the ESG index of the enterprise in the most recent time period; News conducts ESG evaluations on companies and obtains the ESG index of companies in a certain historical period.
示例性的,可对N篇新闻中的每篇新闻进行实体识别,得到每篇新闻中的实体词,根据每篇新闻中的实体词确定每篇新闻所涉及的企业,其中,该实体词可以为企业的名称(比 如,中文名称、英文名称)、企业的产品、企业的位置、企业的排名,等等;然后,将N篇新闻中每篇新闻所涉及的企业进行合并,可得到该N篇新闻所涉及的所有企业。相应的,根据每篇新闻所涉及的企业,可以得到N篇新闻所涉及的所有企业中每个企业所涉及的新闻的数量,即得到了与待评价企业相关的M篇新闻,M小于或等于N,其中,该待评价企业为该所有企业中的任意一个企业。Exemplarily, entity recognition can be performed on each of the N news articles to obtain the entity words in each news article, and determine the company involved in each news article according to the entity words in each news article, wherein the entity words can be is the name of the enterprise (for example, Chinese name, English name), the products of the enterprise, the location of the enterprise, the ranking of the enterprise, etc.; All companies involved in this news. Correspondingly, according to the companies involved in each news, the number of news related to each company in all companies involved in N news can be obtained, that is, M news related to the company to be evaluated is obtained, and M is less than or equal to N, wherein, the enterprise to be evaluated is any enterprise among all enterprises.
102:对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数。102: Clustering the M news pieces to obtain K first news groups, wherein each first news group in the K first news groups corresponds to a news event, and each first news group A group includes one or more news articles in the M news articles, K is an integer greater than or equal to 1.
示例性的,企业ESG指数确定装置对M篇新闻中的每篇新闻进行语义信息提取,得到每篇新闻所对应的语义向量,其中,每篇新闻所对应的语义向量用于表征每篇新闻所描述的新闻事件。基于每篇新闻的语义向量对M篇新闻进行聚类,得到K个第一新闻组,即确定M篇新闻中任意两篇新闻的语义向量之间的欧式距离,将任意两篇新闻的语义向量之间的欧式距离作为任意两篇新闻之间的第一相似度;然后,将第一相似度大于第一阈值的两篇新闻归为一类,即归类为到一个第一新闻组里面,得到K个第一新闻组。由于两篇新闻的语义向量之间的第一相似度大于第一阈值,则说明这两篇新闻描述的是一个新闻事件,因此每个第一新闻组中所包括的一篇或多篇新闻描述的是同一个新闻事件。Exemplarily, the device for determining the ESG index of an enterprise extracts semantic information from each of the M pieces of news to obtain a semantic vector corresponding to each piece of news, wherein the semantic vector corresponding to each piece of news is used to represent the information contained in each piece of news Describe the news event. Cluster M news based on the semantic vector of each news to obtain K first news groups, that is, determine the Euclidean distance between the semantic vectors of any two news in M news, and combine the semantic vectors of any two news The Euclidean distance between them is used as the first similarity between any two news articles; then, the two news articles with the first similarity greater than the first threshold are classified into one category, that is, they are classified into a first news group, Get the K first newsgroups. Since the first similarity between the semantic vectors of the two news is greater than the first threshold, it means that the two news describe a news event, so one or more news descriptions included in each first news group It's the same news event.
103:对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数。103: Clustering one or more news articles in each first news group to obtain L second news groups corresponding to each first news group, wherein each second news group The group includes one original news and H reprinted news corresponding to the original news, where H is an integer greater than or equal to 0.
示例性的,企业ESG指数确定装置确定每个第一新闻组中任意两篇新闻之间的第二相似度,将第二相似度大于第二阈值的新闻归为一类,即归类到一个第二新闻组里面,得到L个第二新闻组。具体的,每个第一新闻组中的任意两篇新闻之间的第二相似度可以为该两篇新闻的新闻内容之间的相似度,比如,可以计算该两篇新闻的标题以及摘要之间的相似度,将标题以及摘要之间的相似度作为这两篇新闻之间的相似度,若相似度大于第二阈值,则说明该两篇新闻是同一篇新闻(即两篇新闻中一篇是原创新闻,一篇是转载这篇原创新闻的新闻;或者,两篇新闻都是转载新闻)。经过上述聚类之后,就可以把一篇原创新闻和转载这篇原创新闻的所有转载新闻聚类到通一个第二新闻组里面。又由于两篇原创新闻之间的内容是不同的,因此,两篇原创新闻会聚类为两个不同的第二新闻组。从而通过上述的聚类方法,可以将所有的原创新闻分开,并将与每篇原创新闻对应的转载新闻聚类到一起,得到L个第二新闻组。Exemplarily, the enterprise ESG index determination device determines the second similarity between any two news articles in each first news group, and classifies the news with the second similarity greater than the second threshold into one category, that is, classifies them into one In the second newsgroup, get L second newsgroups. Specifically, the second similarity between any two news articles in each first news group can be the similarity between the news content of the two news articles, for example, the headline and summary of the two news articles can be calculated. The similarity between the title and the abstract is used as the similarity between the two news, if the similarity is greater than the second threshold, it means that the two news are the same news (that is, one of the two news One article is the original news, and the other is the news that reprinted the original news; or, both news are reprinted news). After the above clustering, a piece of original news and all reprinted news that reprinted this original news can be clustered into a second news group. Also, because the contents of the two original news articles are different, the two original news articles will be clustered into two different second newsgroups. Therefore, through the above-mentioned clustering method, all original news can be separated, and reprinted news corresponding to each original news can be clustered together to obtain L second news groups.
此外,对于每个第二新闻组来说,必然包含了一篇原创新闻和H篇转载新闻,当H大于或者等于1时,若想确定出每个第二新闻组中的原创新闻,则可以获取每个第二新闻组中的每篇新闻的发表时间。对于原创新闻来说,发表时间必然是最早的,因此可以将每个第二新闻组中发表时间最早的新闻作为每个第二新闻组中的原创新闻,剩余的新闻均为每个第二新闻组中的转载新闻。In addition, for each second news group, it must contain an original news and H reprinted news. When H is greater than or equal to 1, if you want to determine the original news in each second news group, you can Obtain the publication time of each news item in each second newsgroup. For original news, the publication time must be the earliest, so the news with the earliest publication time in each second news group can be used as the original news in each second news group, and the rest of the news is the first news of each second news group. Reprinted news in the group.
104:根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力。104: According to an original news and H pieces of reprinted news included in each of the L second news groups, determine the target public opinion score of the news event corresponding to each of the first news groups, so The target public opinion score is used to represent the influence of the news event corresponding to each first news group on the enterprise to be evaluated.
示例性的,首先从新闻数量上确定每个第一新闻组的缩放比例,再从新闻内容上确定每个第一新闻组的原始舆情评分;最后,将每个第一新闻组的缩放比例以及原始舆情评分进行融合,得到每个第一新闻组的目标舆情评分。其中,每个第一新闻组的缩放比例表征了社会对每个第一新闻组所对应的新闻事件的关注度,目标舆情评分用于表征每个第一新闻组对应的新闻事件对待评价企业的影响力,即对待评价企业的影响程度。Exemplary, first determine the scaling ratio of each first news group from the news quantity, then determine the original public opinion score of each first news group from the news content; finally, the scaling ratio of each first news group and The original public opinion scores are fused to obtain the target public opinion scores of each first news group. Among them, the scaling ratio of each first news group represents the social attention to the news event corresponding to each first news group, and the target public opinion score is used to represent the news event corresponding to each first news group. Influence refers to the degree of influence on the enterprise being evaluated.
具体的,获取每个第二新闻组中的原创新闻的预设比例,其中,预设比例表征了原创新闻在社会对每个第一新闻组对应的新闻事件的关注度上的贡献度。应理解,原创新闻是由新闻工作者自主创作的,是与水军炒作无关的,因此,一个新闻事件下的所有原创新闻引起的对这个新闻事件的关注度应该是相同,也就是说,每篇原创新闻在新闻事件所受的社会的关注方面的所作出。所以,为每篇原创新闻事件预先设定一个比例,使原创新闻在关注度方面的贡献不受转载新闻数量的影响;根据每个第二新闻组中的原创新闻的的预设比例以及每个第二新闻组中所包含的转载新闻的数量H,确定每个第二新闻组的缩放比例。Specifically, a preset ratio of original news in each second news group is acquired, wherein the preset ratio represents the contribution of original news to social attention to news events corresponding to each first news group. It should be understood that original news is independently created by journalists and has nothing to do with troll hype. Therefore, all original news under a news event should attract the same attention to this news event, that is to say, every The original news is made in terms of the social attention received by the news event. Therefore, a ratio is preset for each original news event, so that the contribution of original news in terms of attention is not affected by the number of reprinted news; according to the preset ratio of original news in each second news group and each The number H of reposted news contained in the second newsgroup determines the scaling ratio of each second newsgroup.
示例性的,每个第二新闻组的缩放比例可以通过公式(1)表示:Exemplarily, the scaling ratio of each second newsgroup can be expressed by formula (1):
heat
j=log2(H
j+1)+a 公式(1)
heat j =log2(H j +1)+a formula (1)
其中,heat
j为L个第二新闻组中的第j个第二新闻组的缩放比例,H
j为第j个第二新闻组所包含的转载新闻的数量,a为每个第二新闻组中的原创新闻的的预设比例,一般取值为1。
Among them, heat j is the zoom ratio of the jth second newsgroup in the L second newsgroups, H j is the number of reprinted news contained in the jth second newsgroup, a is each second newsgroup The preset ratio of original news in , generally takes a value of 1.
可以看出,上述公式(1)在计算每个第二新闻组的缩放比例时,随着转载新闻的数量增多,缩放比例的增长幅度逐渐减弱,这样刻意转载或者炒作的行为就会被减弱,即通过公式(1)减弱了刻意转载或水军炒作的行为,使确定出的缩放比例(关注度)更加精确,即不容易受水军炒作的影响。It can be seen that when the above formula (1) calculates the scaling ratio of each second news group, as the number of reprinted news increases, the growth rate of the scaling ratio gradually weakens, so that the behavior of deliberate reprinting or hype will be weakened. That is to say, through the formula (1), the behavior of deliberate reposting or troll hype is weakened, so that the determined scaling ratio (attention) is more accurate, that is, it is not easily affected by troll hype.
示例性的,每个第二新闻组的缩放比例还可以通过公式(2)表示:Exemplarily, the scaling ratio of each second newsgroup can also be expressed by formula (2):
可以看出,H
j=0时,也就是每个第二新闻组中的转载新闻的数量为零时,则第二新闻组的缩放比例为a,即缩放比列只与原创新闻的数量相关;当H
j无穷大时,第二新闻组的缩放比列为1+a,即有上限。这样即使有水军炒作,也会将炒作的行为收敛在1+a处,不会无休止的受炒作行为的影响,不会让社会对新闻事件的的关注度无限放大,由于社会对某个新闻事件的关注度本身就不会无限的放大,从而使确定出的缩放比例更加精确,使最后计算出的目标舆情评分更加准确。
It can be seen that when H j =0, that is, when the number of reprinted news in each second news group is zero, the scaling ratio of the second news group is a, that is, the scaling ratio column is only related to the number of original news ; When H j is infinite, the scaling ratio of the second newsgroup is listed as 1+a, that is, there is an upper limit. In this way, even if there is speculation by the trolls, the behavior of the hype will be converged at 1+a, and it will not be endlessly affected by the hype behavior, and the society's attention to news events will not be infinitely enlarged. The degree of attention of the news event itself will not be infinitely magnified, so that the determined scaling ratio will be more accurate, and the final calculated target public opinion score will be more accurate.
进一步的,对每个第二新闻组的缩放比例进行求和,得到每个第一新闻组的缩放比列。因此,每个第一新闻组的缩放比例可以通过公式(3)表示:Further, summing the scaling ratios of each second newsgroup to obtain a scaling ratio column of each first newsgroup. Therefore, the scaling ratio of each first newsgroup can be expressed by formula (3):
其中,heat为每个第一新闻组的缩放比例。Wherein, heat is the zoom ratio of each first newsgroup.
在本申请的一个实施方式中,在确定每个第一新闻组对应的新闻事件的舆情评分方面,可以从每个第一新闻组对应的L个第二新闻组随机选择一组第二新闻组,并对该第二新闻组中的任意一篇新闻进行情感识别,得到与该篇新闻对应的情感标签,将该篇新闻的情感标签作为每个第一新闻组对应的情感标签;或者,从每个第一新闻组对应的所有新闻中随机选择一篇新闻进行情感识别,得到该篇新闻的情感标签,并将该篇新闻的情感标签作为每个第一新闻组对应的情感标签。由于每个第一新闻组下的所有新闻所描述的新闻事件是相同的。因此,为每个第一新闻组添加情感标签的方式比较灵活,不去限定为每个第一新 闻组添加情感标签的方式。其中,每个第一新闻组对应的情感标签用于表征每个第一新闻组所对应的新闻事件是正面新闻事件,还是负面新闻事件。In one embodiment of the present application, in determining the public opinion score of the news event corresponding to each first news group, a group of second news groups can be randomly selected from the L second news groups corresponding to each first news group , and carry out emotion recognition on any piece of news in the second newsgroup to obtain the emotion label corresponding to the news, and use the emotion label of the news as the emotion label corresponding to each first newsgroup; or, from A piece of news is randomly selected from all the news corresponding to each first news group for emotion recognition, and the emotional label of the news is obtained, and the emotional label of the news is used as the emotional label corresponding to each first news group. Because the news events described by all the news under each first news group are the same. Therefore, the mode of adding emotional tags for each first newsgroup is more flexible, and does not limit the mode of adding emotional tags for each first newsgroup. Wherein, the sentiment tag corresponding to each first news group is used to represent whether the news event corresponding to each first news group is a positive news event or a negative news event.
进一步的,在情感标签用于表征每个第一新闻组所描述的新闻事件是负面新闻事件时,确定负面新闻事件的具体内容,比如,可以为罚款数目、业务中断、高管刑拘或者禁止政府采购,等;在情感标签用于表征每个第一新闻组所描述的新闻事件是正面新闻事件时,确定正面新闻事件的具体内容,比如,可以为节能减排有效。Further, when the emotion tag is used to represent that the news event described by each first news group is a negative news event, the specific content of the negative news event can be determined, for example, it can be the amount of fines, business interruption, high-level criminal detention or prohibition of government Purchasing, etc.; when the sentiment tag is used to represent that the news event described by each first news group is a positive news event, determine the specific content of the positive news event, for example, it can be effective for energy saving and emission reduction.
进一步的,确定出L个第二新闻组中所有新闻的发表媒体;例如,对L个第二新闻组中每篇新闻的新闻内容进行文本识别,从每篇新闻内容中识别出每篇新闻的发表媒体,比如,有些新闻的发表媒体位于新闻内容的最后落款处,有些新闻的发表媒体位于新闻内容的最开始位置处。因此,对每篇新闻的新闻内容进行文本识别,可得到每篇新闻的发表媒体;并基于L个第二个新闻组中所有新闻的发表媒体确定出每个第一新闻组中的最高级别的发表媒体。应理解,企业ESG指数确定装置中预置有发表媒体的优先级关系。一般来说,优先级如下所示:国家级的发表媒体>省级的发表媒体>市级的发表媒体>自媒体>娱乐媒体,等等。因此,在识别出L个第二新闻组中每篇新闻的发表媒体之后,根据预置的优先级关系,确定出每个第一新闻组中的最高级别的发表媒体。Further, determine the publishing media of all news in the L second newsgroups; for example, carry out text recognition to the news content of each news in the L second newsgroups, identify the content of each news from each news content Publishing media, for example, the publishing media of some news is located at the end of the news content, and the publishing media of some news is located at the very beginning of the news content. Therefore, carry out text recognition to the news content of each news, can obtain the publishing media of each news; Publish the media. It should be understood that the priority relationship of publishing media is preset in the enterprise ESG index determining device. Generally speaking, the priorities are as follows: national-level publishing media > provincial-level publishing media > municipal-level publishing media > self-media > entertainment media, and so on. Therefore, after identifying the publishing media of each news article in the L second newsgroups, the highest-level publishing media in each first newsgroup is determined according to the preset priority relationship.
最后,根据发表媒体、情感标签与舆情评分之间的映射关系,以及每个第一新闻组中的最高级别的发表媒体以及每个第一新闻组所对应的情感标签,确定每个第一新闻组对应的原始舆情评分。Finally, according to the mapping relationship between the publishing media, emotional labels and public opinion scores, as well as the highest-level publishing media in each first news group and the corresponding emotional tags of each first news group, determine the The original public opinion score corresponding to the group.
示例性的,发表媒体、情感标签与舆情评分之间的映射关系可以为:若新闻事件表征为中性新闻事件(即没有批评也没有表扬),则设定舆情评分为0,若新闻事件表征为负面新闻事件时,从0开始往下扣分,扣到-10为止,且扣分程度与负面新闻事件具体内容以及发表媒体的级别有关,比如,每个第一新闻组对应的新闻事件为高管被拘留,且最高级别的发表媒体为国家级别的发表媒体,则扣10分,即每个第一新闻组对应的原始舆情评分为-10分;若新闻事件表征为正面新闻事件时,从0开始加,并加到5为止,同样,扣分程度与负面新闻事件具体内容以及发表媒体的级别有关;比如,新闻事件为年终奖覆盖率为100%,且最高级别的发表媒体为省级的发表媒体,则加2分,即该新闻事件对应的原始舆情评分为2分。也就是说,预先配置了各个类型的新闻事件、各个发表级别与舆情评分之间的映射关系。当确定了每个第一新闻组的最高级别的发表媒体以及新闻事件的类型之后,根据该映射关系即可确定出每个第一新闻组对应的原始舆情评分。Exemplarily, the mapping relationship between publishing media, emotional labels and public opinion scores can be as follows: if the news event is characterized as a neutral news event (that is, there is no criticism or praise), then set the public opinion score to 0; if the news event represents When it is a negative news event, points will be deducted from 0 to -10, and the degree of deduction is related to the specific content of the negative news event and the level of the publishing media. For example, the news event corresponding to each first news group is If executives are detained and the highest-level publishing media is a national-level publishing media, 10 points will be deducted, that is, the original public opinion score corresponding to each first news group is -10 points; if the news event is characterized as a positive news event, Add from 0 to 5. Similarly, the degree of deduction is related to the specific content of the negative news event and the level of the publishing media; 2 points are added to the publishing media of the first level, that is, the original public opinion score corresponding to the news event is 2 points. That is to say, the mapping relationship between various types of news events, various publishing levels and public opinion scores is pre-configured. After the highest-level publishing media and the type of news events of each first news group are determined, the original public opinion score corresponding to each first news group can be determined according to the mapping relationship.
此外,由于负面新闻多由他方(比如媒体、政府等)发起,起舆论监督作用,而正面新闻可能由企业自身公关部门发起,因此,负面新闻的扣分范围(-10)要比正面新闻的加分范围(+5)广,使舆情评分的分布更加合理。In addition, because negative news is mostly initiated by other parties (such as the media, the government, etc.) to monitor public opinion, while positive news may be initiated by the company’s own public relations department, the range of deductions for negative news (-10) is lower than that of positive news. The range of extra points (+5) is wide, which makes the distribution of public opinion scores more reasonable.
最后,将每个第一新闻组的原始舆情评分与缩放比例进行乘积运算,得到每个第一新闻组对应的目标舆情评分。Finally, the original public opinion score of each first news group is multiplied by the scaling ratio to obtain the target public opinion score corresponding to each first news group.
因此,每个第一新闻组对应的目标舆情评分可以通过公式(4)表示:Therefore, the target public opinion score corresponding to each first news group can be expressed by formula (4):
adjscoreEvent=heat*adjscore 公式(4)adjscoreEvent=heat*adjscore Formula (4)
其中,adjscoreEvent为每个第一新闻组对应的目标舆情评分,adjscore为每个第一新闻组的原始舆情评分。Among them, adjscoreEvent is the target public opinion score corresponding to each first newsgroup, and adjscore is the original public opinion score of each first newsgroup.
其中,缩放比例heat在公式(4)中起到了放大器的作用,即将每个新闻事件的原始舆情评分进行放大或缩小。比如,若缩放比例是1.5,则将原始舆情评分放大1.5倍。由于缩放比例本身就反映了社会对该新闻事件的关注度,所以通过公式(4)中的乘法融合,可 使得负面的舆情扣分更多,正面的舆情加分更多,从而使最后计算出的目标舆情评分更加合理和精确。Among them, the scaling ratio heat plays the role of an amplifier in formula (4), that is, the original public opinion score of each news event is enlarged or reduced. For example, if the scaling ratio is 1.5, the original public opinion score is enlarged by 1.5 times. Since the scaling ratio itself reflects the social attention to the news event, through the multiplicative fusion in formula (4), negative public opinion can be deducted more points, and positive public opinion can be added more points, so that the final calculated The target public opinion score is more reasonable and accurate.
105:根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数。105: Perform an ESG evaluation on the enterprise to be evaluated according to the target public opinion score of the news event corresponding to each first news group, and obtain an ESG index of the enterprise to be evaluated.
示例性的,可以将每个第一新闻组对应的目标舆情评分作为从新闻维度对待评价企业进行ESG评价时的ESG指数。Exemplarily, the target public opinion score corresponding to each first news group may be used as an ESG index when ESG evaluation is performed on the enterprise to be evaluated from the news dimension.
106:向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。106: Send the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
可以看出,在本申请实施方式中,在获取到与用于对待评价企业进行ESG评价的新闻后,不是直接使用新闻的数量对待评价企业进行ESG评价,而是,采用聚类的方法,将属于每个新闻事件的新闻分组,从新闻事件粒度进行ESG评价,实现更精确的评价;然后,针对每个新闻事件下的新闻组,也不是直接使用该新闻组中的新闻数量进行ESG评价,而是,进一步聚类从原创新闻和转载新闻的维度对待评价企业进行ESG评价,对原创新闻和转载新闻进行不同的区分,可进一步提高ESG评价的准确度,使得到的ESG指数比较高。从而可以向目标设备发送精度较高的ESG指数,提高制定出的投资决策的准确度。It can be seen that in the implementation of the present application, after obtaining the news for ESG evaluation of the enterprise to be evaluated, instead of directly using the number of news to evaluate the ESG of the enterprise to be evaluated, the method of clustering is adopted to For the news groups belonging to each news event, ESG evaluation is carried out from the granularity of news events to achieve more accurate evaluation; then, for the news groups under each news event, ESG evaluation is not directly performed using the number of news in the news group, Instead, further clustering from the dimension of original news and reprinted news to evaluate companies for ESG evaluation, and different distinctions between original news and reprinted news can further improve the accuracy of ESG evaluation and make the obtained ESG index relatively high. In this way, ESG indexes with high precision can be sent to the target equipment, and the accuracy of the investment decision made can be improved.
在本申请的一个实施方式中,实质上根据目标设备的角色不同,可以制定出与待评价企业相关的决策也是不同的,本申请的ESG指数有以下应用场景:In one embodiment of this application, in essence, according to the different roles of the target device, the decisions related to the enterprise to be evaluated are also different. The ESG index of this application has the following application scenarios:
场景1:当目标设备为投资机构的设备时,将待评价企业的ESG指数发送给投资机构,则投资机构可以制定出的与该待评价企业相关的决策为:与该待评价企业相关的投资决策。例如,由于企业的ESG指数反映了该企业的价值以及可持续发展的能力,则当待评价企业的ESG指数较高时,则该投资决策可以为追加对该待评价企业的投资金额以及投资周期;当待评价企业的ESG指数较低时,则该投资决可以撤资对该待评价企业的投资或者减少对待评价企业的投资,等等。总的来说,将待评价企业的ESG指数发送给投资机构,则可以对投资机构的投资决策的制定提供方向指引,降低投资风险。Scenario 1: When the target equipment is the equipment of an investment institution, and the ESG index of the enterprise to be evaluated is sent to the investment institution, the investment institution can make a decision related to the enterprise to be evaluated: investment related to the enterprise to be evaluated decision making. For example, since the ESG index of an enterprise reflects the value and sustainable development capability of the enterprise, when the ESG index of the enterprise to be evaluated is high, the investment decision can be to increase the investment amount and investment cycle of the enterprise to be evaluated ; When the ESG index of the enterprise to be evaluated is low, the investment can definitely divest the investment in the enterprise to be evaluated or reduce the investment in the enterprise to be evaluated, and so on. In general, sending the ESG index of the enterprise to be evaluated to the investment institution can provide direction guidance for the investment decision-making of the investment institution and reduce investment risk.
场景2:当目标设备为待评价企业的设备时,将待评价企业的ESG指数发送给待评价企业,则待评价企业可以制定与该待评价企业相关的决策为,制定与该待评价企业相关的管理决策。示例性的,由于企业的ESG指数反映了该企业的价值以及可持续发展的能力,随着投资者对ESG指数的接受程度加深,对企业社会责任的看重,更愿意投资ESG指数的较高的企业。因此,当待评价企业的ESG指数较高时,该管理决策为待评价企业制定出加强企业管理的决策,继续保持在ESG方面的优异表现;当待评价企业的ESG指数较低时,该管理决策为调整公司发展战略,提高公司的可持续发展性,提高ESG指数。总体来说,将待评价企业的ESG指数发送给待评价企业,则有利于促进待评价企业努力提高自身ESG评价情况,引导待评价企业的良性发展。Scenario 2: When the target equipment is the equipment of the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is sent to the enterprise to be evaluated, the enterprise to be evaluated can formulate a decision related to the enterprise to be evaluated as follows: management decisions. For example, since the ESG index of a company reflects the value and sustainable development capabilities of the company, as investors' acceptance of the ESG index deepens and they attach importance to corporate social responsibility, they are more willing to invest in higher ESG indexes. enterprise. Therefore, when the ESG index of the enterprise to be evaluated is high, the management decision makes a decision to strengthen corporate management for the enterprise to be evaluated, and continue to maintain excellent performance in ESG; when the ESG index of the enterprise to be evaluated is low, the management decision The decision is to adjust the company's development strategy, improve the company's sustainable development, and improve the ESG index. Generally speaking, sending the ESG index of the companies to be evaluated to the companies to be evaluated will help promote the companies to be evaluated to improve their own ESG evaluation and guide the healthy development of the companies to be evaluated.
场景3:当目标设备为政府或者社会组织的设备时,将待评价企业的ESG指数发送给政府或者社会组织,则政府或者社会组织可以制定与该待评价企业相关的决策为,制定与该待评价企业相关的扶持决策。示例性的,由于企业的ESG指数反映了该企业的价值以及可持续发展的能力。因此,当待评价企业的ESG指数较高时,表明该待评价企业的发展潜力比较大,则扶持决策可以大为力宣传该待评价企业,以便给这样的企业提供更多的发展机会;当待评价企业的ESG指数较低时,表明该待评价企业的发展潜力较低,则扶持决策可以为责令这样的企业调整公司发展战略,或者,减少扶持,以引导待评价企业往良性发展的方向调整。Scenario 3: When the target equipment is the equipment of the government or social organization, the ESG index of the enterprise to be evaluated is sent to the government or social organization, then the government or social organization can make a decision related to the enterprise to be evaluated. Evaluate enterprise-related support decisions. Exemplarily, the ESG index of an enterprise reflects the value and sustainable development capability of the enterprise. Therefore, when the ESG index of the enterprise to be evaluated is high, it indicates that the development potential of the enterprise to be evaluated is relatively large, and the support decision can greatly promote the enterprise to be evaluated so as to provide more development opportunities for such enterprises; When the ESG index of the enterprise to be evaluated is low, it indicates that the development potential of the enterprise to be evaluated is low, and the support decision can be to order such an enterprise to adjust the company's development strategy, or reduce support to guide the enterprise to be evaluated to a healthy development direction Adjustment.
参阅图2,图2本申请实施例提供的一种企业ESG指数确定装置的功能单元组成框图。企业ESG指数确定装置200包括:获取单元201、处理单元202和发送单元203,其中:Referring to FIG. 2, FIG. 2 is a block diagram of functional units of an enterprise ESG index determination device provided in the embodiment of the present application. The enterprise ESG index determination device 200 includes: an acquisition unit 201, a processing unit 202 and a sending unit 203, wherein:
获取单元201,用于获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或 者等于1的整数;Acquisition unit 201 is used to obtain M pieces of news of the enterprise to be evaluated within the preset time period, wherein M is an integer greater than or equal to 1;
处理单元202,用于对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;The processing unit 202 is configured to cluster the M pieces of news to obtain K first news groups, wherein each first news group in the K first news groups corresponds to a news event, and each of the K first news groups corresponds to a news event. A first news group includes one or more news articles in the M news articles, K is an integer greater than or equal to 1;
对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
发送单元203,用于向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数对制定与所述待评价企业相关的投资决策。The sending unit 203 is configured to send the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
在本申请一些可能的实施方式中,在根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分方面,处理单元202,具体用于:In some possible implementations of the present application, according to an original news item and H reprinted news items included in each of the L second news groups, it is determined that each of the first news groups corresponds to In terms of the target public opinion scoring of news events, the processing unit 202 is specifically used for:
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例,所述每个第一新闻组对应的新闻事件的缩放比例表征了对所述每个第一新闻组对应的新闻事件的关注度;According to an original news included in each of the L second newsgroups, and the number H of reprinted news, determine the scaling ratio of the news event corresponding to each of the first newsgroups, the The scaling ratio of the news event corresponding to each first news group represents the degree of attention to the news event corresponding to each first news group;
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分;According to the news content of an original news and the news content of H reprinted news included in each second news group in the L second news groups, determine the original news event corresponding to each of the first news groups Score of public opinion;
对所述每个第一新闻组对应的新闻事件的缩放比例以及原始舆情评分进行乘积处理,得到所述每个第一新闻组对应的新闻事件的目标舆情评分。Perform product processing on the scaling ratio of the news event corresponding to each first news group and the original public opinion score to obtain the target public opinion score of the news event corresponding to each first news group.
在本申请一些可能的实施方式中,在根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例方面,处理单元202,具体用于:In some possible implementations of the present application, according to an original news included in each second news group in the L second news groups, and the number H of reprinted news, determine the number of each first news In terms of the scaling ratio of the news event corresponding to the group, the processing unit 202 is specifically used for:
获取所述每个第二新闻组中的原创新闻的预设比例,所述预设比例表征了所述原创新闻在社会对所述每个第一新闻组对应的新闻事件的关注度上的贡献度;Obtaining a preset ratio of original news in each second news group, where the preset ratio represents the contribution of the original news to social attention to news events corresponding to each first news group Spend;
根据所述每个第二新闻组中的原创新闻的预设比例以及所述每个第二新闻组包括的转载新闻的数量H,确定所述每个第二新闻组的缩放比例;Determine the scaling ratio of each second news group according to the preset ratio of original news in each second news group and the number H of reprinted news included in each second news group;
对所述L个第二新闻组的缩放比例进行求和,得到所述每个第一新闻组对应的新闻事件的缩放比例。The scaling ratios of the L second newsgroups are summed to obtain the scaling ratios of the news events corresponding to each of the first newsgroups.
在本申请一些可能的实施方式中,在根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分方面,处理单元202,具体用于:In some possible implementations of the present application, according to the news content of an original news and the news content of H reprinted news included in each of the L second news groups, it is determined that each In terms of the original public opinion score of the news event corresponding to the first news group, the processing unit 202 is specifically used for:
对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签,其中,所述每个第一新闻组对应的情感标签用于表征所述每个第一新闻组对应的新闻事件为正面新闻事件或者负面新闻事件;Perform emotion recognition on the news in each first news group to obtain an emotion tag corresponding to each first news group, wherein the emotion tag corresponding to each first news group is used to represent each The news event corresponding to the first news group is a positive news event or a negative news event;
获取所述每个第一新闻组中的每篇新闻的发表媒体;Acquiring the publishing media of each news item in each of the first newsgroups;
根据所述每个第一新闻组中的每篇新闻的发表媒体,确定所述每个第一新闻组中的最高级别的发表媒体;determining the highest-level publishing media in each of the first newsgroups according to the publishing media of each news in each of the first newsgroups;
根据发表媒体、情感标签与舆情评分之间的映射关系,以及所述每个第一新闻组中的最高级别的发表媒体以及所述每个第一新闻组对应的情感标签,确定所述每个第一新闻组 对应的新闻事件的原始舆情评分。According to the mapping relationship between published media, emotional tags and public opinion scores, and the highest-level published media in each first news group and the corresponding emotional tags of each first news group, determine each The original public opinion score of the news event corresponding to the first news group.
在本申请一些可能的实施方式中,在对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签方面,处理单元202,具体用于:In some possible implementations of the present application, in terms of performing emotion recognition on the news in each of the first newsgroups to obtain the emotion tags corresponding to each of the first newsgroups, the processing unit 202 is specifically configured to:
从所述L个第二新闻组随机选择一个第二新闻组,并对所述第二新闻组中的任意一篇新闻进行情感识别,得到与该篇新闻的情感标签,将该篇新闻的情感标签作为所述每个第一新闻组对应的情感标签;Randomly select a second news group from the L second news groups, and carry out emotional recognition to any news in the second news group, obtain the emotional label with the news, and use the emotional label of the news The label is used as the sentiment label corresponding to each of the first newsgroups;
或者,or,
从所述每个第一新闻组包括的一篇或多篇新闻中随机选择一篇新闻,并对该篇新闻进行情感识别,得到与该篇新闻对应的情感标签,将该篇新闻对应的情感标签作为所述每个第一新闻组对应的情感标签。Randomly select a piece of news from one or more pieces of news included in each of the first newsgroups, and carry out emotion recognition to the piece of news, obtain the emotion label corresponding to the piece of news, and the emotion corresponding to the piece of news The label serves as the sentiment label corresponding to each first newsgroup.
在本申请一些可能的实施方式中,在对所述M篇新闻进行聚类,得到K个第一新闻组方面,处理单元202,具体用于:In some possible implementation manners of the present application, in terms of clustering the M pieces of news to obtain K first newsgroups, the processing unit 202 is specifically configured to:
对所述M篇新闻中的每篇新闻进行语义信息提取,得到所述每篇新闻的语义向量,其中,所述每篇新闻的语义向量用于表征所述每篇新闻所描述的新闻事件;Performing semantic information extraction on each of the M pieces of news to obtain a semantic vector of each of the news, wherein the semantic vector of each of the news is used to represent the news events described in each of the news;
确定所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度;determining the first similarity between the semantic vectors of any two news in the M news;
根据所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组中任意两篇新闻的语义向量之间的第一相似度大于第一阈值。According to the first similarity between the semantic vectors of any two news in the M news, the M news are clustered to obtain K first news groups, wherein, among the K first news groups The first similarity between the semantic vectors of any two news articles in each first news group is greater than the first threshold.
在本申请一些可能的实施方式中,在对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组方面,处理单元202,具体用于:In some possible implementation manners of the present application, in terms of clustering one or more news articles in each first news group to obtain L second news groups corresponding to each first news group , the processing unit 202 is specifically used for:
确定所述每个第一新闻组中任意两篇新闻之间的第二相似度,其中,所述每个第一新闻组中任意两篇新闻之间的第二相似度用于表征所述每个第一新闻组中任意两篇新闻的新闻内容之间的相似度;determining a second similarity between any two news articles in each first news group, wherein the second similarity between any two news articles in each first news group is used to characterize each The similarity between the news content of any two news articles in the first news group;
根据所述每个第一新闻组中任意两篇新闻之间的第二相似度对所述每个第一新闻组包括的一篇或多篇新闻进行聚类,得到L个第二新闻组,其中,每个第二新闻组中任意两篇新闻之间的第二相似度大于第二阈值。According to the second similarity between any two news articles in each first news group, one or more news articles included in each first news group are clustered to obtain L second news groups, Wherein, the second similarity between any two news articles in each second news group is greater than a second threshold.
参阅图3,图3为本申请实施例提供的一种电子设备的结构示意图。如图3所示,电子设备300包括收发器301、处理器302和存储器303。它们之间通过总线304连接。存储器303用于存储计算机程序和数据,并可以将存储器303存储的数据传输给处理器302。Referring to FIG. 3 , FIG. 3 is a schematic structural diagram of an electronic device provided in an embodiment of the present application. As shown in FIG. 3 , an electronic device 300 includes a transceiver 301 , a processor 302 and a memory 303 . They are connected through a bus 304 . The memory 303 is used to store computer programs and data, and can transmit the data stored in the memory 303 to the processor 302 .
处理器302用于读取存储器303中的计算机程序执行以下操作:The processor 302 is used to read the computer program in the memory 303 to perform the following operations:
控制收发器301获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;Controlling the transceiver 301 to obtain M pieces of news about the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;
对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;Clustering the M pieces of news to obtain K first newsgroups, wherein each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;
对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;
根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;
控制收发器301向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用 户根据所述待评价企业的ESG指数对制定与所述待评价企业相关的投资决策。The control transceiver 301 sends the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device can make an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
在本申请一些可能的实施方式中,在根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分方面,处理器302具体用于:In some possible implementations of the present application, according to an original news item and H reprinted news items included in each of the L second news groups, it is determined that each of the first news groups corresponds to In terms of the target public opinion scoring of news events, the processor 302 is specifically used for:
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例,所述每个第一新闻组对应的新闻事件的缩放比例表征了对所述每个第一新闻组对应的新闻事件的关注度;According to an original news included in each of the L second newsgroups, and the number H of reprinted news, determine the scaling ratio of the news event corresponding to each of the first newsgroups, the The scaling ratio of the news event corresponding to each first news group represents the degree of attention to the news event corresponding to each first news group;
根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分;According to the news content of an original news and the news content of H reprinted news included in each second news group in the L second news groups, determine the original news event corresponding to each of the first news groups Score of public opinion;
对所述每个第一新闻组对应的新闻事件的缩放比例以及原始舆情评分进行乘积处理,得到所述每个第一新闻组对应的新闻事件的目标舆情评分。Perform product processing on the scaling ratio of the news event corresponding to each first news group and the original public opinion score to obtain the target public opinion score of the news event corresponding to each first news group.
在本申请一些可能的实施方式中,在根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例方面,处理器302具体用于执行以下操作:In some possible implementations of the present application, according to an original news included in each second news group in the L second news groups, and the number H of reprinted news, determine the number of each first news Regarding the scaling ratio of the news event corresponding to the group, the processor 302 is specifically configured to perform the following operations:
获取所述每个第二新闻组中的原创新闻的预设比例,所述预设比例表征了所述原创新闻在社会对所述每个第一新闻组对应的新闻事件的关注度上的贡献度;Obtaining a preset ratio of original news in each second news group, where the preset ratio represents the contribution of the original news to social attention to news events corresponding to each first news group Spend;
根据所述每个第二新闻组中的原创新闻的预设比例以及所述每个第二新闻组包括的转载新闻的数量H,确定所述每个第二新闻组的缩放比例;Determine the scaling ratio of each second news group according to the preset ratio of original news in each second news group and the number H of reprinted news included in each second news group;
对所述L个第二新闻组的缩放比例进行求和,得到所述每个第一新闻组对应的新闻事件的缩放比例。The scaling ratios of the L second newsgroups are summed to obtain the scaling ratios of the news events corresponding to each of the first newsgroups.
在本申请一些可能的实施方式中,在根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分方面,处理器302具体用于执行以下操作:In some possible implementations of the present application, according to the news content of an original news and the news content of H reprinted news included in each of the L second news groups, it is determined that each In terms of the original public opinion score of the news event corresponding to the first news group, the processor 302 is specifically configured to perform the following operations:
对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签,其中,所述每个第一新闻组对应的情感标签用于表征所述每个第一新闻组对应的新闻事件为正面新闻事件或者负面新闻事件;Perform emotion recognition on the news in each first news group to obtain an emotion tag corresponding to each first news group, wherein the emotion tag corresponding to each first news group is used to represent each The news event corresponding to the first news group is a positive news event or a negative news event;
获取所述每个第一新闻组中的每篇新闻的发表媒体;Acquiring the publishing media of each news item in each of the first newsgroups;
根据所述每个第一新闻组中的每篇新闻的发表媒体,确定所述每个第一新闻组中的最高级别的发表媒体;determining the highest-level publishing media in each of the first newsgroups according to the publishing media of each news in each of the first newsgroups;
根据发表媒体、情感标签与舆情评分之间的映射关系,以及所述每个第一新闻组中的最高级别的发表媒体以及所述每个第一新闻组对应的情感标签,确定所述每个第一新闻组对应的新闻事件的原始舆情评分。According to the mapping relationship between published media, emotional tags and public opinion scores, and the highest-level published media in each first news group and the corresponding emotional tags of each first news group, determine each The original public opinion score of the news event corresponding to the first news group.
在本申请一些可能的实施方式中,在对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签方面,处理器302具体用于执行以下操作:In some possible implementations of the present application, in terms of performing emotion recognition on the news in each of the first newsgroups to obtain the emotion tags corresponding to each of the first newsgroups, the processor 302 is specifically configured to perform the following operate:
从所述L个第二新闻组随机选择一个第二新闻组,并对所述第二新闻组中的任意一篇新闻进行情感识别,得到与该篇新闻的情感标签,将该篇新闻的情感标签作为所述每个第一新闻组对应的情感标签;Randomly select a second news group from the L second news groups, and carry out emotional recognition to any news in the second news group, obtain the emotional label with the news, and use the emotional label of the news The label is used as the sentiment label corresponding to each of the first newsgroups;
或者,or,
从所述每个第一新闻组包括的一篇或多篇新闻中随机选择一篇新闻,并对该篇新闻进行情感识别,得到与该篇新闻对应的情感标签,将该篇新闻对应的情感标签作为所述每个第一新闻组对应的情感标签。Randomly select a piece of news from one or more pieces of news included in each of the first newsgroups, and carry out emotion recognition to the piece of news, obtain the emotion label corresponding to the piece of news, and the emotion corresponding to the piece of news The label serves as the sentiment label corresponding to each first news group.
在本申请一些可能的实施方式中,在对所述M篇新闻进行聚类,得到K个第一新闻组方面,处理器302具体用于执行以下操作:In some possible implementations of the present application, in terms of clustering the M pieces of news to obtain K first newsgroups, the processor 302 is specifically configured to perform the following operations:
对所述M篇新闻中的每篇新闻进行语义信息提取,得到所述每篇新闻的语义向量,其 中,所述每篇新闻的语义向量用于表征所述每篇新闻所描述的新闻事件;Carry out semantic information extraction to each piece of news in the M pieces of news, obtain the semantic vector of each piece of news, wherein, the semantic vector of each piece of news is used to characterize the news event described in each piece of news;
确定所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度;determining the first similarity between the semantic vectors of any two news in the M news;
根据所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组中任意两篇新闻的语义向量之间的第一相似度大于第一阈值。According to the first similarity between the semantic vectors of any two news in the M news, the M news are clustered to obtain K first news groups, wherein, among the K first news groups The first similarity between the semantic vectors of any two news articles in each first news group is greater than the first threshold.
在本申请一些可能的实施方式中,在对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组方面,处理器302具体用于执行以下操作:In some possible implementation manners of the present application, in terms of clustering one or more news articles in each first news group to obtain L second news groups corresponding to each first news group , the processor 302 is specifically configured to perform the following operations:
确定所述每个第一新闻组中任意两篇新闻之间的第二相似度,其中,所述每个第一新闻组中任意两篇新闻之间的第二相似度用于表征所述每个第一新闻组中任意两篇新闻的新闻内容之间的相似度;determining a second similarity between any two news articles in each first news group, wherein the second similarity between any two news articles in each first news group is used to characterize each The similarity between the news content of any two news articles in the first news group;
根据所述每个第一新闻组中任意两篇新闻之间的第二相似度对所述每个第一新闻组包括的一篇或多篇新闻进行聚类,得到L个第二新闻组,其中,每个第二新闻组中任意两篇新闻之间的第二相似度大于第二阈值。According to the second similarity between any two news articles in each first news group, one or more news articles included in each first news group are clustered to obtain L second news groups, Wherein, the second similarity between any two news articles in each second news group is greater than a second threshold.
具体地,上述收发器301可为图2所述的实施例的企业ESG指数确定装置200的获取单元201和发送单元203,上述处理器302可以为图2所述的实施例的企业ESG指数确定装置200的处理单元202。Specifically, the above-mentioned transceiver 301 can be the acquisition unit 201 and the sending unit 203 of the enterprise ESG index determination device 200 of the embodiment shown in FIG. 2 , and the above-mentioned processor 302 can be the enterprise ESG index determination The processing unit 202 of the device 200 .
本申请实施例还提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行以实现如上述方法实施例中记载的任何一种基于聚类技术的企业ESG指数确定方法的部分或全部步骤。The embodiment of the present application also provides a computer-readable storage medium, the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement any clustering-based Some or all of the steps in the corporate ESG index determination methodology for technology.
可选的,本申请涉及的存储介质如计算机可读存储介质可以是非易失性的,也可以是易失性的。Optionally, the storage medium involved in this application, such as a computer-readable storage medium, may be non-volatile or volatile.
本申请实施例还提供一种计算机程序产品,所述计算机程序产品包括存储了计算机程序的非瞬时性计算机可读存储介质,所述计算机程序可操作来使计算机执行如上述方法实施例中记载的任何一种基于聚类技术的企业ESG指数确定方法的部分或全部步骤。The embodiment of the present application also provides a computer program product, the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to enable the computer to execute the method described in the above method embodiments Part or all of the steps of any method for determining an enterprise ESG index based on clustering technology.
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本申请并不受所描述的动作顺序的限制,因为依据本申请,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于可选实施例,所涉及的动作和模块并不一定是本申请所必须的。It should be noted that for the foregoing method embodiments, for the sake of simple description, they are expressed as a series of action combinations, but those skilled in the art should know that the present application is not limited by the described action sequence. Depending on the application, certain steps may be performed in other orders or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification are all optional embodiments, and the actions and modules involved are not necessarily required by the application.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the foregoing embodiments, the descriptions of each embodiment have their own emphases, and for parts not described in detail in a certain embodiment, reference may be made to relevant descriptions of other embodiments.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置,可通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed device can be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components can be combined or can be Integrate into another system, or some features may be ignored, or not implemented. In another point, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件程序模块的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented not only in the form of hardware, but also in the form of software program modules.
所述集成的单元如果以软件程序模块的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储器中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储器中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储器包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。The integrated units may be stored in a computer-readable memory if implemented in the form of a software program module and sold or used as an independent product. Based on this understanding, the technical solution of the present application is essentially or part of the contribution to the prior art, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a memory. Several instructions are included to make a computer device (which may be a personal computer, server or network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned memory includes: various media that can store program codes such as U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk.
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储器中,存储器可以包括:闪存盘、只读存储器(英文:Read-Only Memory,简称:ROM)、随机存取器(英文:Random Access Memory,简称:RAM)、磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above-mentioned embodiments can be completed by instructing related hardware through a program, and the program can be stored in a computer-readable memory, and the memory can include: a flash disk , Read-only memory (English: Read-Only Memory, referred to as: ROM), random access device (English: Random Access Memory, referred to as: RAM), magnetic disk or optical disc, etc.
以上对本申请实施例进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。The embodiments of the present application have been introduced in detail above, and specific examples have been used in this paper to illustrate the principles and implementation methods of the present application. The descriptions of the above embodiments are only used to help understand the methods and core ideas of the present application; meanwhile, for Those skilled in the art will have changes in specific implementation methods and application scopes based on the ideas of the present application. In summary, the contents of this specification should not be construed as limiting the present application.
Claims (20)
- 一种基于聚类技术的企业ESG指数确定方法,包括:A method for determining an enterprise ESG index based on clustering technology, including:获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;Obtain M pieces of news about the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;Clustering the M pieces of news to obtain K first newsgroups, wherein each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。Sending the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
- 根据权利要求1所述的方法,其中,所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,包括:The method according to claim 1, wherein, according to an original news and H reprinted news included in each second news group in the L second news groups, it is determined that each first news The target public opinion score of the news event corresponding to the group, including:根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例,所述每个第一新闻组对应的新闻事件的缩放比例表征了社会对所述每个第一新闻组对应的新闻事件的关注度;According to an original news included in each of the L second newsgroups, and the number H of reprinted news, determine the scaling ratio of the news event corresponding to each of the first newsgroups, the The scaling ratio of the news event corresponding to each first news group represents the degree of social attention to the news event corresponding to each first news group;根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分;According to the news content of an original news and the news content of H reprinted news included in each second news group in the L second news groups, determine the original news event corresponding to each of the first news groups Score of public opinion;对所述每个第一新闻组对应的新闻事件的缩放比例以及原始舆情评分进行乘积处理,得到所述每个第一新闻组对应的新闻事件的目标舆情评分。Perform product processing on the scaling ratio of the news event corresponding to each first news group and the original public opinion score to obtain the target public opinion score of the news event corresponding to each first news group.
- 根据权利要求2所述的方法,其中,所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例,包括:The method according to claim 2, wherein, according to an original news included in each second news group in the L second news groups, and the number H of reprinted news, it is determined that each of the second news groups The scaling ratio of news events corresponding to a newsgroup, including:获取所述每个第二新闻组中的原创新闻的预设比例,所述预设比例表征了所述原创新闻在社会对所述每个第一新闻组对应的新闻事件的关注度上的贡献度;Obtaining a preset ratio of original news in each second news group, where the preset ratio represents the contribution of the original news to social attention to news events corresponding to each first news group Spend;根据所述每个第二新闻组中的原创新闻的预设比例以及所述每个第二新闻组包括的转载新闻的数量H,确定所述每个第二新闻组的缩放比例;Determine the scaling ratio of each second news group according to the preset ratio of original news in each second news group and the number H of reprinted news included in each second news group;对所述L个第二新闻组的缩放比例进行求和,得到所述每个第一新闻组对应的新闻事件的缩放比例。The scaling ratios of the L second newsgroups are summed to obtain the scaling ratios of the news events corresponding to each of the first newsgroups.
- 根据权利要求2或3所述的方法,其中,所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分,包括:The method according to claim 2 or 3, wherein, according to the news content of an original news and the news content of H reprinted news included in each second news group in the L second news groups, Determining the original public opinion score of the news event corresponding to each first news group includes:对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签,其中,所述每个第一新闻组对应的情感标签用于表征所述每个第一新闻组对应的新闻事件为正面新闻事件或者负面新闻事件;Perform emotion recognition on the news in each first news group to obtain an emotion tag corresponding to each first news group, wherein the emotion tag corresponding to each first news group is used to represent each The news event corresponding to the first news group is a positive news event or a negative news event;获取所述每个第一新闻组中的每篇新闻的发表媒体;Acquiring the publishing media of each news item in each of the first newsgroups;根据所述每个第一新闻组中的每篇新闻的发表媒体,确定所述每个第一新闻组中的最高级别的发表媒体;determining the highest-level publishing media in each of the first newsgroups according to the publishing media of each news in each of the first newsgroups;根据发表媒体、情感标签与舆情评分之间的映射关系,以及所述每个第一新闻组中的最高级别的发表媒体和所述每个第一新闻组对应的情感标签,确定所述每个第一新闻组对应的新闻事件的原始舆情评分。According to the mapping relationship between published media, emotional tags and public opinion scores, and the highest-level published media in each first news group and the corresponding emotional tags of each first news group, determine each The original public opinion score of the news event corresponding to the first news group.
- 根据权利要求4所述的方法,其中,所述对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签,包括:The method according to claim 4, wherein said carrying out emotion recognition to the news in each of the first newsgroups to obtain the corresponding emotion tags of each of the first newsgroups comprises:从所述L个第二新闻组随机选择一个第二新闻组,并对所述第二新闻组中的任意一篇新闻进行情感识别,得到与该篇新闻的情感标签,将该篇新闻的情感标签作为所述每个第一新闻组对应的情感标签;Randomly select a second news group from the L second news groups, and carry out emotional recognition to any news in the second news group, obtain the emotional label with the news, and use the emotional label of the news The label is used as the sentiment label corresponding to each of the first newsgroups;或者,or,从所述每个第一新闻组包括的一篇或多篇新闻中随机选择一篇新闻,并对该篇新闻进行情感识别,得到与该篇新闻对应的情感标签,将该篇新闻对应的情感标签作为所述每个第一新闻组对应的情感标签。Randomly select a piece of news from one or more pieces of news included in each of the first newsgroups, and carry out emotion recognition to the piece of news, obtain the emotion label corresponding to the piece of news, and the emotion corresponding to the piece of news The label serves as the sentiment label corresponding to each first news group.
- 根据权利要求5所述的方法,其中,所述对所述M篇新闻进行聚类,得到K个第一新闻组,包括:The method according to claim 5, wherein said clustering said M pieces of news to obtain K first newsgroups comprises:对所述M篇新闻中的每篇新闻进行语义信息提取,得到所述每篇新闻的语义向量,其中,所述每篇新闻的语义向量用于表征所述每篇新闻所描述的新闻事件;Performing semantic information extraction on each of the M pieces of news to obtain a semantic vector of each of the news, wherein the semantic vector of each of the news is used to represent the news events described in each of the news;确定所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度;determining the first similarity between the semantic vectors of any two news in the M news;根据所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组中任意两篇新闻的语义向量之间的第一相似度大于第一阈值。According to the first similarity between the semantic vectors of any two news in the M news, the M news are clustered to obtain K first news groups, wherein, among the K first news groups The first similarity between the semantic vectors of any two news articles in each first news group is greater than the first threshold.
- 根据权利要求6所述的方法,其中,所述对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,包括:The method according to claim 6, wherein said clustering one or more news articles in each first news group obtains L second news articles corresponding to each first news group group, including:确定所述每个第一新闻组中任意两篇新闻之间的第二相似度,其中,所述每个第一新闻组中任意两篇新闻之间的第二相似度用于表征所述每个第一新闻组中任意两篇新闻的新闻内容之间的相似度;determining a second similarity between any two news articles in each first news group, wherein the second similarity between any two news articles in each first news group is used to characterize each The similarity between the news content of any two news articles in the first news group;根据所述每个第一新闻组中任意两篇新闻之间的第二相似度对所述每个第一新闻组包括的一篇或多篇新闻进行聚类,得到L个第二新闻组,其中,每个第二新闻组中任意两篇新闻之间的第二相似度大于第二阈值。According to the second similarity between any two news articles in each first news group, one or more news articles included in each first news group are clustered to obtain L second news groups, Wherein, the second similarity between any two news articles in each second news group is greater than a second threshold.
- 一种企业ESG指数确定装置,包括:A device for determining an enterprise ESG index, comprising:获取单元,用于获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;The obtaining unit is used to obtain M pieces of news of the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;处理单元,用于对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;A processing unit, configured to cluster the M pieces of news to obtain K first news groups, wherein each first news group in the K first news groups corresponds to a news event, and each of the K first news groups corresponds to a news event. The first news group includes one or more news articles in the M news articles, K is an integer greater than or equal to 1;对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;发送单元,用于向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。The sending unit is configured to send the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
- 一种电子设备,包括:处理器和存储器,所述处理器与所述存储器相连,所述存储器用于存储计算机程序,所述处理器用于执行所述存储器中存储的计算机程序,以使得所述电子设备执行以下方法:An electronic device, comprising: a processor and a memory, the processor is connected to the memory, the memory is used to store a computer program, and the processor is used to execute the computer program stored in the memory, so that the The electronic device implements the following methods:获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;Obtain M pieces of news about the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;Clustering the M pieces of news to obtain K first newsgroups, wherein each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。Sending the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
- 根据权利要求9所述的电子设备,其中,执行所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,包括:The electronic device according to claim 9, wherein, performing said one piece of original news and H pieces of reposted news included in each second newsgroup in said L second newsgroups, determine said each first newsgroup The target public opinion score of a news event corresponding to a news group includes:根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例,所述每个第一新闻组对应的新闻事件的缩放比例表征了社会对所述每个第一新闻组对应的新闻事件的关注度;According to an original news included in each of the L second newsgroups, and the number H of reprinted news, determine the scaling ratio of the news event corresponding to each of the first newsgroups, the The scaling ratio of the news event corresponding to each first news group represents the degree of social attention to the news event corresponding to each first news group;根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分;According to the news content of an original news and the news content of H reprinted news included in each second news group in the L second news groups, determine the original news event corresponding to each of the first news groups Score of public opinion;对所述每个第一新闻组对应的新闻事件的缩放比例以及原始舆情评分进行乘积处理,得到所述每个第一新闻组对应的新闻事件的目标舆情评分。Perform product processing on the scaling ratio of the news event corresponding to each first news group and the original public opinion score to obtain the target public opinion score of the news event corresponding to each first news group.
- 根据权利要求10所述的电子设备,其中,执行所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例,包括:The electronic device according to claim 10, wherein performing said determining said each news according to an original news included in each of said L second newsgroups and the number H of reprinted news The zoom ratio of the news event corresponding to the first newsgroup, including:获取所述每个第二新闻组中的原创新闻的预设比例,所述预设比例表征了所述原创新闻在社会对所述每个第一新闻组对应的新闻事件的关注度上的贡献度;Obtaining a preset ratio of original news in each second news group, where the preset ratio represents the contribution of the original news to social attention to news events corresponding to each first news group Spend;根据所述每个第二新闻组中的原创新闻的预设比例以及所述每个第二新闻组包括的转载新闻的数量H,确定所述每个第二新闻组的缩放比例;Determine the scaling ratio of each second news group according to the preset ratio of original news in each second news group and the number H of reprinted news included in each second news group;对所述L个第二新闻组的缩放比例进行求和,得到所述每个第一新闻组对应的新闻事件的缩放比例。The scaling ratios of the L second newsgroups are summed to obtain the scaling ratios of the news events corresponding to each of the first newsgroups.
- 根据权利要求10或11所述的电子设备,其中,执行所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分,包括:The electronic device according to claim 10 or 11, wherein, executing the news content of an original news and the news content of H reprinted news according to each second news group in the L second news groups. Content, determining the original public opinion score of the news event corresponding to each first news group, including:对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签,其中,所述每个第一新闻组对应的情感标签用于表征所述每个第一新闻组对应的新闻事件为正面新闻事件或者负面新闻事件;Perform emotion recognition on the news in each first news group to obtain an emotion tag corresponding to each first news group, wherein the emotion tag corresponding to each first news group is used to represent each The news event corresponding to the first news group is a positive news event or a negative news event;获取所述每个第一新闻组中的每篇新闻的发表媒体;Acquiring the publishing media of each news item in each of the first newsgroups;根据所述每个第一新闻组中的每篇新闻的发表媒体,确定所述每个第一新闻组中的最 高级别的发表媒体;According to the publishing media of each piece of news in each of the first newsgroups, determine the highest-level publishing media in each of the first newsgroups;根据发表媒体、情感标签与舆情评分之间的映射关系,以及所述每个第一新闻组中的最高级别的发表媒体和所述每个第一新闻组对应的情感标签,确定所述每个第一新闻组对应的新闻事件的原始舆情评分。According to the mapping relationship between published media, emotional tags and public opinion scores, and the highest-level published media in each first news group and the corresponding emotional tags of each first news group, determine each The original public opinion score of the news event corresponding to the first news group.
- 根据权利要求12所述的电子设备,其中,执行所述对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签,包括:The electronic device according to claim 12, wherein performing the emotion recognition on the news in each of the first newsgroups to obtain the emotion tags corresponding to each of the first newsgroups includes:从所述L个第二新闻组随机选择一个第二新闻组,并对所述第二新闻组中的任意一篇新闻进行情感识别,得到与该篇新闻的情感标签,将该篇新闻的情感标签作为所述每个第一新闻组对应的情感标签;Randomly select a second news group from the L second news groups, and carry out emotional recognition to any news in the second news group, obtain the emotional label with the news, and use the emotional label of the news The label is used as the sentiment label corresponding to each of the first newsgroups;或者,or,从所述每个第一新闻组包括的一篇或多篇新闻中随机选择一篇新闻,并对该篇新闻进行情感识别,得到与该篇新闻对应的情感标签,将该篇新闻对应的情感标签作为所述每个第一新闻组对应的情感标签。Randomly select a piece of news from one or more pieces of news included in each of the first newsgroups, and carry out emotion recognition to the piece of news, obtain the emotion label corresponding to the piece of news, and the emotion corresponding to the piece of news The label serves as the sentiment label corresponding to each first news group.
- 根据权利要求13所述的电子设备,其中,执行所述对所述M篇新闻进行聚类,得到K个第一新闻组,包括:The electronic device according to claim 13, wherein performing the clustering of the M pieces of news to obtain K first newsgroups, comprising:对所述M篇新闻中的每篇新闻进行语义信息提取,得到所述每篇新闻的语义向量,其中,所述每篇新闻的语义向量用于表征所述每篇新闻所描述的新闻事件;Performing semantic information extraction on each of the M pieces of news to obtain a semantic vector of each of the news, wherein the semantic vector of each of the news is used to represent the news events described in each of the news;确定所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度;determining the first similarity between the semantic vectors of any two news in the M news;根据所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组中任意两篇新闻的语义向量之间的第一相似度大于第一阈值。According to the first similarity between the semantic vectors of any two news in the M news, the M news are clustered to obtain K first news groups, wherein, among the K first news groups The first similarity between the semantic vectors of any two news articles in each first news group is greater than the first threshold.
- 一种计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行以实现以下方法:A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and the computer program is executed by a processor to implement the following method:获取待评价企业在预设时间段内的M篇新闻,其中,M为大于或者等于1的整数;Obtain M pieces of news about the enterprise to be evaluated within a preset time period, where M is an integer greater than or equal to 1;对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组对应一个新闻事件,所述每个第一新闻组包括所述M篇新闻中的一篇或多篇新闻,K为大于或者等于1的整数;Clustering the M pieces of news to obtain K first newsgroups, wherein each first newsgroup in the K first newsgroups corresponds to a news event, and each first newsgroup includes One or more news articles in the M news articles, K is an integer greater than or equal to 1;对所述每个第一新闻组中一篇或多篇新闻进行聚类,得到与所述每个第一新闻组对应的L个第二新闻组,其中,所述每个第二新闻组包括一篇原创新闻以及与所述原创新闻对应的H篇转载新闻,H为大于或者等于0的整数;Clustering one or more news articles in each of the first newsgroups to obtain L second newsgroups corresponding to each of the first newsgroups, wherein each of the second newsgroups includes One piece of original news and H pieces of reprinted news corresponding to the original news, where H is an integer greater than or equal to 0;根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,所述目标舆情评分用于表征所述每个第一新闻组对应的新闻事件对所述待评价企业的影响力;According to an original news and H pieces of reposted news included in each of the L second newsgroups, determine the target public opinion score of the news event corresponding to each of the first newsgroups, the target The public opinion score is used to characterize the influence of the news event corresponding to each first news group on the enterprise to be evaluated;根据所述每个第一新闻组对应的新闻事件的目标舆情评分,对所述待评价企业进行ESG评价,得到所述待评价企业的ESG指数;According to the target public opinion score of the news event corresponding to each first news group, ESG evaluation is performed on the enterprise to be evaluated, and the ESG index of the enterprise to be evaluated is obtained;向目标设备发送所述待评价企业的ESG指数,以便所述目标设备的用户根据所述待评价企业的ESG指数制定与所述待评价企业相关的投资决策。Sending the ESG index of the enterprise to be evaluated to the target device, so that the user of the target device makes an investment decision related to the enterprise to be evaluated according to the ESG index of the enterprise to be evaluated.
- 根据权利要求15所述的计算机可读存储介质,其中,执行所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻以及H篇转载新闻,确定所述每个第一新闻组对应的新闻事件的目标舆情评分,包括:The computer-readable storage medium according to claim 15, wherein, performing the said one piece of original news and H pieces of reprinted news included in each of said L second newsgroups, determining said The target public opinion score of the news event corresponding to each first news group, including:根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例,所述每个第一新闻组对应的新闻事件的缩放比例表征了社会对所述每个第一新闻组对应的新闻事件的关注度;According to an original news included in each of the L second newsgroups, and the number H of reprinted news, determine the scaling ratio of the news event corresponding to each of the first newsgroups, the The scaling ratio of the news event corresponding to each first news group represents the degree of social attention to the news event corresponding to each first news group;根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H 篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分;According to the news content of an original news and the news content of H reposted news included in each second news group in the L second news groups, determine the original news event corresponding to each first news group Score of public opinion;对所述每个第一新闻组对应的新闻事件的缩放比例以及原始舆情评分进行乘积处理,得到所述每个第一新闻组对应的新闻事件的目标舆情评分。Perform product processing on the scaling ratio of the news event corresponding to each first news group and the original public opinion score to obtain the target public opinion score of the news event corresponding to each first news group.
- 根据权利要求16所述的计算机可读存储介质,其中,执行所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻,以及转载新闻的数量H,确定所述每个第一新闻组对应的新闻事件的缩放比例,包括:The computer-readable storage medium according to claim 16, wherein, performing said determining according to an original news included in each of the L second newsgroups and the number H of reprinted news The scaling ratio of the news event corresponding to each first news group includes:获取所述每个第二新闻组中的原创新闻的预设比例,所述预设比例表征了所述原创新闻在社会对所述每个第一新闻组对应的新闻事件的关注度上的贡献度;Obtaining a preset ratio of original news in each second news group, where the preset ratio represents the contribution of the original news to social attention to news events corresponding to each first news group Spend;根据所述每个第二新闻组中的原创新闻的预设比例以及所述每个第二新闻组包括的转载新闻的数量H,确定所述每个第二新闻组的缩放比例;Determine the scaling ratio of each second news group according to the preset ratio of original news in each second news group and the number H of reprinted news included in each second news group;对所述L个第二新闻组的缩放比例进行求和,得到所述每个第一新闻组对应的新闻事件的缩放比例。The scaling ratios of the L second newsgroups are summed to obtain the scaling ratios of the news events corresponding to each of the first newsgroups.
- 根据权利要求16或17所述的计算机可读存储介质,其中,执行所述根据所述L个第二新闻组中的每个第二新闻组包括的一篇原创新闻的新闻内容以及H篇转载新闻的新闻内容,确定所述每个第一新闻组对应的新闻事件的原始舆情评分,包括:The computer-readable storage medium according to claim 16 or 17, wherein, performing the news content of an original news and the reprinting of H articles according to each second newsgroup in the L second newsgroups The news content of the news determines the original public opinion score of the news event corresponding to each first news group, including:对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签,其中,所述每个第一新闻组对应的情感标签用于表征所述每个第一新闻组对应的新闻事件为正面新闻事件或者负面新闻事件;Perform emotion recognition on the news in each first news group to obtain an emotion tag corresponding to each first news group, wherein the emotion tag corresponding to each first news group is used to represent each The news event corresponding to the first news group is a positive news event or a negative news event;获取所述每个第一新闻组中的每篇新闻的发表媒体;Acquiring the publishing media of each news item in each of the first newsgroups;根据所述每个第一新闻组中的每篇新闻的发表媒体,确定所述每个第一新闻组中的最高级别的发表媒体;determining the highest-level publishing media in each of the first newsgroups according to the publishing media of each news in each of the first newsgroups;根据发表媒体、情感标签与舆情评分之间的映射关系,以及所述每个第一新闻组中的最高级别的发表媒体和所述每个第一新闻组对应的情感标签,确定所述每个第一新闻组对应的新闻事件的原始舆情评分。According to the mapping relationship between published media, emotional tags and public opinion scores, and the highest-level published media in each first news group and the corresponding emotional tags of each first news group, determine each The original public opinion score of the news event corresponding to the first news group.
- 根据权利要求18所述的计算机可读存储介质,其中,执行所述对所述每个第一新闻组中的新闻进行情感识别,得到所述每个第一新闻组对应的情感标签,包括:The computer-readable storage medium according to claim 18, wherein performing the emotion recognition on the news in each of the first newsgroups to obtain the emotion tags corresponding to each of the first newsgroups includes:从所述L个第二新闻组随机选择一个第二新闻组,并对所述第二新闻组中的任意一篇新闻进行情感识别,得到与该篇新闻的情感标签,将该篇新闻的情感标签作为所述每个第一新闻组对应的情感标签;Randomly select a second news group from the L second news groups, and carry out emotional recognition to any news in the second news group, obtain the emotional label with the news, and use the emotional label of the news The label is used as the sentiment label corresponding to each of the first newsgroups;或者,or,从所述每个第一新闻组包括的一篇或多篇新闻中随机选择一篇新闻,并对该篇新闻进行情感识别,得到与该篇新闻对应的情感标签,将该篇新闻对应的情感标签作为所述每个第一新闻组对应的情感标签。Randomly select a piece of news from one or more pieces of news included in each of the first newsgroups, and carry out emotion recognition to the piece of news, obtain the emotion label corresponding to the piece of news, and the emotion corresponding to the piece of news The label serves as the sentiment label corresponding to each first newsgroup.
- 根据权利要求19所述的计算机可读存储介质,其中,执行所述对所述M篇新闻进行聚类,得到K个第一新闻组,包括:The computer-readable storage medium according to claim 19, wherein performing the clustering of the M articles of news obtains K first newsgroups, comprising:对所述M篇新闻中的每篇新闻进行语义信息提取,得到所述每篇新闻的语义向量,其中,所述每篇新闻的语义向量用于表征所述每篇新闻所描述的新闻事件;Performing semantic information extraction on each of the M pieces of news to obtain a semantic vector of each of the news, wherein the semantic vector of each of the news is used to represent the news events described in each of the news;确定所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度;determining the first similarity between the semantic vectors of any two news in the M news;根据所述M篇新闻中任意两篇新闻的语义向量之间的第一相似度对所述M篇新闻进行聚类,得到K个第一新闻组,其中,所述K个第一新闻组中的每个第一新闻组中任意两篇新闻的语义向量之间的第一相似度大于第一阈值。According to the first similarity between the semantic vectors of any two news in the M news, the M news are clustered to obtain K first news groups, wherein, among the K first news groups The first similarity between the semantic vectors of any two news articles in each first news group is greater than the first threshold.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110748931.XA CN113190683B (en) | 2021-07-02 | 2021-07-02 | Enterprise ESG index determination method based on clustering technology and related product |
CN202110748931.X | 2021-07-02 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023273295A1 true WO2023273295A1 (en) | 2023-01-05 |
Family
ID=76976971
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2022/071253 WO2023273295A1 (en) | 2021-07-02 | 2022-01-11 | Enterprise esg index determination method based on clustering technology, and related product |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN113190683B (en) |
WO (1) | WO2023273295A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113190683B (en) * | 2021-07-02 | 2021-09-17 | 平安科技(深圳)有限公司 | Enterprise ESG index determination method based on clustering technology and related product |
CN113570281A (en) * | 2021-08-20 | 2021-10-29 | 瑞格人工智能科技有限公司 | ESG index compiling method |
CN113837630A (en) * | 2021-09-28 | 2021-12-24 | 平安科技(深圳)有限公司 | Text recognition-based ESG index determination method in area and related product |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130226933A1 (en) * | 2012-02-24 | 2013-08-29 | Strategic Communication Advisors, Llc | System and method for assessing and ranking newsworthiness |
CN104317881A (en) * | 2014-04-11 | 2015-01-28 | 北京理工大学 | Method for reordering microblogs on basis of authorities of users' topics |
US20170004128A1 (en) * | 2015-07-01 | 2017-01-05 | Institute for Sustainable Development | Device and method for analyzing reputation for objects by data mining |
CN106776609A (en) * | 2015-11-19 | 2017-05-31 | 北京国双科技有限公司 | Reprint the statistical method and device of quantity in website |
CN108804701A (en) * | 2018-06-19 | 2018-11-13 | 苏州大学 | Personage's portrait model building method based on social networks big data |
CN109359857A (en) * | 2018-10-12 | 2019-02-19 | 网智天元科技集团股份有限公司 | A kind of influence of media force estimation method, apparatus and electronic equipment |
CN109508373A (en) * | 2018-11-13 | 2019-03-22 | 深圳前海微众银行股份有限公司 | Calculation method, equipment and the computer readable storage medium of enterprise's public opinion index |
CN111177508A (en) * | 2019-10-16 | 2020-05-19 | 李多 | Spreading index query method and device for specified news topic and storage medium |
CN111630518A (en) * | 2017-11-23 | 2020-09-04 | 持续可能发展所株式会社 | ESG-based enterprise evaluation execution device and operation method thereof |
CN112613296A (en) * | 2020-12-07 | 2021-04-06 | 深圳价值在线信息科技股份有限公司 | News importance degree acquisition method and device, terminal equipment and storage medium |
CN113190683A (en) * | 2021-07-02 | 2021-07-30 | 平安科技(深圳)有限公司 | Enterprise ESG index determination method based on clustering technology and related product |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101000627B (en) * | 2007-01-15 | 2010-05-19 | 北京搜狗科技发展有限公司 | Method and device for issuing correlation information |
US20120296845A1 (en) * | 2009-12-01 | 2012-11-22 | Andrews Sarah L | Methods and systems for generating composite index using social media sourced data and sentiment analysis |
CN103226577A (en) * | 2013-04-01 | 2013-07-31 | 儒豹(苏州)科技有限责任公司 | News clustering method |
CN110399478A (en) * | 2018-04-19 | 2019-11-01 | 清华大学 | Event finds method and apparatus |
CN108875018B (en) * | 2018-06-19 | 2021-06-29 | 网智天元科技集团股份有限公司 | News influence evaluation method and device and electronic equipment |
CN110472884A (en) * | 2019-08-20 | 2019-11-19 | 深圳前海微众银行股份有限公司 | ESG index monitoring method, device, terminal device and storage medium |
CN110532357B (en) * | 2019-09-04 | 2024-03-12 | 深圳前海微众银行股份有限公司 | ESG scoring system generation method, device, equipment and readable storage medium |
CN112418591A (en) * | 2020-09-22 | 2021-02-26 | 中财绿指(北京)信息咨询有限公司 | Enterprise ESG (electronic service guide) rating method based on weight distribution model |
-
2021
- 2021-07-02 CN CN202110748931.XA patent/CN113190683B/en active Active
-
2022
- 2022-01-11 WO PCT/CN2022/071253 patent/WO2023273295A1/en active Application Filing
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130226933A1 (en) * | 2012-02-24 | 2013-08-29 | Strategic Communication Advisors, Llc | System and method for assessing and ranking newsworthiness |
CN104317881A (en) * | 2014-04-11 | 2015-01-28 | 北京理工大学 | Method for reordering microblogs on basis of authorities of users' topics |
US20170004128A1 (en) * | 2015-07-01 | 2017-01-05 | Institute for Sustainable Development | Device and method for analyzing reputation for objects by data mining |
CN106776609A (en) * | 2015-11-19 | 2017-05-31 | 北京国双科技有限公司 | Reprint the statistical method and device of quantity in website |
CN111630518A (en) * | 2017-11-23 | 2020-09-04 | 持续可能发展所株式会社 | ESG-based enterprise evaluation execution device and operation method thereof |
CN108804701A (en) * | 2018-06-19 | 2018-11-13 | 苏州大学 | Personage's portrait model building method based on social networks big data |
CN109359857A (en) * | 2018-10-12 | 2019-02-19 | 网智天元科技集团股份有限公司 | A kind of influence of media force estimation method, apparatus and electronic equipment |
CN109508373A (en) * | 2018-11-13 | 2019-03-22 | 深圳前海微众银行股份有限公司 | Calculation method, equipment and the computer readable storage medium of enterprise's public opinion index |
CN111177508A (en) * | 2019-10-16 | 2020-05-19 | 李多 | Spreading index query method and device for specified news topic and storage medium |
CN112613296A (en) * | 2020-12-07 | 2021-04-06 | 深圳价值在线信息科技股份有限公司 | News importance degree acquisition method and device, terminal equipment and storage medium |
CN113190683A (en) * | 2021-07-02 | 2021-07-30 | 平安科技(深圳)有限公司 | Enterprise ESG index determination method based on clustering technology and related product |
Also Published As
Publication number | Publication date |
---|---|
CN113190683A (en) | 2021-07-30 |
CN113190683B (en) | 2021-09-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2023273295A1 (en) | Enterprise esg index determination method based on clustering technology, and related product | |
US8326777B2 (en) | Supplementing a trained model using incremental data in making item recommendations | |
US9703877B2 (en) | Computer-based evaluation tool for selecting personalized content for users | |
CN108460082B (en) | Recommendation method and device and electronic equipment | |
US8352386B2 (en) | Identifying training documents for a content classifier | |
CN103336793B (en) | A kind of personalized article recommends method and system thereof | |
CN102929861B (en) | Method and system for calculating text emotion index | |
CN104252456B (en) | A kind of weight method of estimation, apparatus and system | |
CN109299245B (en) | Method and device for recalling knowledge points | |
CN105917364B (en) | Ranking discussion topics in question-and-answer forums | |
JP2009211697A (en) | Information distribution system and information distribution method | |
CN108664515B (en) | A kind of searching method and device, electronic equipment | |
CN104081385A (en) | Representing information from documents | |
CN107194672B (en) | Review distribution method integrating academic expertise and social network | |
US20200074509A1 (en) | Business data promotion method, device, terminal and computer-readable storage medium | |
CN109492082A (en) | Pull down words recommending method, device, electronic equipment and storage medium | |
CN110838043A (en) | Commodity recommendation method and device | |
WO2023050652A1 (en) | Text recognition-based method for determining esg index in region, and related product | |
CN110826315A (en) | Method for identifying timeliness of short text by using neural network system | |
CN114969566A (en) | Distance-measuring government affair service item collaborative filtering recommendation method | |
Liang et al. | Detecting novel business blogs | |
CN112434126B (en) | Information processing method, device, equipment and storage medium | |
CN112330426A (en) | Product recommendation method, device and storage medium | |
CN109033050B (en) | article generation method, article generation device and storage medium | |
CN113781134B (en) | Article recommendation method, apparatus and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22831131 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 22831131 Country of ref document: EP Kind code of ref document: A1 |