CN107644084B - Method and apparatus for generating information - Google Patents

Method and apparatus for generating information Download PDF

Info

Publication number
CN107644084B
CN107644084B CN201710867243.9A CN201710867243A CN107644084B CN 107644084 B CN107644084 B CN 107644084B CN 201710867243 A CN201710867243 A CN 201710867243A CN 107644084 B CN107644084 B CN 107644084B
Authority
CN
China
Prior art keywords
information
target information
detected
reverse
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710867243.9A
Other languages
Chinese (zh)
Other versions
CN107644084A (en
Inventor
聂晓萌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201710867243.9A priority Critical patent/CN107644084B/en
Publication of CN107644084A publication Critical patent/CN107644084A/en
Application granted granted Critical
Publication of CN107644084B publication Critical patent/CN107644084B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

Methods and apparatus for generating information are disclosed. One embodiment of the method comprises: acquiring a target information set of information to be detected; acquiring feature keywords of each item label information in the target information set, wherein the feature keywords are used for representing the support degree of the target information on the authenticity of the information to be detected, and comprise positive feature keywords and negative feature keywords; determining whether the information to be detected is first-class information or not based on the characteristic keywords contained in each item label information in the target information set; and in response to determining that the information to be detected is the first type of information, determining target information which contains the reverse side characteristic keywords in the target information set as reverse side target information of the information to be detected. The embodiment realizes the judgment of the authenticity of the information to be detected and obtains the reverse target information of the information to be detected.

Description

Method and apparatus for generating information
Technical Field
The present application relates to the field of computer technologies, and in particular, to the field of internet technologies, and in particular, to a method and an apparatus for generating information.
Background
With the development and popularity of the internet, more and more users are getting used to obtain information through the internet, for example, users can inquire about information that they want to know through a search engine. However, there are a lot of rumor information on the internet, for example, the hotspot information "holding chinese passport cannot sit on plane in China", "bitcoin luxo virus invades apple phone", "one drop of blood can measure cancer" and so on are the rumor information, such as the rumor information is forwarded and shared in a lot, so that they are ranked high in the search engine and often arranged in the first few positions of the search result, and the user often considers that the reliability of the information arranged in front of the search result is high, and therefore, the rumor is trusted lightly. The spread of rumor information pollutes the network environment, disturbs the social order and may even cause some serious consequences which cannot be predicted and recovered. At present, a method for judging the authenticity of information transmitted on a network and generating the rumor splitting information aiming at unrealistic information is lacked.
Disclosure of Invention
It is an object of the present application to propose an improved method and apparatus for generating information to solve the technical problems mentioned in the background section above.
In a first aspect, an embodiment of the present application provides a method for generating information, where the method includes: acquiring a target information set of information to be detected, wherein the target information in the target information set is webpage information related to the information to be detected; acquiring feature keywords of each item label information in the target information set, wherein the feature keywords are used for representing the support degree of the target information on the authenticity of the information to be detected, and comprise positive feature keywords and negative feature keywords; determining whether the information to be detected is first-class information or not based on the characteristic keywords contained in each item label information in the target information set; and in response to determining that the information to be detected is the first type of information, determining target information which contains the reverse side characteristic keywords in the target information set as reverse side target information of the information to be detected.
In some embodiments, the above method further comprises: and summarizing the reverse target information of at least one piece of information to be detected to generate a reverse target information set.
In some embodiments, the above method further comprises: obtaining at least one search result according to the information for searching received from the terminal; matching the search information and the search result in the at least one search result with the reverse target information in the reverse target information set respectively; and responding to the matching between the information for searching and/or the search result in the at least one search result and the reverse target information in the reverse target information set, and pushing prompt information to the terminal.
In some embodiments, the above method further comprises: receiving a prompt information viewing request sent by the terminal, wherein the prompt information viewing request is generated by the terminal according to a viewing operation executed by a user aiming at the prompt information; and acquiring reverse target information from the reverse target information set according to the prompt information viewing request, and sending the reverse target information to the terminal for display by the terminal.
In some embodiments, the pushing prompt information to the terminal in response to the matching between the information for search and/or the search result in the at least one search result and the reverse target information in the reverse target information set includes: responding to the matching of the information for searching and the reverse target information in the reverse target information set, and pushing first prompt information for displaying in a search box of the terminal to the terminal; and responding to the matching of the search result in the at least one search result and the reverse side target information in the reverse side target information set, and pushing second prompt information for displaying on the search result page of the terminal to the terminal.
In some embodiments, the determining whether the information to be detected is the first type of information based on the feature keyword included in each item label information in the target information set includes: for each piece of target information in the target information set, performing the following operations: matching keywords contained in the entry label information with feature keywords in a preset feature keyword set to determine the feature keywords contained in the entry label information, wherein each feature keyword in the feature keyword set is preset with a score; determining a support degree score of the to-be-detected information based on a support degree score factor, wherein the support degree score factor includes at least one of: the score of the reverse side characteristic key words contained in the target information set, the source credibility of each target information and the number of pieces of target information containing the reverse side characteristic key words; and determining whether the information to be detected is the first type of information according to the support degree score of the information to be detected.
In some embodiments, the determining whether the information to be detected is the first type information according to the support degree score of the information to be detected includes: and judging whether the support degree score of the information to be detected is lower than a preset threshold value or not, and if so, determining that the information to be detected is the first type of information.
In a second aspect, an embodiment of the present application provides an apparatus for generating information, where the apparatus includes: the system comprises a first acquisition unit, a second acquisition unit and a third acquisition unit, wherein the first acquisition unit is used for acquiring a target information set of information to be detected, and the target information in the target information set is webpage information related to the information to be detected; a second obtaining unit, configured to obtain a feature keyword of each item label information in the target information set, where the feature keyword is used to represent a support degree of target information on authenticity of the to-be-detected information, and the feature keyword includes a positive feature keyword and a negative feature keyword; a first determining unit, configured to determine whether the to-be-detected information is first-class information based on a feature keyword included in each item label information in the target information set; and the second determining unit is used for determining the target information containing the reverse side characteristic key words in the target information set as the reverse side target information of the information to be detected in response to the fact that the information to be detected is determined to be the first type of information.
In some embodiments, the above apparatus further comprises: and the generating unit is used for summarizing the reverse target information of at least one piece of information to be detected and generating a reverse target information set.
In some embodiments, the above apparatus further comprises: a first receiving unit for obtaining at least one search result according to the information for search received from the terminal; a first matching unit, configured to match the search information and a search result of the at least one search result with the reverse-side target information in the reverse-side target information set, respectively; and the pushing unit is used for responding to the matching of the searching information and/or the searching result in the at least one searching result and the reverse target information in the reverse target information set, and pushing prompt information to the terminal.
In some embodiments, the above apparatus further comprises: a second receiving unit, configured to receive a prompt information viewing request sent by the terminal, where the prompt information viewing request is generated by the terminal according to a viewing operation performed by a user for the prompt information; and the sending unit is used for acquiring the reverse target information from the reverse target information set according to the prompt information viewing request and sending the reverse target information to the terminal so as to be displayed by the terminal.
In some embodiments, the pushing unit includes: a first prompt information pushing unit, configured to, in response to that the information for search matches with the reverse-side target information in the reverse-side target information set, push first prompt information for display in a search box of the terminal to the terminal; and the second prompt information pushing unit is used for responding to the matching of the search result in the at least one search result and the reverse target information in the reverse target information set, and pushing second prompt information for displaying on the search result page of the terminal to the terminal.
In some embodiments, the first determining unit includes: a second matching unit, configured to perform the following operations for each piece of target information in the set of target information: matching keywords contained in the entry label information with feature keywords in a preset feature keyword set to determine the feature keywords contained in the entry label information, wherein each feature keyword in the feature keyword set is preset with a score; a third determining unit, configured to determine a support degree score of the to-be-detected information based on a support degree score factor, where the support degree score factor includes at least one of: the score of the reverse side characteristic key words contained in the target information set, the source credibility of each target information and the number of pieces of target information containing the reverse side characteristic key words; and the fourth determining unit is used for determining whether the information to be detected is the first type of information according to the support degree score of the information to be detected.
In some embodiments, the fourth determining unit is further configured to: and judging whether the support degree score of the information to be detected is lower than a preset threshold value or not, and if so, determining that the information to be detected is the first type of information.
In a third aspect, an embodiment of the present application provides a server, where the server includes: one or more processors; a storage device for storing one or more programs which, when executed by the one or more processors, cause the one or more processors to implement the method as described in any implementation manner of the first aspect.
In a fourth aspect, the present application provides a computer-readable storage medium, on which a computer program is stored, where the computer program is implemented, when executed by a processor, to implement the method described in any implementation manner of the first aspect.
The method and the device for generating information provided by the embodiment of the application firstly obtain a target information set of information to be detected, then obtain feature keywords of each item of label information in the target information set, wherein the feature keywords are used for representing the support degree of the target information on the authenticity of the information to be detected, then determine whether the information to be detected is first-class information based on the feature keywords contained in each item of label information in the target information set, and finally determine the target information containing reverse feature keywords in the target information set as reverse target information of the information to be detected in response to the determination that the information to be detected is first-class information, and obtaining the reverse target information of the information to be detected.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is an exemplary system architecture diagram in which the present application may be applied;
FIG. 2 is a flow diagram of one embodiment of a method for generating information according to the present application;
FIG. 3 is a schematic illustration of an application scenario of a method for generating information according to the present application;
FIG. 4 is a schematic block diagram illustrating one embodiment of an apparatus for generating information according to the present application;
FIG. 5 is a block diagram of a computer system suitable for use in implementing a server according to embodiments of the present application.
Detailed Description
The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
Fig. 1 shows an exemplary system architecture 100 to which embodiments of the method for generating information or the apparatus for generating information of the present application may be applied.
As shown in fig. 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The network 104 serves as a medium for providing communication links between the terminal devices 101, 102, 103 and the server 105. Network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, to name a few.
The user may use the terminal devices 101, 102, 103 to interact with the server 105 via the network 104 to receive or send messages or the like. The terminal devices 101, 102, 103 may have various client applications installed thereon, such as a web browser application, a shopping-like application, a search-like application, an instant messaging tool, a mailbox client, social platform software, and the like.
The terminal devices 101, 102, 103 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, e-book readers, MP3 players (Moving Picture Experts Group Audio Layer III, mpeg compression standard Audio Layer 3), MP4 players (Moving Picture Experts Group Audio Layer IV, mpeg compression standard Audio Layer 4), laptop portable computers, desktop computers, and the like.
The server 105 may be a server that provides various services, such as a data analysis server that performs data analysis. The data analysis server can analyze and process the search behavior data of the network user, judge the authenticity of the information to be detected and generate the rumor splitting information aiming at the unreal information.
It should be noted that the method for generating information provided in the embodiment of the present application is generally performed by the server 105, and accordingly, the apparatus for generating information is generally disposed in the server 105.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
With continued reference to FIG. 2, a flow 200 of one embodiment of a method for generating information in accordance with the present application is shown. The method for generating information comprises the following steps:
step 201, a target information set of information to be detected is obtained.
In the present embodiment, an electronic device (e.g., the server 105 shown in fig. 1) on which the method for generating information operates may acquire, as target information, web page information related to information to be detected from the internet through a web crawler. Here, the information to be detected may be manually set information, for example, a word or a piece of text manually set. The information to be detected may also be information obtained by analyzing, by the electronic device, search behavior data of a large number of netizens within a set time period, for example, statistical analysis may be performed on search keywords used by a large number of network users within the set time period, and the search keywords with the search times ranked in front and web page information (for example, web page keywords, titles of contents displayed on web pages, summaries of contents displayed on web pages, and the like) of search results of the search keywords may be used as the information to be detected. Optionally, the information to be detected may include at least one of the following: the information for searching the netizen (for example, a search keyword, a search sentence, etc.), a website link of a search result corresponding to the information for searching the netizen, and webpage information corresponding to the website link of the search result corresponding to the information for searching the netizen, wherein the webpage information may include, but is not limited to, a webpage keyword, a title of content displayed on a webpage, and a content summary displayed on the webpage.
Step 202, obtaining the feature keywords of each item label information in the target information set.
In this embodiment, the electronic device may acquire the feature keyword of each item tag information in the target information set in various manners, for example, the electronic device may compare a phrase included in each item tag information in the target information set with a preset feature keyword set, so as to acquire the feature keyword of each item tag information. The feature keywords can be used for representing the support degree of the target information on the authenticity of the information to be detected, and the feature keywords can comprise positive feature keywords and negative feature keywords. The positive feature keyword may be words such as "correct", "reliable", "true", and the like, which support the authenticity of the information to be detected, and the negative feature keyword may be words such as "ballad", "false", "unreliable", "misread", and the like, which contradict the authenticity of the information to be detected.
Step 203, determining whether the information to be detected is the first type of information based on the feature keywords contained in each item label information in the target information set.
In this embodiment, the electronic device may determine whether the information to be detected is first-class information according to a feature keyword included in each item tag information in a target information set, where the first-class information may be rumor information, for example, the electronic device may perform statistical analysis on target information including a positive feature keyword and target information including a negative feature keyword in the target information set, and determine whether the information to be detected is rumor information according to a result of the statistical analysis, for example, when it is determined that a ratio of the target information including the negative feature keyword in the target information set exceeds a preset ratio threshold according to the statistical result, the information to be detected may be the rumor information.
In some optional implementation manners of this embodiment, step 203 may specifically include:
first, for each piece of target information in the set of target information, the electronic device may perform the following operations: matching the keywords contained in the entry label information with the feature keywords in a preset feature keyword set, and determining the feature keywords contained in the entry label information, wherein each feature keyword in the feature keyword set is preset with a score, and the feature keywords in the feature keyword set can be manually set according to actual needs. Here, scores may be set in advance for the respective feature keywords in the feature keyword set, for example, scores of the negative feature keywords may be set to be negative numbers, and different scores may be set for different negative feature keywords, for example, a more objectionable negative feature keyword such as "barter", "false", etc. may be set to be a lower score, and a less objectionable negative feature keyword such as "misread", "doubtful", etc. may be set to be a relatively higher score. For another example, the score of the positive feature keyword may be set to be a positive number, and different scores may be set for different positive feature keywords, for example, a positive feature keyword with a higher support degree, such as "correct" or "true", may have a higher score, and a positive feature keyword with a lower support degree, such as "may be", may have a relatively lower score;
the electronic device may then determine the support degree score of the to-be-detected information based on the support degree score factors, and may calculate (e.g., using a weighted calculation method, etc.) the score corresponding to each support degree score factor to obtain the support degree score of the to-be-detected information. Wherein the support degree score factor comprises at least one of: the score of the reverse feature keyword included in the target information set, the source reliability of each target information, and the number of pieces of target information including the reverse feature keyword may be set, and the source reliability may be a numerical value for each target information, and for example, the source reliability of the target information acquired from an official website (e.g., official website of civil department, official website of business department) of each government and each organization may be set to a high score, the source reliability of the target information acquired from each media website (e.g., website of newseine, japanese people, news of new and unrestrained, etc.) may be set to a medium score, and the source reliability of the target information acquired from a social network (e.g., website of newsband the like) may be set to a low score;
finally, the electronic device may determine whether the information to be detected is first type information according to the support degree score of the information to be detected, where the first type information may be rumor information.
In some optional implementation manners, the determining whether the information to be detected is the first type of information according to the support degree score of the information to be detected may specifically include: the electronic device may determine whether the score of the support degree of the to-be-detected information is lower than a preset threshold, and if so, determine that the to-be-detected information is the first type of information, where the preset threshold may be a threshold manually set according to actual needs.
And 204, in response to the fact that the information to be detected is determined to be the first type of information, determining the target information which contains the reverse side characteristic keywords in the target information set as reverse side target information of the information to be detected.
In this embodiment, in response to determining that the information to be detected is the first type of information, the electronic device may determine target information in the target information set, which includes a reverse feature keyword, as reverse target information of the information to be detected. The reverse target information can be regarded as the rumor splitting information of the information to be detected, and the electronic equipment can also extract related information such as titles, abstracts, website links and the like of the reverse target information.
In some optional implementations of this embodiment, the method may further include: the electronic equipment can collect the reverse target information of at least one piece of information to be detected to generate a reverse target information set.
In some optional implementations of this embodiment, the method may further include:
first, the electronic device may obtain at least one search result from information for search received from the terminal, where the information for search may be information for information search input by a user through the terminal, and may be, for example, a search word or a search sentence. Generally, after a user inputs information for searching into a search box, a server feeds back a search result page to a terminal according to the information for searching, wherein the search result page comprises a list of search results, and each search result generally comprises a title of a webpage of the search result, a link of the webpage of the search result, a short text abstract of the webpage matched with the information for searching, and the like.
Then, the electronic device may match the search result in the search information and the at least one search result with the reverse side target information in the reverse side target information set, respectively. Here, the electronic device may match one, several (for example, several arranged at a position before a search result list) or all of the information for search and the at least one search result with each piece of the reverse target information in the reverse target information set one by one, respectively. For example, the electronic device may compare the information for search with each piece of the reverse side object information in the reverse side object information set, and determine whether the reverse side object information matches the information for search according to a comparison result. For another example, the electronic device may compare the title of the search result in the at least one search result with the titles of the pieces of reverse side target information in the set of reverse side target information, and determine whether to match according to the comparison result.
And finally, responding to the matching between the information for searching and/or the search result in the at least one search result and the reverse target information in the reverse target information set, and pushing prompt information to the terminal.
In some optional implementation manners, the pushing prompt information to the terminal in response to the matching between the information for search and/or the search result in the at least one search result and the reverse side target information in the reverse side target information set may specifically include:
in response to the information for searching matching with the back target information in the set of back target information, the electronic device may push, to the terminal, first prompt information for displaying in a search box of the terminal, where the first prompt information may be information displayed in the search box of the terminal in the form of a floating layer, a button, or the like, and the floating layer may be a feedback information layer that disappears automatically after a period of time. The first prompt message may be used to prompt the user and guide the user to perform further information search, for example, when the server confirms that the information "a blood drop cancer" input by the user matches with the negative target information in the negative target information set, a message similar to "rumor prompt: there is a first prompt of < one drop of blood cancer for the ballad information ", and the user can click on the first prompt to obtain ballad information related to" one drop of blood cancer ".
In response to the search result in the at least one search result matching the reverse target information in the reverse target information set, the electronic device may push, to the terminal, second prompt information for displaying on the search result page of the terminal, where the second prompt information may be used to guide the user to replace the search result matching the reverse target information by performing a specific operation (e.g., a sliding operation, a mouse dragging operation, etc.), for example, when the server confirms that a certain search result in the at least one search result matches the reverse target information in the reverse target information set, the electronic device may push, to the terminal, information similar to that "the search result may be rumor information, and the user may slide and view the rumor information in the arrow direction", and cause the rumor information (or the title of the web page corresponding to the rumor information, to be displayed by performing the corresponding sliding operation, Summary, web address, etc.) replaces the search results.
In some optional implementations, the method may further include: the electronic equipment firstly receives a prompt information viewing request sent by the terminal, wherein the prompt information viewing request is generated by the terminal according to a viewing operation executed by a user aiming at the prompt information; then, the reverse side target information can be acquired from the reverse side target information set according to the prompt information viewing request and is sent to the terminal so as to be displayed by the terminal.
With continued reference to fig. 3, fig. 3 is a schematic diagram of an application scenario of the method for generating information according to the present embodiment. In the application scenario of fig. 3, the server first obtains a target information set of information to be detected; then, acquiring the characteristic key words of each item label information in the target information set; then, determining whether the information to be detected is rumor information based on the characteristic keywords contained in each item label information in the target information set; and finally, in response to the fact that the information to be detected is rumor information, determining target information which contains the reverse side characteristic keywords in the target information set as reverse side target information of the information to be detected. And summarizing the reverse target information of the plurality of pieces of information to be detected to generate a reverse target information set. When the user transmits the information "cancer one drop blood test" for search to the server via the terminal 301, the server matches the information for search and at least one search result obtained based on the information for search with the reverse target information in the reverse target information set. The server responds to the matching between the information for searching and the search result in the at least one search result and the reverse side target information in the reverse side target information set, and pushes prompt information to the terminal 301, and the terminal displays the prompt information in a search box and a search result page, which is shown in fig. 3.
The method provided by the above embodiment of the present application judges the authenticity of the information to be detected based on the feature keywords included in each item label information in the target information set, and obtains the reverse target information of the information to be detected.
With further reference to fig. 4, as an implementation of the method shown in the above figures, the present application provides an embodiment of an apparatus for generating information, which corresponds to the method embodiment shown in fig. 2, and which is particularly applicable to various electronic devices.
As shown in fig. 4, the apparatus 400 for generating information of the present embodiment includes: a first acquisition unit 401, a second acquisition unit 402, a first determination unit 403, and a second determination unit 404. The first obtaining unit 401 is configured to obtain a target information set of information to be detected, where target information in the target information set is web page information related to the information to be detected; the second obtaining unit 402 is configured to obtain a feature keyword of each item label information in the target information set, where the feature keyword is used to represent a support degree of target information on authenticity of the to-be-detected information, and the feature keyword includes a positive feature keyword and a negative feature keyword; the first determining unit 403 is configured to determine whether the information to be detected is the first type of information based on the feature keywords included in each item label information in the target information set; the second determining unit 404 is configured to determine, in response to determining that the information to be detected is the first type of information, target information in the target information set, which includes a reverse side feature keyword, as reverse side target information of the information to be detected.
In this embodiment, specific processes of the first obtaining unit 401, the second obtaining unit 402, the first determining unit 403, and the second determining unit 404 of the apparatus 400 for generating information and technical effects brought by the specific processes may refer to related descriptions of step 201, step 202, step 203, and step 204 in the corresponding embodiment of fig. 2, which are not described herein again.
In some optional implementations of this embodiment, the apparatus further includes: and a generating unit (not shown in the figure) for summarizing the reverse target information of at least one piece of information to be detected to generate a reverse target information set.
In some optional implementations of this embodiment, the apparatus further includes: a first receiving unit (not shown in the figure) for obtaining at least one search result based on the information for search received from the terminal; a first matching unit (not shown in the figure) for matching the search result in the search information and the at least one search result with the reverse target information in the reverse target information set; and a pushing unit (not shown in the figure) for pushing prompt information to the terminal in response to the information for searching and/or a search result in the at least one search result matching with the reverse target information in the reverse target information set.
In some optional implementations of this embodiment, the apparatus further includes: a second receiving unit (not shown in the figure), configured to receive a prompt information viewing request sent by the terminal, where the prompt information viewing request is generated by the terminal according to a viewing operation performed by a user on the prompt information; a sending unit (not shown in the figure) for obtaining the reverse side target information from the reverse side target information set according to the prompt information viewing request and sending the reverse side target information to the terminal for displaying by the terminal.
In some optional implementations of this embodiment, the pushing unit includes: a first prompt information pushing unit (not shown in the figure) for pushing first prompt information for displaying in a search box of the terminal to the terminal in response to the information for searching matching with the reverse target information in the reverse target information set; and a second prompt information pushing unit (not shown in the figure) for pushing, to the terminal, second prompt information for displaying on the search result page of the terminal in response to a match between a search result of the at least one search result and the reverse-side target information in the reverse-side target information set.
In some optional implementations of this embodiment, the first determining unit 403 includes: a second matching unit (not shown in the figure), configured to perform the following operations for each piece of target information in the set of target information: matching keywords contained in the entry label information with feature keywords in a preset feature keyword set to determine the feature keywords contained in the entry label information, wherein each feature keyword in the feature keyword set is preset with a score; a third determining unit (not shown in the figure) for determining the support degree score of the information to be detected based on a support degree score factor, wherein the support degree score factor includes at least one of the following: the score of the reverse side characteristic key words contained in the target information set, the source credibility of each target information and the number of pieces of target information containing the reverse side characteristic key words; a fourth determining unit (not shown in the figure) for determining whether the information to be detected is the first type information according to the support degree score of the information to be detected.
In some optional implementations of this embodiment, the fourth determining unit is further configured to: and judging whether the support degree score of the information to be detected is lower than a preset threshold value or not, and if so, determining that the information to be detected is the first type of information.
Referring now to FIG. 5, a block diagram of a computer system 500 suitable for use in implementing a server according to embodiments of the present application is shown. The server shown in fig. 5 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 5, the computer system 500 includes a Central Processing Unit (CPU)501 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 502 or a program loaded from a storage section 506 into a Random Access Memory (RAM) 503. In the RAM 503, various programs and data necessary for the operation of the system 500 are also stored. The CPU 501, ROM 502, and RAM 503 are connected to each other via a bus 504. An Input/Output (I/O) interface 505 is also connected to bus 504.
The following components are connected to the I/O interface 505: a storage section 506 including a hard disk and the like; and a communication section 507 including a Network interface card such as a LAN (Local Area Network) card, a modem, or the like. The communication section 507 performs communication processing via a network such as the internet. The driver 508 is also connected to the I/O interface 505 as necessary. A removable medium 509 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 508 as necessary, so that a computer program read out therefrom is mounted into the storage section 506 as necessary.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated in the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 507 and/or installed from the removable medium 509. The computer program performs the above-described functions defined in the method of the present application when executed by the Central Processing Unit (CPU) 501. It should be noted that the computer readable medium described herein can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present application, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in the embodiments of the present application may be implemented by software or hardware. The described units may also be provided in a processor, and may be described as: a processor includes a first acquisition unit, a second acquisition unit, a first determination unit, and a second determination unit. Where the names of these units do not in some cases constitute a limitation on the unit itself, for example, the first acquisition unit may also be described as a "unit that acquires a target information set of information to be detected".
As another aspect, the present application also provides a computer-readable medium, which may be contained in the apparatus described in the above embodiments; or may be present separately and not assembled into the device. The computer readable medium carries one or more programs which, when executed by the apparatus, cause the apparatus to: acquiring a target information set of information to be detected, wherein the target information in the target information set is webpage information related to the information to be detected; acquiring feature keywords of each item label information in the target information set, wherein the feature keywords are used for representing the support degree of the target information on the authenticity of the information to be detected, and comprise positive feature keywords and negative feature keywords; determining whether the information to be detected is first-class information or not based on the characteristic keywords contained in each item label information in the target information set; and in response to determining that the information to be detected is the first type of information, determining target information which contains the reverse side characteristic keywords in the target information set as reverse side target information of the information to be detected.
The above description is only a preferred embodiment of the application and is illustrative of the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the invention herein disclosed is not limited to the particular combination of features described above, but also encompasses other arrangements formed by any combination of the above features or their equivalents without departing from the spirit of the invention. For example, the above features may be replaced with (but not limited to) features having similar functions disclosed in the present application.

Claims (14)

1. A method for generating information, the method comprising:
acquiring a target information set of information to be detected, wherein the target information in the target information set is webpage information related to the information to be detected;
acquiring feature keywords of each item of label information in the target information set, wherein the feature keywords are used for representing the support degree of the target information on the authenticity of the information to be detected, and comprise positive feature keywords and negative feature keywords;
determining whether the information to be detected is first-class information or not based on feature keywords contained in each item label information in the target information set, wherein the first-class information is rumor information;
in response to determining that the information to be detected is rumor information, determining target information which contains reverse side feature keywords in the target information set as reverse side target information of the information to be detected; wherein, the reverse target information is the rumor splitting information of the information to be detected;
summarizing the reverse target information of at least one piece of information to be detected to generate a reverse target information set;
matching the information for searching received from the terminal with the reverse target information in the reverse target information set;
and responding to the matching of the information for searching and the reverse target information in the reverse target information set, and pushing prompt information to the terminal.
2. The method of claim 1, further comprising:
obtaining at least one search result according to the information for searching;
matching the information for searching and the search result in the at least one search result with the reverse target information in the reverse target information set respectively;
and responding to the information for searching and the search result in the at least one search result, or the search result in the at least one search result is matched with the reverse target information in the reverse target information set, and pushing prompt information to the terminal.
3. The method of claim 2, further comprising:
receiving a prompt message viewing request sent by the terminal, wherein the prompt message viewing request is generated by the terminal according to a viewing operation executed by a user aiming at the prompt message;
and acquiring reverse target information from the reverse target information set according to the prompt information viewing request, and sending the reverse target information to the terminal for display by the terminal.
4. The method according to claim 2, wherein the pushing of the prompt information to the terminal in response to the information for searching and/or the search result of the at least one search result matching the reverse target information of the set of reverse target information comprises:
responding to the matching of the information for searching and the reverse target information in the reverse target information set, and pushing first prompt information for displaying in a search box of the terminal to the terminal;
and responding to the matching of the search result in the at least one search result and the reverse side target information in the reverse side target information set, and pushing second prompt information for displaying on a search result page of the terminal to the terminal.
5. The method according to claim 1, wherein the determining whether the information to be detected is the first type of information based on the feature keyword included in each item label information in the target information set includes:
for each piece of target information in the set of target information, performing the following operations: matching keywords contained in the entry label information with feature keywords in a preset feature keyword set to determine the feature keywords contained in the entry label information, wherein each feature keyword in the feature keyword set is preset with a score;
determining a support degree score of the to-be-detected information based on a support degree score factor, wherein the support degree score factor comprises at least one of: the target information in the target information set comprises scores of the reverse side characteristic keywords, source credibility of each target information and the number of pieces of target information comprising the reverse side characteristic keywords;
and determining whether the information to be detected is the first type of information according to the support degree score of the information to be detected.
6. The method according to claim 5, wherein the determining whether the information to be detected is the first type information according to the support degree score of the information to be detected comprises:
and judging whether the support degree score of the information to be detected is lower than a preset threshold value or not, and if so, determining that the information to be detected is the first type of information.
7. An apparatus for generating information, the apparatus comprising:
the system comprises a first acquisition unit, a second acquisition unit and a third acquisition unit, wherein the first acquisition unit is used for acquiring a target information set of information to be detected, and the target information in the target information set is webpage information related to the information to be detected;
the second acquisition unit is used for acquiring the feature keywords of each item of label information in the target information set, wherein the feature keywords are used for representing the support degree of target information on the authenticity of the information to be detected, and comprise positive feature keywords and negative feature keywords;
a first determining unit, configured to determine whether the information to be detected is first-class information based on a feature keyword included in each item label information in the target information set, where the first-class information is rumor information;
a second determining unit, configured to determine, in response to determining that the information to be detected is rumor information, target information that includes a reverse side feature keyword in the target information set as reverse side target information of the information to be detected; wherein, the reverse target information is the rumor splitting information of the information to be detected;
the generating unit is used for summarizing the reverse target information of at least one piece of information to be detected and generating a reverse target information set;
a first matching unit, configured to match information for search received from a terminal with reverse side target information in the reverse side target information set;
and the pushing unit is used for responding to the matching of the information for searching and the reverse target information in the reverse target information set and pushing prompt information to the terminal.
8. The apparatus of claim 7, further comprising:
the first receiving unit is used for obtaining at least one search result according to the information for searching;
the first matching unit is further configured to match the information for search and a search result in the at least one search result with the reverse-side target information in the reverse-side target information set, respectively;
the pushing unit is further configured to push prompt information to the terminal in response to the information for search and a search result in the at least one search result, or a search result in the at least one search result matching with reverse target information in the reverse target information set.
9. The apparatus of claim 8, further comprising:
a second receiving unit, configured to receive a prompt information viewing request sent by the terminal, where the prompt information viewing request is generated by the terminal according to a viewing operation performed by a user for the prompt information;
and the sending unit is used for acquiring the reverse target information from the reverse target information set according to the prompt information viewing request and sending the reverse target information to the terminal for displaying by the terminal.
10. The apparatus of claim 8, wherein the pushing unit comprises:
the first prompt information pushing unit is used for responding to the matching of the information for searching and the reverse target information in the reverse target information set, and pushing first prompt information used for displaying in a search box of the terminal to the terminal;
and the second prompt information pushing unit is used for responding to the matching of the search result in the at least one search result and the reverse side target information in the reverse side target information set, and pushing second prompt information which is used for being displayed on a search result page of the terminal to the terminal.
11. The apparatus according to claim 7, wherein the first determining unit comprises:
a second matching unit, configured to perform the following operations for each piece of target information in the set of target information: matching keywords contained in the entry label information with feature keywords in a preset feature keyword set to determine the feature keywords contained in the entry label information, wherein each feature keyword in the feature keyword set is preset with a score;
a third determining unit, configured to determine a support degree score of the to-be-detected information based on a support degree score factor, where the support degree score factor includes at least one of: the target information in the target information set comprises scores of the reverse side characteristic keywords, source credibility of each target information and the number of pieces of target information comprising the reverse side characteristic keywords;
and the fourth determining unit is used for determining whether the information to be detected is the first type of information according to the support degree score of the information to be detected.
12. The apparatus of claim 11, wherein the fourth determining unit is further configured to:
and judging whether the support degree score of the information to be detected is lower than a preset threshold value or not, and if so, determining that the information to be detected is the first type of information.
13. A server, comprising:
one or more processors;
a storage device for storing one or more programs,
the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method recited in any of claims 1-6.
14. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1-6.
CN201710867243.9A 2017-09-22 2017-09-22 Method and apparatus for generating information Active CN107644084B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710867243.9A CN107644084B (en) 2017-09-22 2017-09-22 Method and apparatus for generating information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710867243.9A CN107644084B (en) 2017-09-22 2017-09-22 Method and apparatus for generating information

Publications (2)

Publication Number Publication Date
CN107644084A CN107644084A (en) 2018-01-30
CN107644084B true CN107644084B (en) 2021-05-04

Family

ID=61112104

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710867243.9A Active CN107644084B (en) 2017-09-22 2017-09-22 Method and apparatus for generating information

Country Status (1)

Country Link
CN (1) CN107644084B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130132851A1 (en) * 2011-11-22 2013-05-23 International Business Machines Corporation Sentiment estimation of web browsing user
CN103235818A (en) * 2013-04-27 2013-08-07 北京百度网讯科技有限公司 Information push method and device based on webpage emotion tendentiousness
CN104679739A (en) * 2013-11-27 2015-06-03 江苏华御信息技术有限公司 Method for controlling spreading of unreal information
CN106599286A (en) * 2016-12-23 2017-04-26 北京奇虎科技有限公司 Information monitoring rumor refuting realization method and apparatus, and mobile terminal

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130132851A1 (en) * 2011-11-22 2013-05-23 International Business Machines Corporation Sentiment estimation of web browsing user
CN103235818A (en) * 2013-04-27 2013-08-07 北京百度网讯科技有限公司 Information push method and device based on webpage emotion tendentiousness
CN104679739A (en) * 2013-11-27 2015-06-03 江苏华御信息技术有限公司 Method for controlling spreading of unreal information
CN106599286A (en) * 2016-12-23 2017-04-26 北京奇虎科技有限公司 Information monitoring rumor refuting realization method and apparatus, and mobile terminal

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
面向公共危机预警的网络舆情分析研究;董坚峰;《中国博士学位论文全文数据库(信息科技辑)》;20150515;全文 *

Also Published As

Publication number Publication date
CN107644084A (en) 2018-01-30

Similar Documents

Publication Publication Date Title
CN107256267B (en) Query method and device
CN109145280B (en) Information pushing method and device
CN107679211B (en) Method and device for pushing information
CN107577807B (en) Method and device for pushing information
CN107241260B (en) News pushing method and device based on artificial intelligence
CN109543058B (en) Method, electronic device, and computer-readable medium for detecting image
US9069868B2 (en) Computer device for reading e-book and server for being connected with the same
CN106960030B (en) Information pushing method and device based on artificial intelligence
US9720904B2 (en) Generating training data for disambiguation
US20190171724A1 (en) Method and apparatus for determining hot event
CN110069698B (en) Information pushing method and device
CN106919711B (en) Method and device for labeling information based on artificial intelligence
CN107731229B (en) Method and apparatus for recognizing speech
CN107526718B (en) Method and device for generating text
CN106681598B (en) Information input method and device
CN109446442B (en) Method and apparatus for processing information
CN107944032B (en) Method and apparatus for generating information
CN107908662B (en) Method and device for realizing search system
CN106886594B (en) Method and device for displaying information
CN110737824B (en) Content query method and device
CN110019906B (en) Method and apparatus for displaying information
CN111897950A (en) Method and apparatus for generating information
CN112214770B (en) Malicious sample identification method, device, computing equipment and medium
KR102151322B1 (en) Information push method and device
CN105955988B (en) Information searching method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant