CN110837590A - Information pushing method and device, computer equipment and storage medium - Google Patents

Information pushing method and device, computer equipment and storage medium Download PDF

Info

Publication number
CN110837590A
CN110837590A CN201910988193.9A CN201910988193A CN110837590A CN 110837590 A CN110837590 A CN 110837590A CN 201910988193 A CN201910988193 A CN 201910988193A CN 110837590 A CN110837590 A CN 110837590A
Authority
CN
China
Prior art keywords
information
industry
paragraph
preset
industry information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910988193.9A
Other languages
Chinese (zh)
Other versions
CN110837590B (en
Inventor
钱文杰
娄颖颖
王佳丽
俞冰
林方舟
邱子轩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Dasou Vehicle Software Technology Co Ltd
Original Assignee
Zhejiang Dasou Vehicle Software Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Dasou Vehicle Software Technology Co Ltd filed Critical Zhejiang Dasou Vehicle Software Technology Co Ltd
Priority to CN201910988193.9A priority Critical patent/CN110837590B/en
Publication of CN110837590A publication Critical patent/CN110837590A/en
Application granted granted Critical
Publication of CN110837590B publication Critical patent/CN110837590B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The method comprises the steps that a server acquires industry information, eliminates the industry information according to preset elimination rules, screens out important information from the eliminated industry information, classifies the important information according to preset classification rules, and pushes the classified industry information to a viewing system. In the application, the server screens the industry information after acquiring the industry information, so that the major information is timely pushed to the viewing system, the timeliness of pushing the major information is improved, manual intervention is not required in the whole information pushing process, and the labor cost is reduced.

Description

Information pushing method and device, computer equipment and storage medium
Technical Field
The present application relates to the field of internet technologies, and in particular, to an information pushing method and apparatus, a computer device, and a storage medium.
Background
With the rapid development of computer technology and network technology, information explosion has become a feature of this age. It becomes relatively simple to obtain information such as industry dynamics, policy changes, etc. from the network, and it is very important for enterprise development to obtain the information of these real-time changes.
The conventional general information acquisition mode is a web crawler technology, the web crawler technology is an information crawling technology based on keywords, and according to a set target website and the keywords, a web crawler tool is used for automatically crawling related information on the target website. However, the crawler tool is too mechanized, crawls completely based on keywords, and often crawls a lot of irrelevant information, so that business personnel are required to perform further screening after the crawler acquires the information.
However, for important industry information concerned by enterprises, the timeliness of crawling the network information by using a crawler tool is low, and the labor cost is high.
Disclosure of Invention
In view of the foregoing, it is desirable to provide an information pushing method, apparatus, computer device and storage medium.
In a first aspect, an information pushing method is provided, where the method includes:
acquiring industry information; the industry information comprises a website data source, keywords, article titles, article full texts, publishers, webpage links, publication time and crawling time;
removing the industry information according to a preset removing rule to obtain the removed industry information;
determining major information in the removed industry information;
classifying the major information in the eliminated industry information according to a preset classification rule to obtain classified industry information;
and pushing the classified industry information to a viewing system.
In the embodiment, the server acquires the latest industry information in the original database, removes the acquired latest industry information, determines the important information in the removed industry information, classifies the important information, and timely pushes the classified important information to the viewing system, so that the timeliness of pushing the important information is improved, the whole information pushing process is free from manual intervention, and the labor cost is reduced.
In one embodiment, the culling rule includes at least one of the following rules:
eliminating industry information including preset stop words in article titles;
eliminating industry information which does not contain preset industry related keywords in article titles;
and eliminating repeated industry information which is pushed within a preset time period.
In this embodiment, the server performs elimination and screening on the acquired new industry information, eliminates information that a current industry information article title contains too many stop words, eliminates information that the current industry information article title does not contain an industry keyword, eliminates repeated industry information that has been pushed within a preset time period, and retains industry information with high industry relevance after the elimination processing, thereby facilitating determination of important information in the next step.
In one embodiment, the determining the important information in the removed industry information includes:
extracting industry key words in article titles of the removed industry information;
comparing the industry keywords with industry major event keywords in a preset keyword dictionary;
and if the industry key words are consistent with the industry major event key words in the key word dictionary, determining the removed industry information as major information.
In this embodiment, the server determines the important information of the removed industry information, further screens the industry information, extracts the industry keywords in the article title of the current industry information, compares the industry keywords with the important event keywords, and determines whether the current industry information is the important information, so that the determination method is simple, and the industry information with relatively low industry relevance is further filtered.
In one embodiment, the preset classification rule includes: the corresponding relation between the industry key words of the chapter title in the industry information and the information categories, and the priority order of each information category.
In this embodiment, the server presets the corresponding relationship between the business keyword and the information category and the priority order of each information category. The industry information is conveniently classified and sorted later and then pushed according to the priority order.
In one embodiment, the classifying the major information in the removed industry information according to a preset classification rule to obtain the classified industry information includes:
determining the information category of each important information according to the corresponding relation and the keywords of the article title of each important information to obtain the classified industry information;
and sorting the classified industry information according to the priority order of the information categories.
In this embodiment, the server classifies the information types of the industry information, sorts and pushes the industry information according to the priority of the information types, so that the pertinence of the industry information is more clear, and pushes the industry information according to the priority order, so that the industry information with high attention cannot be submerged in other information, and the purpose of pushing the important information is achieved.
In one embodiment, the pushing the classified industry information to the viewing system includes:
acquiring abstract information of the classified industry information;
and pushing the summary information to a viewing system.
In this embodiment, the server extracts summary information of each classified industry information, and pushes the summary information to the viewing system, so that the full text of the article without pushing the industry information is achieved, the purpose of pushing resources is saved, and meanwhile, the problem that the expression of the industry information is unclear due to the article title of the industry information being pushed is solved.
In one embodiment, the obtaining summary information of the classified industry information includes:
acquiring the kth paragraph information of the classified industry information;
judging whether the character length of the kth paragraph information is larger than or equal to a preset first threshold value or not;
if the character length of the kth paragraph information is smaller than a first threshold value, acquiring the (k + 1) th paragraph information, and judging whether the character length of the (k + 1) th paragraph information is larger than or equal to the first threshold value;
if the character length of the kth paragraph information is larger than or equal to the first threshold, judging whether the character length of the kth paragraph information is smaller than or equal to a preset second threshold;
if the character length of the kth paragraph information is less than or equal to a second threshold value, taking the kth paragraph information as summary information;
if the character length of the kth paragraph information is larger than a second threshold value, splitting sentences in the kth paragraph information, combining the sentences obtained by splitting according to a preset rule, and determining the combined paragraph information as abstract information; the length of the combined paragraph information character is larger than or equal to a first threshold value and smaller than or equal to a second threshold value.
In this embodiment, the server performs character processing on the character length of the kth natural paragraph information of the acquired current industry information, so that the character length of the final paragraph information is greater than or equal to a first threshold value and less than or equal to a second threshold value, and then pushes the paragraph information as summary information. The character length of the abstract information is standardized, the occupation of resources is reduced to a certain extent, and meanwhile, information redundancy can be avoided.
In a second aspect, an information pushing apparatus is provided, the apparatus comprising:
the acquisition module is used for acquiring industry information; the industry information comprises a website data source, keywords, article titles, article full texts, publishers, webpage links, publication time and crawling time;
the removing module is used for removing the industry information according to a preset removing rule to obtain the removed industry information;
the determining module is used for determining important information in the eliminated industry information;
the classification module is used for classifying the major information in the eliminated industry information according to a preset classification rule to obtain the classified industry information;
and the pushing module is used for pushing the classified industry information to the checking system.
In a third aspect, a computer device is provided, which includes a memory and a processor, where the memory stores a computer program, and the processor implements the information pushing method provided in any one of the embodiments of the first aspect when executing the computer program.
In a fourth aspect, a computer-readable storage medium is provided, on which a computer program is stored, where the computer program, when executed by a processor, implements the information pushing method provided in any one of the embodiments of the first aspect.
The beneficial effects brought by the technical scheme provided by the embodiment of the application at least comprise:
the application relates to an information pushing method, an information pushing device, computer equipment and a storage medium. The server obtains the industry information, removes the industry information according to a preset removing rule to obtain the removed industry information, determines the important information in the removed industry information, classifies the important information in the removed industry information according to a preset classification rule to obtain the classified industry information, and finally pushes the classified industry information to a viewing system. In the application, because the server is after acquireing new trade information, get rid of, confirm, categorised the major information who obtains preferred propelling movement to this trade information, make at trade information propelling movement in-process, this major information can be preferentially pushed to the viewing system in, the purpose of the timely preferred propelling movement of major information has been realized, and whole information propelling movement process does not need artifical screening, the promptness of relying on the crawler instrument to crawl network information has been improved, the cost of labor has been reduced simultaneously.
Drawings
FIG. 1 is a diagram of an application environment of an information pushing method in an embodiment;
FIG. 2 is a flowchart illustrating an information pushing method according to an embodiment;
FIG. 3 is a flowchart illustrating an information pushing method according to another embodiment;
FIG. 4 is a flowchart illustrating an information pushing method according to another embodiment;
FIG. 5 is a flowchart illustrating an information pushing method according to another embodiment;
FIG. 6 is a flowchart illustrating an information pushing method according to another embodiment;
FIG. 7 is a block diagram of an information pushing device in an embodiment;
FIG. 8 is a block diagram of an information pushing device in another embodiment;
FIG. 9 is a block diagram of an information pushing device in another embodiment;
FIG. 10 is a block diagram of an information pushing device in another embodiment;
FIG. 11 is a block diagram of an information pushing device in another embodiment;
FIG. 12 is a diagram illustrating an internal structure of a computer device according to an embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The information pushing method provided by the application can be applied to the application environment shown in fig. 1. Wherein the viewing system terminal 101 and the server 102 communicate via a network. The server 102 acquires new industry information, processes the industry information, and then pushes the finally obtained important information to the viewing system terminal 101. The viewing system terminal 101 may be, but not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices, and the server 102 may be implemented by an independent server or a server cluster formed by a plurality of servers.
The following describes in detail the technical solutions of the present application and how the technical solutions of the present application solve the above technical problems by embodiments and with reference to the drawings. The following several specific embodiments may be combined with each other, and details of the same or similar concepts or processes may not be repeated in some embodiments. It should be noted that, in the information pushing method provided in the embodiments of fig. 2 to fig. 7 of the present application, the execution subject may be a server, or may be an information pushing apparatus, and the information pushing apparatus may become part or all of the server through software, hardware, or a combination of software and hardware. In the following method embodiments, the following method embodiments are all described by taking the example where the execution subject is a server.
In an embodiment, as shown in fig. 2, an information pushing method is provided, which is described by taking an example that the method is applied to a server in an application environment of fig. 1, and the embodiment relates to a specific process that after the server acquires new industry information, the server processes the industry information to obtain important information that is preferentially pushed, so that the important information can be preferentially pushed to a viewing system in an industry information pushing process, and the method includes the following steps:
s201, acquiring industry information; the industry information comprises a website data source, keywords, article titles, article full texts, publishers, webpage links, publication time and crawling time.
The server can acquire the industry information from the original database, and when the server monitors that the original database has new industry information, the server acquires the latest industry information from the original database. The acquired industry information comprises a current website data source, a current keyword and crawling time, an article title and an article full text of the currently acquired industry information, a publisher and publishing time of the currently acquired industry information, and a webpage link of the currently acquired industry information.
In this embodiment, before the server acquires the industry information, the industry information can be crawled through a crawler tool according to a pre-specified keyword and a website database, wherein periodic crawling can be performed according to a preset period, real-time crawling can also be performed, and the crawled industry information is stored in an original database. When the latest industry information is monitored to be added into the original database, the server acquires the latest industry information from the original database for the subsequent related processing. For example, firstly, the server may pre-designate the keyword as "used car" or the like, the designated website database may be various large information websites such as the chinese daily news network, the Tencent network, the Xinlang network, the east wealth network, etc., the crawler tool pre-designates the keyword as the industry information of "used car" in the east wealth network, and adds the crawled industry information into the original database, when the server monitors that the latest industry information related to the 'used car' is added in the original database, the latest industry information is obtained, the industry information can include oriental wealth network, "used cars", 27 minutes and 38 seconds at 16 days 9 and 3 in 2019, "used cars market coming younger trend-great car hunters choosing and popularizing new generation consumption concept", article full text, financial investment network, 26 minutes at 16 days 9 and 3 in 2019 and http: html, eastmoney, com/a/201909031226571784. This is not a limitation in this embodiment.
S202, eliminating the industry information according to a preset eliminating rule to obtain the eliminated industry information.
The preset removing rule is used for further screening the latest industry information acquired by the server from the original database, for example, the removing rule may be to remove industry information of stop words included in the industry information, and/or remove industry information that does not include keywords in the titles of industry information articles, and/or remove some repeatedly pushed industry information, and/or remove some incomplete industry information, and the like.
In this embodiment, the server removes part of the acquired industry information according to a preset removal rule, which is equivalent to performing a preliminary screening on the current industry information, and performs the next processing on the obtained removed industry information. For example, the server may remove information that includes stop words in the title of the industry information article, where the stop words may include "in", "inside", "also", "of", "it", "be", and the like; and/or the server may also reject information that does not include keywords in the title of the industry information article, where the keywords may include "used cars", "vehicle finance", "cars", "vehicles", and the like, and/or the server may also reject repeated industry information that has been pushed within three days, and/or the server may also reject incomplete industry information that is obtained due to other reasons, which is not limited in this embodiment.
S203, determining important information in the eliminated industry information.
The important information refers to industry information with a preset high priority level for industries, and corresponding important event keywords are set in the important information and stored in a keyword dictionary. There are various ways to determine whether the removed industry information is the important information, for example, whether the industry information is the important information may be determined by determining whether the chapter title industry keywords in the removed industry information include the important event keywords, whether the industry information is the important information may also be determined by determining the number of times that the chapter global industry keywords in the removed industry information include the important event keywords, and whether the industry information is the important information may also be determined by determining the priority level of the removed industry information, which is not limited in this embodiment.
In this embodiment, the corresponding major event keywords set by the major information are stored in a keyword dictionary, when the server determines the major information of the removed industry information, the server obtains the major event keywords from the keyword dictionary, and simultaneously obtains the article title industry keywords of the removed industry information for comparison, if the article title industry keywords of the removed industry information include any major event keyword, the removed industry information is determined to be the major information. For example, the important event keyword is preset as "used car", the eliminated industry information article is marked as "used car market coming youngness trend-big car searcher selecting new generation consumption concept", the industry keyword of the article title is "used car", the industry keyword of the article title includes the important event keyword, so that the industry information is determined as the important information, the industry information with priority level of more than 3 is preset as the important information, the article is marked as "used car market coming youngness trend-big car searcher selecting new generation consumption concept", and the industry information is determined as the important information according to the preset priority classification of 1 level. This is not a limitation in this embodiment.
S204, classifying the major information in the eliminated industry information according to a preset classification rule to obtain the classified industry information.
The preset classification rule is used for classifying the industry information according to preset information types, different information types correspond to different priority orders, and major information in the eliminated industry information is classified to obtain the classified industry information. Optionally, the server may push the classified industry information according to a priority order and/or push the classified industry information according to an information category.
In this embodiment, the classification is performed according to a preset classification rule, the categories may include policy dynamics, market dynamics, personnel dynamics, enterprise dynamics, and other categories, and optionally, different categories correspond to different priority orders. For example, the priority order of the policy dynamic class may be set to a level 1 order, and the priority order of the enterprise dynamic class may be set to a level 4 order, which is not limited in this embodiment. In the classification process, the industry keywords in the article title of the current industry information can be obtained and compared with the category keywords, if the industry keywords of the current industry information comprise at least one of the category keywords, the current industry information is classified into the information type corresponding to the category keywords, optionally, one industry information can correspond to a plurality of information types, and the highest priority order is taken as the sorting basis when the priority sorting is carried out. If the industry key words of the current industry information do not contain any category key words, the industry information is classified into other categories. Illustratively, if the industry keywords in the article title of the current industry information include keywords of a policy dynamic class, the industry information is classified as the policy dynamic class, and the priority of the policy dynamic class is 1 according to a preset priority order rule, the industry information type is pushed according to the current highest priority order, that is, the industry information is pushed as the industry information with the priority of 1. This is not a limitation in this embodiment.
S205, the classified industry information is pushed to a viewing system.
The classified industry information is pushed to a checking system, the checking system refers to an enterprise internal office system which is communicated with the server through a network, and the enterprise internal office system can be concentrated on one terminal device or formed by a plurality of terminal devices. The server pushes the processed industry information to a viewing system, wherein the pushing mode can be that pushing is carried out according to the priority order of the industry information, pushing can also be carried out according to the time order, pushing can also be carried out according to the classified information category, and after pushing, any user in the system can timely obtain the industry information through the viewing system.
In this embodiment, optionally, the server may sort and push the classified industry information to the viewing system according to a category priority order, where the category priority order may be set as policy dynamic 1-priority, market dynamic 2-priority, personnel dynamic 3-priority, enterprise dynamic 4-priority, and others do not have priorities. Illustratively, if the classified industry information belongs to the policy dynamic category, the priority of the industry information is level 1, which is equivalent to that the pushing sequence of the industry information is 1, the server sets the industry information to be level 1 to push the industry information to the viewing system, or the classified industry information belongs to the policy dynamic category, the server only pushes the industry information of which the information category is the policy dynamic category within a period of time, or the server pushes the industry information according to the time sequence. This is not a limitation in this embodiment.
In the information pushing method, the server acquires the latest industry information from the original database, then removes part of the industry information according to a preset removing rule to obtain the removed industry information, then determines the important information in the removed industry information, classifies the important information in the removed industry information according to a preset classification rule to obtain the classified industry information, and finally pushes the classified industry information to the checking system. According to the method, the server removes the obtained industry information, the major information is screened from the removed industry information, the major information is classified according to information types, and finally the classified major industry information is timely pushed to the viewing system, so that the timeliness of pushing the major industry information is improved, manual intervention is not needed in the whole information pushing process, and the labor cost is reduced.
In one embodiment, the culling rule includes at least one of the following rules: the industry information which contains preset stop words in the article title is removed, the industry information which does not contain preset industry related keywords in the article title is removed, and repeated industry information which is pushed within a preset time period is removed.
Specifically, the industry information including a preset stop word is removed from the article title, where the stop word refers to a word or a word with too high use frequency, and the word or the word is often an article, a preposition, an adverb, a conjunctive word, or the like, for example, "in", "inside", "also", "it", "being", and "being" are all stop words, and the stop word exists on almost every web page due to too high use frequency.
Specifically, industry information of an article title which does not contain preset industry-related keywords is removed, wherein the industry-related keywords are preset keywords related to industries, the keywords are stored in a keyword dictionary, when a server obtains new industry information, the article title of the industry information is scanned and detected by the industry keywords, and if the article title does not contain the industry keywords, the server selects to remove the industry information corresponding to the article title.
Specifically, the repeated industry information which is pushed within a preset time period can be removed, the repeated industry information which is pushed by a server within a preset time period can be removed, wherein the repeated industry information judgment mode can be that the server performs word segmentation comparison on an acquired new industry information Chinese chapter title and a pushed industry information Chinese chapter title in an original database according to a preset word segmentation rule, word segmentation refers to the division of an article title on word segments, the word segments in the current industry information article title are compared with the word segments in the pushed industry information Chinese chapter title, if the repetition rate of the word segments exceeds a preset repetition threshold value, the current industry information is regarded as the pushed repeated information, and the server removes the industry information.
In this embodiment, after the server obtains the latest industry information from the original database, the server may perform scanning detection on stop words for the article title of the industry information, and if the article title contains too many stop words, the article title is meaningless, and the server rejects the corresponding industry information. For example, if the article title of the industry information obtained by the server from the original database is "what is the current value of the used vehicle", the scanning detects that the stop words in the article title are "current", "existing", "yes" and "what", because the article title contains too many stop words, the server selects to remove the industry information corresponding to the article title. The method can also be used for carrying out industry keyword scanning detection on the article title of the industry information according to a pre-established keyword dictionary containing industry keywords, and eliminating the industry information which does not contain preset industry related keywords in the article title. For example, the industry keywords included in the keyword dictionary may be "used car", "used car finance", "vehicle rental", "car rental", and the like, and when the article title of the new industry information acquired by the server is "driving in tens of millions of cars during fatigue driving", and the article title does not include any industry keyword, the server rejects the industry information corresponding to the article title. Or, the pushed industry information within a preset time period may be removed, for example, the preset time period may be 3 days before the current industry information acquisition time, that is, the server removes the repeated industry information that has been pushed within 3 days before the current industry information acquisition time. When judging whether the current industry information is the pushed industry information, firstly, performing word segmentation comparison on the article title of the current industry information and the article title of the pushed industry information to obtain word segments in the article titles of the current industry information and the pushed industry information, and performing repetition rate calculation, specifically, the repetition rate can be obtained by the ratio of the number of repeated word segments to the total number of word segments, setting a repetition threshold value to be 0.6, if the repetition rate exceeds the repetition threshold value to be 0.6, judging that the current industry information is the pushed industry information, and rejecting the current industry information by a server. This is not a limitation in this embodiment.
In this embodiment, the server performs elimination and screening on the acquired new industry information, specifically, eliminates information that a current industry information article title contains too many stop words, eliminates information that the current industry information article title does not contain an industry keyword, eliminates repeated industry information that has been pushed within a preset time period, and retains industry information with high industry relevance after elimination, thereby facilitating determination of important information in the next step.
In one embodiment, as shown in fig. 3, the step 203 "determining important information in the removed industry information" in the above embodiment includes:
s301, extracting the industry key words in the article titles of the eliminated industry information.
The industry keywords refer to preset keywords with high relevance to the industry, and the keywords are stored in a keyword dictionary. The server extracts the industry keywords in the article title of the removed industry information, and compares the industry keywords with preset industry major event keywords to determine whether the current industry information is major information.
In this embodiment, the server extracts the industry keywords of the chapter title in the industry information after being removed, compares the industry keywords with the major event keywords in the keyword dictionary, and determines whether the current industry information is major information. For example, the important industrial event keywords in the keyword dictionary may include "used cars", "used cars finance", and the like, and the article title of the removed industrial information may be "used cars market younger trend — the concept of new generation consumption by the choice of large car hunters", and the industrial keyword in the article title of the industrial information is extracted as "used cars". This is not a limitation in this embodiment.
S302, comparing the industry keywords with industry major event keywords in a preset keyword dictionary.
The server can preset the content included by the industry major event keywords, and after extracting the industry keywords in the article title of the removed industry information, the industry keywords are compared with the industry major event keywords.
In this embodiment, for example, the server may set that the industry major event keywords in the keyword dictionary include "used cars", "used cars finance", and the like, the industry keywords in the extracted article title of the industry information after being rejected include "used cars", and compare the industry keywords "used cars" with the industry major event keywords "used cars", "used cars finance", and the like one by one, and the comparison result is used to determine whether the industry information is major information, which is not limited in this embodiment.
S303, if the industry key words are consistent with the industry major event key words in the key word dictionary, determining the removed industry information as major information.
In this embodiment, the server previously sets an industry major event keyword with a high industry relevance, and after extracting the industry keyword in the article title of the removed industry information, compares the industry keyword with the industry major event keyword. For example, the industry major event keywords may include "used cars", "used cars finance", and the like, and the extracted article title of the removed industry information may be "used cars market coming youngster trend — a new generation consumption concept selected by a large car searcher", where the industry keywords include "used cars", and the industry keywords "used cars" are compared with the industry major event keywords "used cars", "used cars finance", and the like, and the comparison result is consistent, it is indicated that the industry information is major information, which is not limited in this embodiment.
In this embodiment, the server determines the important information of the removed industry information, further screens the industry information, extracts the industry keywords in the article title of the current industry information, compares the industry keywords with the important event keywords, and determines that the current industry information is the important information if the comparison result is consistent, so that the determination method is simple, and the industry information with relatively low industry relevance is further filtered.
In one embodiment, the preset classification rule includes: the corresponding relation between the key words of the chapter title in the industry information and the information categories, and the priority order of each information category.
The information type refers to a type preset by the server, the server associates and corresponds the preset information type with an industry keyword in the keyword dictionary, so that when the server acquires the industry information, the server extracts the industry keyword of a chapter title in the industry information and compares the industry keyword with the industry keyword to obtain the information type corresponding to the industry information, and in addition, different information types correspond to different priority sequences and are used for preferentially pushing the information type with higher priority to a viewing system in an information pushing process.
Optionally, in an embodiment, as shown in fig. 4, in the step S204 "classify the important information in the removed industry information according to a preset classification rule to obtain the classified industry information" in the above embodiment, the method includes:
s401, determining the information category of each important information according to the corresponding relation and the keywords of the article title of each important information to obtain the classified industry information.
In this embodiment, the information category may include policy dynamic, market dynamic, personnel dynamic, enterprise dynamic, and others, and the server associates and corresponds the preset information category with the industry keyword in the keyword dictionary, for example, the information category corresponding to the keyword "used car market" is "market dynamic", the information category corresponding to the keyword "used car finance" is "policy dynamic", and the industry keyword extracting the chapter title in the industry information from the server is "used car market", and classifies the industry information as market dynamic, which is not limited in this embodiment.
S402, sorting the classified industry information according to the priority order of the information types.
The priority order of the information categories refers to setting different priority orders for different information categories according to different information categories and preset priority rules, and then sorting the classified industry information according to the set priority orders.
In this embodiment, the information categories may include policy-dynamic, market-dynamic, personnel-dynamic, enterprise-dynamic, and others, and for example, the policy-dynamic category may be set to priority 1, the market-dynamic category may be set to priority 2, the personnel-dynamic category may be set to priority 3, the enterprise-dynamic category may be set to priority 4, and the others may be set to priority 5 or no priority. After classifying the acquired industry information into market dynamics, the server sets the priority order corresponding to the market dynamics as priority 2, and sets the industry information as the 2 nd priority for pushing, which is not limited in this embodiment.
In this embodiment, the server classifies the information types of the industry information, sorts and pushes the industry information according to the priority of the information types, so that the pertinence of the industry information is more clear, and pushes the industry information according to the priority order, so that the industry information with high attention cannot be submerged in other information, and the purpose of pushing the important information is achieved.
In one embodiment, as shown in fig. 5, the step 205 "pushing the classified business information to the viewing system" in the above embodiment includes:
s501, abstract information of the classified industry information is obtained.
The summary information refers to paragraph information which is formed by character length processing of the kth natural paragraph information of each industry information after the server acquires each classified industry information, and accords with the character length of standard paragraph information, and the summary information is used for information push later.
In this embodiment, after acquiring the classified industry information, the server scans and extracts the full text of the industry information, and may perform character length processing on the candidate abstract information by extracting the kth natural paragraph information in the full text of the article as the candidate abstract information, where the paragraph information after the character length processing is the abstract information. This is not a limitation in this embodiment.
Optionally, in an embodiment, one implementation manner of the step 501 "obtaining summary information of each classified industry information" may be a method provided in this embodiment, as shown in fig. 6, including:
s601, acquiring the kth paragraph information of the classified industry information.
The classified industry information refers to industry information obtained by classifying the industry information according to preset classification rules. The kth paragraph information refers to the information of the kth natural paragraph in the article text of the current industry information.
In this embodiment, after the server classifies the industry information, the server scans the full text of the classified industry information to obtain the kth paragraph information, and is used to compare the character length of the current kth paragraph information with the preset minimum paragraph information character length. This is not a limitation in this embodiment.
S602, judging whether the character length of the kth paragraph information is larger than or equal to a preset first threshold value; if the length of the k-th paragraph information is smaller than the first threshold, S603 is executed, and if the length of the k-th paragraph information is greater than or equal to the first threshold, S604 is executed.
After the server obtains the paragraph information, the character length of the paragraph information is obtained, whether the current k-th paragraph information length is greater than or equal to a preset first threshold value or not is judged, and if the character length of the k-th paragraph information is smaller than the first threshold value, step 603 is executed; if the length of the k-th paragraph information is greater than or equal to the first threshold, go to step 604; specifically, the first threshold refers to a minimum paragraph information character length preset by the server.
In this embodiment, exemplarily, the initial value of k is set to be 1, the first threshold is 15 character lengths, and if the character length of the 1 st paragraph information acquired by the server is 10, and the character length of the 1 st paragraph information is smaller than the first threshold, step 603 is executed; if the character length of the 1 st paragraph information obtained by the server is 15 or 30, and the character length of the 1 st paragraph information is equal to or greater than the first threshold, step 604 is executed. This is not a limitation in this embodiment.
S603, acquiring the (k + 1) th paragraph information, and judging whether the character length of the (k + 1) th paragraph information is larger than or equal to a first threshold value.
In this embodiment, the same example as described above is used to explain that, when the character length of the 1 st paragraph information acquired by the server is 10 and the character length of the 1 st paragraph information is smaller than the first threshold, the server acquires k +1 paragraph information, that is, the server acquires the 2 nd paragraph information, and then compares the character length of the 2 nd paragraph information with the first threshold, that is, returns to execute step 601. This is not a limitation in this embodiment.
S604, judging whether the character length of the kth paragraph information is less than or equal to a preset second threshold value; if the length of the k-th paragraph information is less than or equal to the second threshold, S605 is executed, and if the length of the k-th paragraph information is greater than the second threshold, S606 is executed.
The preset second threshold refers to a standard paragraph information character length preset by the server, the server determines whether the current k-th paragraph information length is less than or equal to the preset second threshold, and if the k-th paragraph information character length is less than or equal to the second threshold, step 605 is executed; if the length of the k-th paragraph information is greater than the second threshold, go to step 606.
In this embodiment, the second threshold is set to be 50, and in the same example as above, when the character length of the 1 st paragraph information acquired by the server is 15 or 30, and the character length of the 1 st paragraph information is smaller than the second threshold, step 605 is executed; if the character length of the 1 st paragraph information obtained by the server is 70, step 606 is executed. This is not a limitation in this embodiment.
And S605, taking the kth paragraph information as summary information.
In this embodiment, the same example is described above, where the character length of the 1 st paragraph information acquired by the server through the comparison and determination of the first threshold and the comparison and determination of the second threshold is 15 or 30, and is smaller than the second threshold, the 1 st paragraph information is used as summary information for pushing to the viewing system.
S606, splitting sentences in the kth paragraph information, combining the split sentences according to a preset rule, and determining the combined paragraph information as summary information; the length of the combined paragraph information character is larger than or equal to a first threshold value and smaller than or equal to a second threshold value.
The preset splitting rule may include modes of retaining statements within the character length, deleting other statements exceeding the character length, or deleting statements containing more stop words in the first natural paragraph, retaining statements containing more keywords, and then combining the retained statements, which is not limited in this embodiment.
In this embodiment, by using the same example as described above, if the character length of the 1 st paragraph information acquired by the server is 70, the server splits the 1 st paragraph information into sentences, and then combines partial sentences from the split sentence set to obtain combined paragraph information, so that the character length of the combined paragraph information is greater than or equal to the first threshold and less than or equal to the second threshold, for example, the character length of the combined paragraph information may be 45, which satisfies the above-mentioned character length requirement range, which is not limited in this embodiment.
And S502, pushing the summary information to a viewing system.
The summary information refers to paragraph information which is formed after the kth natural paragraph information of the current industry information is processed by character length and accords with the character length of standard paragraph information, the paragraph information is used for being pushed to a viewing system, and the viewing system can be an enterprise internal office system or other systems.
In this embodiment, after the server performs the character length processing on the kth natural paragraph information of the current industry information, the paragraph information with the character length in the preset range is finally obtained, and the paragraph information is used as the summary information of the current information to be pushed to the viewing system. In the same example, the second threshold is set to be 50, the server splits and combines the first paragraph information with the character length of 80 characters through a sentence to obtain a standard paragraph with the character length of 50 characters as summary information of the current industry information, and pushes the summary information to the viewing system, where the viewing system may be an enterprise internal office system, a management-oriented system, or other employees, and this is not limited in this embodiment.
In this embodiment, the server extracts summary information of each classified industry information, specifically, first obtains the kth natural paragraph information of the current industry information, performs first threshold judgment processing and second threshold judgment processing on the character length of the natural paragraph information, executes corresponding operations according to the result so that the character length of the final paragraph information is greater than or equal to the first threshold and less than or equal to the second threshold, and then pushes the paragraph information as the summary information. The character length of the abstract information is standardized, the occupation of resources is reduced to a certain extent, and meanwhile, information redundancy can be avoided.
It should be understood that although the various steps in the flow charts of fig. 1-6 are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least some of the steps in fig. 1-6 may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performance of the sub-steps or stages is not necessarily sequential, but may be performed in turn or alternating with other steps or at least some of the sub-steps or stages of other steps.
In one embodiment, as shown in fig. 7, an information pushing apparatus 800 is provided, which includes: an obtaining module 801, a removing module 802, a determining module 803, a classifying module 804 and a pushing module 805, wherein:
an obtaining module 801, configured to obtain industry information; the industry information comprises a website data source, keywords, article titles, article full texts, publishers, webpage links, publication time and crawling time;
the removing module 802 is configured to remove the industry information according to a preset removing rule to obtain removed industry information;
the determining module 803 is configured to determine major information in the removed industry information;
the classification module 804 is configured to classify the major information in the eliminated industry information according to a preset classification rule to obtain the classified industry information;
the pushing module 805 is configured to push the classified industry information to the viewing system.
In one embodiment, the culling rules include at least one of the following rules:
eliminating industry information including preset stop words in article titles;
eliminating industry information which does not contain preset industry related keywords in article titles;
and eliminating repeated industry information which is pushed within a preset time period.
In one embodiment, as shown in fig. 8, the determining module 803 on the basis of fig. 7 includes an extracting unit 8031, a comparing unit 8032 and a determining unit 8033, wherein:
the extraction unit 8031 is used for extracting the industry keywords in the article titles of the removed industry information;
a comparing unit 8032, configured to compare the industry keyword with an industry major event keyword in a preset keyword dictionary;
the determining unit 8033 is configured to determine that the removed industry information is important information if the industry keyword is consistent with the industry major event keyword in the keyword dictionary.
In an embodiment, the preset classification rule includes: the corresponding relation between the industry key words of the chapter title in the industry information and the information categories, and the priority order of each information category.
In one embodiment, as shown in fig. 9, on the basis of fig. 7, the classification module 804 includes a category determination unit 8041 and an ordering unit 8042, wherein:
a category determining unit 8041, configured to determine the information category of each piece of important information according to the correspondence and the keyword of the article title of each piece of important information, so as to obtain the classified industry information;
the sorting unit 8042 is configured to sort the classified industry information according to the priority order of the information categories.
In one embodiment, as shown in fig. 10, on the basis of fig. 7, the pushing module 805 includes a fetching unit 8051 and a pushing unit 8052, where:
an obtaining unit 8051, configured to obtain summary information of each classified industry information;
the pushing unit 8052 is configured to push the summary information to the viewing system.
In one embodiment, as shown in fig. 11, the acquiring unit 8051 in the pushing module 805 includes an acquiring subunit 80511 and a determining processing subunit 80512 on the basis of fig. 10, where:
an acquiring subunit 80511, configured to acquire the kth paragraph information of each classified industry information;
a judgment processing subunit 80512, configured to judge whether the character length of the kth paragraph information is greater than or equal to a preset first threshold; if the character length of the kth paragraph information is smaller than a first threshold value, acquiring the (k + 1) th paragraph information, and judging whether the character length of the (k + 1) th paragraph information is larger than or equal to the first threshold value; if the character length of the kth paragraph information is larger than or equal to the first threshold, judging whether the character length of the kth paragraph information is smaller than or equal to a preset second threshold; if the character length of the kth paragraph information is less than or equal to a second threshold value, taking the kth paragraph information as summary information; if the character length of the kth paragraph information is larger than a second threshold value, splitting sentences in the kth paragraph information, combining the sentences obtained by splitting according to a preset rule, and determining the combined paragraph information as abstract information; the length of the combined paragraph information character is larger than or equal to a first threshold value and smaller than or equal to a second threshold value.
For the specific limitations of the information pushing apparatus, reference may be made to the above limitations of the information pushing method, which will not be described herein again. All or part of the modules in the information pushing device can be realized by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in fig. 12. The computer device comprises a processor, a memory, a network interface, a database, a display screen and an input device which are connected through a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement an information pushing method. The database of the computer equipment is used for storing information push data. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on the shell of the computer equipment, an external keyboard, a touch pad or a mouse and the like.
Those skilled in the art will appreciate that the architecture shown in fig. 12 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
In one embodiment, a computer device is provided, comprising a memory and a processor, the memory having a computer program stored therein, the processor implementing the following steps when executing the computer program:
acquiring industry information; the industry information comprises a website data source, keywords, article titles, article full texts, publishers, webpage links, publication time and crawling time;
removing the industry information according to a preset removing rule to obtain the removed industry information;
determining major information in the removed industry information;
classifying the major information in the eliminated industry information according to a preset classification rule to obtain classified industry information;
and pushing the classified industry information to a viewing system.
The implementation principle and technical effect of the computer device provided by the embodiment of the present application are similar to those of the method embodiment described above, and are not described herein again.
In one embodiment, a computer-readable storage medium is provided, having a computer program stored thereon, which when executed by a processor, performs the steps of:
acquiring industry information; the industry information comprises a website data source, keywords, article titles, article full texts, publishers, webpage links, publication time and crawling time;
removing the industry information according to a preset removing rule to obtain the removed industry information;
determining major information in the removed industry information;
classifying the major information in the eliminated industry information according to a preset classification rule to obtain classified industry information;
and pushing the classified industry information to a viewing system.
The implementation principle and technical effect of the computer-readable storage medium provided by this embodiment are similar to those of the above-described method embodiment, and are not described herein again.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (10)

1. An information pushing method, the method comprising:
acquiring industry information; the industry information comprises a website data source, keywords, article titles, article full texts, publishers, webpage links, publication time and crawling time;
removing the industry information according to a preset removing rule to obtain the removed industry information;
determining important information in the eliminated industry information;
classifying the major information in the eliminated industry information according to a preset classification rule to obtain classified industry information;
and pushing the classified industry information to a viewing system.
2. The method of claim 1, wherein the culling rules include at least one of:
eliminating the industry information containing preset stop words in the article title;
eliminating the industry information which does not contain preset industry related keywords in the article title;
and eliminating repeated industry information which is pushed within a preset time period.
3. The method according to claim 1 or 2, wherein the determining of the important information in the removed industry information comprises:
extracting industry key words in the article titles of the industry information after being removed;
comparing the industry keywords with industry major event keywords in a preset keyword dictionary;
and if the industry key words are consistent with the industry major event key words in the key word dictionary, determining the eliminated industry information as the major information.
4. The method according to claim 1 or 2, wherein the preset classification rule comprises: the corresponding relation between the industry key words of the seal title in the industry information and the information categories, and the priority order of each information category.
5. The method according to claim 4, wherein the classifying the important information in the eliminated industry information according to a preset classification rule to obtain the classified industry information comprises:
determining the information category of each important information according to the corresponding relation and the industry keywords of the article title of each important information to obtain the classified industry information;
and sequencing the classified industry information according to the priority order of the information categories.
6. The method of claim 1, wherein pushing the categorized business information to a viewing system comprises:
acquiring abstract information of each classified industry information;
and pushing the summary information to the viewing system.
7. The method of claim 6, wherein said obtaining summary information of each of said classified business information messages comprises:
acquiring the kth paragraph information of each classified industry information;
judging whether the character length of the kth paragraph information is larger than or equal to a preset first threshold value or not;
if the character length of the kth paragraph information is smaller than the first threshold, acquiring the (k + 1) th paragraph information, and judging whether the character length of the (k + 1) th paragraph information is larger than or equal to the first threshold;
if the character length of the kth paragraph information is larger than or equal to the first threshold, judging whether the character length of the kth paragraph information is smaller than or equal to a preset second threshold;
if the character length of the kth paragraph information is smaller than or equal to the second threshold, taking the kth paragraph information as the summary information;
if the character length of the kth paragraph information is larger than the second threshold, splitting the sentences in the kth paragraph information, combining the sentences obtained by splitting according to a preset rule, and determining the combined paragraph information as the summary information; the length of the combined paragraph information character is greater than or equal to the first threshold and less than or equal to the second threshold.
8. An information pushing apparatus, the apparatus comprising:
the acquisition module is used for acquiring industry information; the industry information comprises a website data source, keywords, article titles, article full texts, publishers, webpage links, publication time and crawling time;
the removing module is used for removing the industry information according to a preset removing rule to obtain the removed industry information;
the determining module is used for determining important information in the industry information after being eliminated;
the classification module is used for classifying the major information in the eliminated industry information according to a preset classification rule to obtain the classified industry information;
and the pushing module is used for pushing the classified industry information to a checking system.
9. A computer device comprising a memory and a processor, wherein the memory stores a computer program, and wherein the computer program when executed by the processor implements the information push method according to any one of claims 1 to 7.
10. A computer-readable storage medium, on which a computer program is stored, wherein the program, when executed by a processor, implements the information push method according to any one of claims 1 to 7.
CN201910988193.9A 2019-10-17 2019-10-17 Information pushing method and device, computer equipment and storage medium Active CN110837590B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910988193.9A CN110837590B (en) 2019-10-17 2019-10-17 Information pushing method and device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910988193.9A CN110837590B (en) 2019-10-17 2019-10-17 Information pushing method and device, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN110837590A true CN110837590A (en) 2020-02-25
CN110837590B CN110837590B (en) 2023-04-07

Family

ID=69575441

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910988193.9A Active CN110837590B (en) 2019-10-17 2019-10-17 Information pushing method and device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN110837590B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111611399A (en) * 2020-04-15 2020-09-01 广发证券股份有限公司 Information event mapping system and method based on natural language processing
CN112579941A (en) * 2020-12-17 2021-03-30 京东数字科技控股股份有限公司 Information processing method, device, equipment and storage medium
CN113961789A (en) * 2020-07-21 2022-01-21 北京市天元网络技术股份有限公司 Industry information early warning method and device, electronic equipment and storage medium
CN114386886A (en) * 2022-03-24 2022-04-22 杭州小电科技股份有限公司 Charging module management method, mobile power supply equipment, charging module management device and computer equipment

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101446959A (en) * 2008-12-30 2009-06-03 深圳市迅雷网络技术有限公司 Internet-based news recommendation method and system thereof
CN102831234A (en) * 2012-08-31 2012-12-19 北京邮电大学 Personalized news recommendation device and method based on news content and theme feature
CN103389975A (en) * 2012-05-07 2013-11-13 腾讯科技(深圳)有限公司 News recommending method and system
CN104679875A (en) * 2015-03-10 2015-06-03 杭州凡闻科技有限公司 Method for classifying information data based on digital newspaper
CN104809154A (en) * 2015-03-19 2015-07-29 百度在线网络技术(北京)有限公司 Method and device for recommending information
US20170091846A1 (en) * 2015-09-25 2017-03-30 Universal Entertainment Corporation Information provision system, information provision method, and storage medium
WO2018027362A1 (en) * 2016-08-06 2018-02-15 胡明祥 Moments information-based advertisement pushing method and pushing system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101446959A (en) * 2008-12-30 2009-06-03 深圳市迅雷网络技术有限公司 Internet-based news recommendation method and system thereof
CN103389975A (en) * 2012-05-07 2013-11-13 腾讯科技(深圳)有限公司 News recommending method and system
CN102831234A (en) * 2012-08-31 2012-12-19 北京邮电大学 Personalized news recommendation device and method based on news content and theme feature
CN104679875A (en) * 2015-03-10 2015-06-03 杭州凡闻科技有限公司 Method for classifying information data based on digital newspaper
CN104809154A (en) * 2015-03-19 2015-07-29 百度在线网络技术(北京)有限公司 Method and device for recommending information
US20170091846A1 (en) * 2015-09-25 2017-03-30 Universal Entertainment Corporation Information provision system, information provision method, and storage medium
WO2018027362A1 (en) * 2016-08-06 2018-02-15 胡明祥 Moments information-based advertisement pushing method and pushing system

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
D.R.CAMPBELL ETC.: "INFORMATION PUSH:STRATEGIES FOR IMPROVING EFFICIENCY OF DESIGN INFORMATION MANAGEMENT", 《FOURTH INTERNATION SEMINAR AND WORKSHOP》 *
WANG HAO ETC.: "Intelligent Information Recommendation Method in Web-Based Argumentation Support System", 《2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION》 *
井思源: "新闻资讯APP信息推送的运营策略研究", 《中国优秀硕士学位论文全文数据库(信息科技辑)》 *
涂新辉: "《基于概念的信息检索方法》", 30 April 2015, 华中师范大学出版社 *
陈铭权: "基于主题模型的用户兴趣建模及在新闻推荐中的应用", 《中国优秀硕士学位论文全文数据库(信息科技辑)》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111611399A (en) * 2020-04-15 2020-09-01 广发证券股份有限公司 Information event mapping system and method based on natural language processing
CN113961789A (en) * 2020-07-21 2022-01-21 北京市天元网络技术股份有限公司 Industry information early warning method and device, electronic equipment and storage medium
CN112579941A (en) * 2020-12-17 2021-03-30 京东数字科技控股股份有限公司 Information processing method, device, equipment and storage medium
CN114386886A (en) * 2022-03-24 2022-04-22 杭州小电科技股份有限公司 Charging module management method, mobile power supply equipment, charging module management device and computer equipment

Also Published As

Publication number Publication date
CN110837590B (en) 2023-04-07

Similar Documents

Publication Publication Date Title
CN110837590B (en) Information pushing method and device, computer equipment and storage medium
CN109815333B (en) Information acquisition method and device, computer equipment and storage medium
US8965894B2 (en) Automated web page classification
US9996588B2 (en) Managing a search
US10489510B2 (en) Sentiment analysis of product reviews from social media
CN108959559B (en) Question and answer pair generation method and device
CN112749284B (en) Knowledge graph construction method, device, equipment and storage medium
CN110377725B (en) Data generation method and device, computer equipment and storage medium
CN110674319A (en) Label determination method and device, computer equipment and storage medium
WO2015053236A1 (en) Device for collecting contradictory expression and computer program for same
CA3138556A1 (en) Apparatuses, storage medium and method of querying data based on vertical search
WO2016200667A1 (en) Identifying relationships using information extracted from documents
CN112651236B (en) Method and device for extracting text information, computer equipment and storage medium
Anderson et al. An intelligent online grooming detection system using AI technologies
CN111737548A (en) Click verification code identification method and device, computer equipment and storage medium
CN112818200A (en) Data crawling and event analyzing method and system based on static website
CN116151233A (en) Data labeling and generating method, model training method, device and medium
CN110851709B (en) Information pushing method and device, computer equipment and storage medium
Leopold et al. Refactoring of process model activity labels
Fatima et al. New graph-based text summarization method
US11625536B2 (en) System and method for identification and profiling adverse events
CN112579781A (en) Text classification method and device, electronic equipment and medium
Endalie et al. Hybrid feature selection for Amharic news document classification
Wulandhari et al. Corruption Cases Mapping Based on Indonesia’s Corruption Perception Index
CN114169331A (en) Address resolution method, device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant