CN105045890A - Method and device for determining hot news in target news source - Google Patents

Method and device for determining hot news in target news source Download PDF

Info

Publication number
CN105045890A
CN105045890A CN201510456929.XA CN201510456929A CN105045890A CN 105045890 A CN105045890 A CN 105045890A CN 201510456929 A CN201510456929 A CN 201510456929A CN 105045890 A CN105045890 A CN 105045890A
Authority
CN
China
Prior art keywords
news
hot
hot news
block
candidate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510456929.XA
Other languages
Chinese (zh)
Inventor
邢皖甲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201510456929.XA priority Critical patent/CN105045890A/en
Publication of CN105045890A publication Critical patent/CN105045890A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention aims to provide a method and a device for determining hot news in a target news source. Particularly, candidate hot news in the target news source is determined, wherein the candidate hot news is positioned in a hot news block of the target news source; and according to access characteristic information of the candidate hot news, the hot news in the candidate hot news is determined. Compared to the prior art, the candidate hot news in the target news source is determined, wherein the candidate hot news is positioned in the hot news block of the target news source; and according to the access characteristic information of the candidate hot news, the hot news in the candidate hot news is determined, so that automated mining of the hot news is realized, the identification rate of the hot news is increased, the identification cost is reduced, the efficiency for obtaining the hot news by users is improved, and the user experience is enhanced.

Description

Determine the method and apparatus of the hot news in targeted news source
Technical field
The present invention relates to Internet technical field, particularly relating to a kind of technology for determining the hot news in targeted news source.
Background technology
The determination of hot news is very easy to the acquisition of user to news information with providing.But, in prior art, usually adopt the mode of manual sorting to determine hot news, this mode obviously needs larger human cost, and ageing poor, can not in time for user provides hot news, correspondingly, the efficiency that user obtains hot news is also reduced.
Summary of the invention
An object of the present invention is to provide a kind of method and apparatus for determining the hot news in targeted news source.
According to an aspect of the present invention, provide a kind of method for determining the hot news in targeted news source, wherein, the method comprises:
Determine the candidate's hot news in targeted news source, wherein, described candidate's hot news is arranged in the hot news block in described targeted news source;
According to the access characteristic information of described candidate's hot news, from described candidate's hot news, determine hot news.
According to a further aspect in the invention, additionally provide a kind of focus determination equipment for determining the hot news in targeted news source, wherein, this focus determination equipment comprises:
For determining the device of the candidate's hot news in targeted news source, wherein, described candidate's hot news is arranged in the hot news block in described targeted news source;
For the access characteristic information according to described candidate's hot news, from described candidate's hot news, determine the device of hot news.
Compared with prior art, one embodiment of the present of invention are by determining the candidate's hot news in targeted news source, wherein, described candidate's hot news is arranged in the hot news block in described targeted news source, thus according to the access characteristic information of described candidate's hot news, hot news is determined from described candidate's hot news, achieve the automatic excavating of hot news, improve the discrimination of hot news, and reduce identification cost, also improve the efficiency that user obtains hot news, and improve Consumer's Experience.
Accompanying drawing explanation
By reading the detailed description done non-limiting example done with reference to the following drawings, other features, objects and advantages of the present invention will become more obvious:
Fig. 1 illustrates the equipment schematic diagram of a kind of focus determination equipment for determining the hot news in targeted news source according to one aspect of the invention;
Fig. 2 illustrates the equipment schematic diagram of a kind of focus determination equipment for determining the hot news in targeted news source in accordance with a preferred embodiment of the present invention;
Fig. 3 illustrates a kind of method flow diagram for determining the hot news in targeted news source according to a further aspect of the present invention;
Fig. 4 illustrates a kind of method flow diagram for determining the hot news in targeted news source in accordance with a preferred embodiment of the present invention.
In accompanying drawing, same or analogous Reference numeral represents same or analogous parts.
Embodiment
Below in conjunction with accompanying drawing, the present invention is described in further detail.
Fig. 1 illustrates a kind of focus determination equipment 1 for determining the hot news in targeted news source according to one aspect of the invention, wherein, focus determination equipment 1 comprises the device (hereinafter referred to as " candidate's determining device 11 ") for determining the candidate's hot news in targeted news source, wherein, described candidate's hot news is arranged in the hot news block in described targeted news source; For the access characteristic information according to described candidate's hot news, from described candidate's hot news, determine the device (hereinafter referred to as " focus determining device 12 ") of hot news.
Particularly, candidate's determining device 11 determines the candidate's hot news in targeted news source, and wherein, described candidate's hot news is arranged in the hot news block in described targeted news source; Focus determining device 12, according to the access characteristic information of described candidate's hot news, determines hot news from described candidate's hot news.
At this, focus determination equipment 1 includes but not limited to that the network equipment, subscriber equipment or the network equipment and subscriber equipment are by the mutually integrated equipment formed of network.At this, the described network equipment includes but not limited to as network host, single network server, multiple webserver collection or the realization such as set of computers based on cloud computing; Or realized by subscriber equipment.At this, cloud is formed by based on a large amount of main frame of cloud computing (CloudComputing) or the webserver, and wherein, cloud computing is the one of Distributed Calculation, the super virtual machine be made up of a group loosely-coupled computing machine collection.At this, described subscriber equipment can be that any one can to carry out the electronic product of man-machine interaction, such as computing machine, mobile phone, smart mobile phone, PDA, wearable device, palm PC PPC or panel computer etc. with user by modes such as keyboard, mouse, touch pad, touch-screen or handwriting equipments.Described network includes but not limited to internet, wide area network, Metropolitan Area Network (MAN), LAN (Local Area Network), VPN, wireless self-organization network (AdHoc network) etc.Those skilled in the art will be understood that above-mentioned focus determination equipment 1 is only citing; other network equipments that are existing or that may occur from now on or subscriber equipment are as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.At this, the network equipment and subscriber equipment include a kind of can according in advance setting or the instruction stored, automatically carry out the electronic equipment of numerical evaluation and information processing, its hardware includes but not limited to microprocessor, special IC (ASIC), programmable gate array (FPGA), digital processing unit (DSP), embedded device etc.
Particularly, candidate's determining device 11 determines the candidate's hot news in targeted news source, and wherein, described candidate's hot news is arranged in the hot news block in described targeted news source.
At this, described targeted news source refer to can publish news and browse for the network user website (news portal as large-scale in country, business door, local items door etc.), the page, news app etc.
At this, described candidate's hot news refers to it is likely the news of hot news.
At this, described hot news block refer to specify in described targeted news source or targeted news source is carried out to page analysis obtains, publish the region of hot news.
Those skilled in the art will be understood that above-mentioned targeted news source, hot news block is only citing; other targeted news sources that are existing or that may occur from now on or hot news block are as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
Such as, for targeted news source as news website news-page1, hot news block predetermined in this website is focus module hot-news-module, then candidate's determining device 11 can will be positioned at all news of hot news block and focus module hot-news-module if new1-new10 is all as candidate's hot news of this news website news-page1 in news website news-page1.
Those skilled in the art will be understood that the mode of the above-mentioned candidate's hot news determined in targeted news source is only citing; other are existing or may occur that the mode of the candidate's hot news really set the goal in news sources is as being applicable to the present invention from now on; also within scope should being included in, and this is contained at this with way of reference.
Focus determining device 12, according to the access characteristic information of described candidate's hot news, determines hot news from described candidate's hot news.
At this, described hot news refers to the news comparing and pay close attention to by user or welcome.
At this, the access characteristic information of described candidate's hot news refers to the access feedback information of user to this candidate's hot news, as amount of reading/reading frequency, number of reviews/comment frequency, the amount of sharing/share frequency etc.Those skilled in the art will be understood that above-mentioned access characteristic information is only citing, and other access characteristic information that are existing or that may occur from now on, as being applicable to the present invention, within also should being included in scope, and are contained in this at this with way of reference.
At this, focus determining device 12 determines that from described candidate's hot news the mode of hot news includes but not limited to following at least any one:
1) according to the access characteristic information of described candidate's hot news, in conjunction with the Aging Characteristic information of described candidate's hot news, from described candidate's hot news, hot news is determined.
At this, the Aging Characteristic information of described candidate's hot news refer to this candidate's hot news issuing time and/or from its be published to can from network crawled to the time etc. experienced.In a particular embodiment, the issuing time of candidate's hot news is relatively the closer to current time, and its probability belonging to hot news is larger; Candidate's hot news from its be published to can from network crawled to the time experienced shorter, its probability belonging to hot news is also larger.
Such as, for targeted news source as news website news-page1, candidate's determining device 11 determines that the candidate's hot news in this targeted news source is new1-new10, suppose that the issuing time of new2-new5 in candidate hot news new1-new10 is relatively near current time, then focus determining device 12 determines that candidate hot news new2-new5 is hot news.
2) according to the access characteristic information of described candidate's hot news, in conjunction with the focus class information of described candidate's hot news, from described candidate's hot news, hot news is determined.
Such as, for targeted news source as news website news-page1, candidate's determining device 11 determines that the candidate's hot news in this targeted news source is new1-new10, suppose that the focus grade of new3-new5 in candidate hot news new1-new10 is higher than other candidate's hot news, then focus determining device 12 can determine that candidate hot news new3-new5 is hot news.
Those skilled in the art will be understood that and above-mentionedly from candidate's hot news, determine that the mode of hot news is only citing; other existing or may occur from now on from candidate's hot news, determine that the mode of hot news is as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
At this, it will be appreciated by those skilled in the art that in a particular embodiment, described access characteristic information, described Aging Characteristic information and the triplicity of described focus class information also can be got up to determine whether candidate's hot news is hot news by the present invention.
Preferably, focus determination equipment 1 also comprises the issue operational ton information for being published in related news source according to described candidate's hot news, determine the device (hereinafter referred to as " focus grade determining device ", not shown) of described focus class information.Particularly, the issue operational ton information that focus grade determining device is published in related news source according to described candidate's hot news, determines described focus class information.
At this, described related news source refers to other news sources being different from described targeted news source.At this, described issue operational ton information refers to the information such as total degree, issue/renewal frequency that described candidate's hot news is published in related news source.In a particular embodiment, described focus class information can have certain corresponding relation with described issue operational ton information, as the focus grade news that is I level has the issue operational ton information of certain scope.Those skilled in the art will be understood that above-mentioned issue operational ton information is only citing; other issue operational ton information that are existing or that may occur from now on are as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
Such as, for candidate hot news new2 and new3, suppose that the total degree that candidate hot news new2 is published in related news source is 100 times, and the total degree that candidate hot news new3 is published in related news source is 30 times, focus grade is the total degree be published in related news source corresponding to the news of I level is [50, + ∞), and focus grade is the news of II level, and the corresponding total degree be published in related news source is [20, 50), then focus grade determining device can determine that the focus class information of candidate hot news new2 and new3 is respectively I level and II level.
Those skilled in the art will be understood that and above-mentionedly determine that the mode of described focus class information is only citing; described in other determinations that are existing or that may occur from now on, the mode of focus class information is as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
At this, present invention achieves the automatic excavating of hot news, improve the discrimination of hot news, and reduce identification cost.
Constant work between each device of focus determination equipment 1.Particularly, candidate's determining device 11 continues the candidate's hot news determined in targeted news source, and wherein, described candidate's hot news is arranged in the hot news block in described targeted news source; Focus determining device 12 continues the access characteristic information according to described candidate's hot news, from described candidate's hot news, determine hot news.At this, it will be understood by those skilled in the art that, the determination of candidate's hot news, the determination of hot news is constantly carried out respectively, until focus determination equipment 1 stops determining described candidate's hot news in a long time between each device that described " continuing " refers to focus determination equipment 1.
Preferably, focus determination equipment 1 also comprises for according to the hot news determined from multiple news sources, sets up or upgrade the device (hereinafter referred to as " hot news storehouse apparatus for establishing ", not shown) in hot news storehouse.Particularly, hot news storehouse apparatus for establishing, according to the hot news determined from multiple news sources, is set up or upgrades hot news storehouse, as being order by the focus class information of hot news, the hot information determined is arranged from multiple news sources.
At this, described hot news storehouse can be used for when user accesses news website or open news app client, news focus higher grade in hot news storehouse is initiatively supplied to user, also can be used for when user inquires about hot news, matching inquiry is carried out, to improve the accuracy of the efficiency providing hot news to user and the hot news provided from this storehouse.
Fig. 2 illustrates the equipment schematic diagram of a kind of focus determination equipment for determining the hot news in targeted news source in accordance with a preferred embodiment of the present invention, wherein, focus determination equipment 1 comprises candidate's determining device 11 ' and focus determining device 12 ', wherein, candidate's determining device 11 ' comprises unit (hereinafter referred to as " the first determining unit 111 ' ") for determining the hot news block in targeted news source and for determining the candidate's hot news in described hot news block, using the unit (hereinafter referred to as " the second determining unit 112 ' ") as the candidate's hot news in described targeted news source.Particularly, the first determining unit 111 ' determines the hot news block in targeted news source; Second determining unit 112 ' determines the candidate's hot news in described hot news block, using as the candidate's hot news in described targeted news source; Focus determining device 12 ', according to the access characteristic information of described candidate's hot news, determines hot news from described candidate's hot news.At this, it will be appreciated by those skilled in the art that focus determining device 12 ' is identical or substantially identical with the content of corresponding intrument in Fig. 1 embodiment, for simplicity's sake, therefore does not repeat them here.
Particularly, first determining unit 111 ' determines the hot news block in targeted news source, as using the physical block in targeted news source as described in hot news block, or, also whether can meet predetermined focus block judgment rule according to the news block in targeted news source, determine the hot news block in this targeted news source; Wherein, described predetermined focus block judgment rule comprises following at least any one:
News block described in-Ruo comprises predetermined focus block identification information, then this news block belongs to hot news block;
News block described in-Ruo belongs to the focus block of specifying, then this news block belongs to hot news block.
Such as, for targeted news source as news website news-page1, suppose that the first determining unit 111 ' carries out page analysis to this news website, such as find that the news block news-module-1 in this website comprises predetermined focus block identification information according to the css of the page or dom tree node, then the first determining unit 111 ' determines that news block news-module-1 is the hot news block in news website news-page1.At this, whether described predetermined focus block identification information belongs to hot news block for identifying news block, its can be hot character mark,! Number mark etc.At this; those skilled in the art will be understood that above-mentioned focus block identification information is only citing; other focus block identification information that are existing or that may occur from now on, as being applicable to the present invention, within also should being included in scope, and are contained in this at this with way of reference.
For another example, for targeted news source as news website news-page1, suppose that the news block news-module-2 in this website belongs to the focus block of specifying, the focus block of human configuration in this way, then the first determining unit 111 ' determines that news block news-module-2 is the hot news block in news website news-page1.
Those skilled in the art will be understood that the mode of the above-mentioned hot news block determined in targeted news source is only citing; other are existing or may occur that the mode of the hot news block really set the goal in news sources is as being applicable to the present invention from now on; also within scope should being included in, and this is contained at this with way of reference.
Second determining unit 112 ' determines the candidate's hot news in described hot news block, all news as will be described in hot news block all as described candidate's hot news, using as the candidate's hot news in described targeted news source; Or, also according to the focus characteristic information of news in described hot news block, described candidate's hot news can be determined.Preferably, described focus characteristic information comprises following at least any one:
The title style information of news in-described hot news block;
The focus identification information of news in-described hot news block.
At this, described title style information comprises the whether overstriking of the font size of title, font, the title whether information such as highlighted display.
At this, whether described focus identification information belongs to hot news for identifying news, its can be hot character mark,! Number any mark such as mark, red blockage.
Such as, for targeted news source as news website news-page1, its hot news block is news block news-module-1, suppose that the second determining unit 112 ' finds that the title of news new1 ' in this hot news block and new2 ' is highlighted display, or, the font of title is overstriking, or, there is focus identification information as hot mark etc., then the second determining unit 112 ' can determine that news new1 ' and new2 ' is for the candidate's hot news in news block news-module-1, thus obtain the candidate hot news of targeted news source as news website news-page1.
Those skilled in the art will be understood that and above-mentionedly determine that the mode of the candidate's hot news in described hot news block is only citing; the mode of the candidate's hot news in hot news block described in other determinations that are existing or that may occur from now on is as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
Fig. 3 illustrates a kind of method flow diagram for determining the hot news in targeted news source according to a further aspect of the present invention.
Wherein, the method comprising the steps of S1 and step S2.Particularly, in step sl, focus determination equipment 1 determines the candidate's hot news in targeted news source, and wherein, described candidate's hot news is arranged in the hot news block in described targeted news source; In step s 2, focus determination equipment 1, according to the access characteristic information of described candidate's hot news, determines hot news from described candidate's hot news.
At this, focus determination equipment 1 includes but not limited to that the network equipment, subscriber equipment or the network equipment and subscriber equipment are by the mutually integrated equipment formed of network.At this, the described network equipment includes but not limited to as network host, single network server, multiple webserver collection or the realization such as set of computers based on cloud computing; Or realized by subscriber equipment.At this, cloud is formed by based on a large amount of main frame of cloud computing (CloudComputing) or the webserver, and wherein, cloud computing is the one of Distributed Calculation, the super virtual machine be made up of a group loosely-coupled computing machine collection.At this, described subscriber equipment can be that any one can to carry out the electronic product of man-machine interaction, such as computing machine, mobile phone, smart mobile phone, PDA, wearable device, palm PC PPC or panel computer etc. with user by modes such as keyboard, mouse, touch pad, touch-screen or handwriting equipments.Described network includes but not limited to internet, wide area network, Metropolitan Area Network (MAN), LAN (Local Area Network), VPN, wireless self-organization network (AdHoc network) etc.Those skilled in the art will be understood that above-mentioned focus determination equipment 1 is only citing; other network equipments that are existing or that may occur from now on or subscriber equipment are as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.At this, the network equipment and subscriber equipment include a kind of can according in advance setting or the instruction stored, automatically carry out the electronic equipment of numerical evaluation and information processing, its hardware includes but not limited to microprocessor, special IC (ASIC), programmable gate array (FPGA), digital processing unit (DSP), embedded device etc.
Particularly, in step sl, focus determination equipment 1 determines the candidate's hot news in targeted news source, and wherein, described candidate's hot news is arranged in the hot news block in described targeted news source.
At this, described targeted news source refer to can publish news and browse for the network user website (news portal as large-scale in country, business door, local items door etc.), the page, news app etc.
At this, described candidate's hot news refers to it is likely the news of hot news.
At this, described hot news block refer to specify in described targeted news source or targeted news source is carried out to page analysis obtains, publish the region of hot news.
Those skilled in the art will be understood that above-mentioned targeted news source, hot news block is only citing; other targeted news sources that are existing or that may occur from now on or hot news block are as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
Such as, for targeted news source as news website news-page1, hot news block predetermined in this website is focus module hot-news-module, then in step sl, focus determination equipment 1 can will be positioned at all news of hot news block and focus module hot-news-module if new1-new10 is all as candidate's hot news of this news website news-page1 in news website news-page1.
Those skilled in the art will be understood that the mode of the above-mentioned candidate's hot news determined in targeted news source is only citing; other are existing or may occur that the mode of the candidate's hot news really set the goal in news sources is as being applicable to the present invention from now on; also within scope should being included in, and this is contained at this with way of reference.
In step s 2, focus determination equipment 1, according to the access characteristic information of described candidate's hot news, determines hot news from described candidate's hot news.
At this, described hot news refers to the news comparing and pay close attention to by user or welcome.
At this, the access characteristic information of described candidate's hot news refers to the access feedback information of user to this candidate's hot news, as amount of reading/reading frequency, number of reviews/comment frequency, the amount of sharing/share frequency etc.Those skilled in the art will be understood that above-mentioned access characteristic information is only citing, and other access characteristic information that are existing or that may occur from now on, as being applicable to the present invention, within also should being included in scope, and are contained in this at this with way of reference.
At this, in step s 2, focus determination equipment 1 determines that from described candidate's hot news the mode of hot news includes but not limited to following at least any one:
1) according to the access characteristic information of described candidate's hot news, in conjunction with the Aging Characteristic information of described candidate's hot news, from described candidate's hot news, hot news is determined.
At this, the Aging Characteristic information of described candidate's hot news refer to this candidate's hot news issuing time and/or from its be published to can from network crawled to the time etc. experienced.In a particular embodiment, the issuing time of candidate's hot news is relatively the closer to current time, and its probability belonging to hot news is larger; Candidate's hot news from its be published to can from network crawled to the time experienced shorter, its probability belonging to hot news is also larger.
Such as, for targeted news source as news website news-page1, in step sl, focus determination equipment 1 determines that the candidate's hot news in this targeted news source is new1-new10, suppose that the issuing time of new2-new5 in candidate hot news new1-new10 is relatively near current time, then in step s 2, focus determination equipment 1 determines that candidate hot news new2-new5 is hot news.
2) according to the access characteristic information of described candidate's hot news, in conjunction with the focus class information of described candidate's hot news, from described candidate's hot news, hot news is determined.
Such as, for targeted news source as news website news-page1, in step sl, focus determination equipment 1 determines that the candidate's hot news in this targeted news source is new1-new10, suppose that the focus grade of new3-new5 in candidate hot news new1-new10 is higher than other candidate's hot news, then in step s 2, focus determination equipment 1 can determine that candidate hot news new3-new5 is hot news.
Those skilled in the art will be understood that and above-mentionedly from candidate's hot news, determine that the mode of hot news is only citing; other existing or may occur from now on from candidate's hot news, determine that the mode of hot news is as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
At this, it will be appreciated by those skilled in the art that in a particular embodiment, described access characteristic information, described Aging Characteristic information and the triplicity of described focus class information also can be got up to determine whether candidate's hot news is hot news by the present invention.
Preferably, the method also comprises step S3 (not shown).Particularly, in step s3, the issue operational ton information that focus determination equipment 1 is published in related news source according to described candidate's hot news, determines described focus class information.
At this, described related news source refers to other news sources being different from described targeted news source.At this, described issue operational ton information refers to the information such as total degree, issue/renewal frequency that described candidate's hot news is published in related news source.In a particular embodiment, described focus class information can have certain corresponding relation with described issue operational ton information, as the focus grade news that is I level has the issue operational ton information of certain scope.Those skilled in the art will be understood that above-mentioned issue operational ton information is only citing; other issue operational ton information that are existing or that may occur from now on are as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
Such as, for candidate hot news new2 and new3, suppose that the total degree that candidate hot news new2 is published in related news source is 100 times, and the total degree that candidate hot news new3 is published in related news source is 30 times, focus grade is the total degree be published in related news source corresponding to the news of I level is [50, + ∞), and focus grade is the news of II level, and the corresponding total degree be published in related news source is [20, 50), then in step s3, focus determination equipment 1 can determine that the focus class information of candidate hot news new2 and new3 is respectively I level and II level.
Those skilled in the art will be understood that and above-mentionedly determine that the mode of described focus class information is only citing; described in other determinations that are existing or that may occur from now on, the mode of focus class information is as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
At this, present invention achieves the automatic excavating of hot news, improve the discrimination of hot news, and reduce identification cost.
Constant work between each step of the method.Particularly, in step sl, focus determination equipment 1 continues the candidate's hot news determined in targeted news source, and wherein, described candidate's hot news is arranged in the hot news block in described targeted news source; In step s 2, focus determination equipment 1 continues the access characteristic information according to described candidate's hot news, from described candidate's hot news, determine hot news.At this, it will be understood by those skilled in the art that, the determination of candidate's hot news, the determination of hot news is constantly carried out respectively, until focus determination equipment 1 stops determining described candidate's hot news in a long time between each step that described " continuing " refers to the method.
Preferably, focus determination equipment 1 also comprises step S4 (not shown).Particularly, in step s 4 which, focus determination equipment 1, according to the hot news determined from multiple news sources, is set up or upgrades hot news storehouse, as being order by the focus class information of hot news, the hot information determined is arranged from multiple news sources.
At this, described hot news storehouse can be used for when user accesses news website or open news app client, news focus higher grade in hot news storehouse is initiatively supplied to user, also can be used for when user inquires about hot news, matching inquiry is carried out, to improve the accuracy of the efficiency providing hot news to user and the hot news provided from this storehouse.
Fig. 4 illustrates a kind of method flow diagram for determining the hot news in targeted news source in accordance with a preferred embodiment of the present invention.
Wherein, the method comprising the steps of S1 ' and step S2 ', wherein, step S1 ' comprises step S11 ' and step S12 '.Particularly, in step S11 ', focus determination equipment 1 determines the hot news block in targeted news source; In step S12 ', focus determination equipment 1 determines the candidate's hot news in described hot news block, using as the candidate's hot news in described targeted news source; In step S2 ', focus determination equipment 1, according to the access characteristic information of described candidate's hot news, determines hot news from described candidate's hot news.At this, it will be appreciated by those skilled in the art that step S2 ' is identical or substantially identical with the content of corresponding step in Fig. 3 embodiment, for simplicity's sake, therefore does not repeat them here.
Particularly, in step S11 ', focus determination equipment 1 determines the hot news block in targeted news source, as using the physical block in targeted news source as described in hot news block, or, also whether can meet predetermined focus block judgment rule according to the news block in targeted news source, determine the hot news block in this targeted news source; Wherein, described predetermined focus block judgment rule comprises following at least any one:
News block described in-Ruo comprises predetermined focus block identification information, then this news block belongs to hot news block;
News block described in-Ruo belongs to the focus block of specifying, then this news block belongs to hot news block.
Such as, for targeted news source as news website news-page1, suppose in step S11 ', focus determination equipment 1 carries out page analysis to this news website, such as find that the news block news-module-1 in this website comprises predetermined focus block identification information according to the css of the page or dom tree node, then in step S11 ', focus determination equipment 1 determines that news block news-module-1 is the hot news block in news website news-page1.At this, whether described predetermined focus block identification information belongs to hot news block for identifying news block, its can be hot character mark,! Number mark etc.At this; those skilled in the art will be understood that above-mentioned focus block identification information is only citing; other focus block identification information that are existing or that may occur from now on, as being applicable to the present invention, within also should being included in scope, and are contained in this at this with way of reference.
For another example, for targeted news source as news website news-page1, suppose that the news block news-module-2 in this website belongs to the focus block of specifying, the focus block of human configuration in this way, then in step S11 ', focus determination equipment 1 determines that news block news-module-2 is the hot news block in news website news-page1.
Those skilled in the art will be understood that the mode of the above-mentioned hot news block determined in targeted news source is only citing; other are existing or may occur that the mode of the hot news block really set the goal in news sources is as being applicable to the present invention from now on; also within scope should being included in, and this is contained at this with way of reference.
In step S12 ', focus determination equipment 1 determines the candidate's hot news in described hot news block, all news as will be described in hot news block all as described candidate's hot news, using as the candidate's hot news in described targeted news source; Or, also according to the focus characteristic information of news in described hot news block, described candidate's hot news can be determined.Preferably, described focus characteristic information comprises following at least any one:
The title style information of news in-described hot news block;
The focus identification information of news in-described hot news block.
At this, described title style information comprises the whether overstriking of the font size of title, font, the title whether information such as highlighted display.
At this, whether described focus identification information belongs to hot news for identifying news, its can be hot character mark,! Number any mark such as mark, red blockage.
Such as, for targeted news source as news website news-page1, its hot news block is news block news-module-1, suppose in step S12 ', focus determination equipment 1 finds that the title of news new1 ' in this hot news block and new2 ' is highlighted display, or, the font of title is overstriking, or, there is focus identification information as hot mark etc., then in step S12 ', focus determination equipment 1 can determine that news new1 ' and new2 ' is for the candidate's hot news in news block news-module-1, thus obtain the candidate hot news of targeted news source as news website news-page1.
Those skilled in the art will be understood that and above-mentionedly determine that the mode of the candidate's hot news in described hot news block is only citing; the mode of the candidate's hot news in hot news block described in other determinations that are existing or that may occur from now on is as being applicable to the present invention; also within scope should being included in, and this is contained at this with way of reference.
It should be noted that the present invention can be implemented in the assembly of software and/or software restraint, such as, special IC (ASIC), general object computing machine or any other similar hardware device can be adopted to realize.In one embodiment, software program of the present invention can perform to realize step mentioned above or function by processor.Similarly, software program of the present invention (comprising relevant data structure) can be stored in computer readable recording medium storing program for performing, such as, and RAM storer, magnetic or CD-ROM driver or flexible plastic disc and similar devices.In addition, steps more of the present invention or function can adopt hardware to realize, such as, as coordinating with processor thus performing the circuit of each step or function.
In addition, a part of the present invention can be applied to computer program, such as computer program instructions, when it is performed by computing machine, by the operation of this computing machine, can call or provide according to method of the present invention and/or technical scheme.And call the programmed instruction of method of the present invention, may be stored in fixing or moveable recording medium, and/or be transmitted by the data stream in broadcast or other signal bearing medias, and/or be stored in the working storage of the computer equipment run according to described programmed instruction.At this, comprise a device according to one embodiment of present invention, this device comprises the storer for storing computer program instructions and the processor for execution of program instructions, wherein, when this computer program instructions is performed by this processor, trigger this plant running based on the aforementioned method according to multiple embodiment of the present invention and/or technical scheme.
To those skilled in the art, obviously the invention is not restricted to the details of above-mentioned one exemplary embodiment, and when not deviating from spirit of the present invention or essential characteristic, the present invention can be realized in other specific forms.Therefore, no matter from which point, all should embodiment be regarded as exemplary, and be nonrestrictive, scope of the present invention is limited by claims instead of above-mentioned explanation, and all changes be therefore intended in the implication of the equivalency by dropping on claim and scope are included in the present invention.Any Reference numeral in claim should be considered as the claim involved by limiting.In addition, obviously " comprising " one word do not get rid of other unit or step, odd number does not get rid of plural number.Multiple unit of stating in device claim or device also can be realized by software or hardware by a unit or device.First, second word such as grade is used for representing title, and does not represent any specific order.

Claims (18)

1., for determining a method for the hot news in targeted news source, wherein, the method comprises:
Determine the candidate's hot news in targeted news source, wherein, described candidate's hot news is arranged in the hot news block in described targeted news source;
According to the access characteristic information of described candidate's hot news, from described candidate's hot news, determine hot news.
2. method according to claim 1, wherein, determine that the candidate's hot news in targeted news source comprises:
-hot news the block determining in targeted news source;
-candidate's the hot news determining in described hot news block, using as the candidate's hot news in described targeted news source.
3. method according to claim 2, wherein, determine that the hot news block in targeted news source comprises:
-whether meet predetermined focus block judgment rule according to the news block in targeted news source, determine the hot news block in this targeted news source;
Wherein, described predetermined focus block judgment rule comprises following at least any one:
News block described in-Ruo comprises predetermined focus block identification information, then this news block belongs to hot news block;
News block described in-Ruo belongs to the focus block of specifying, then this news block belongs to hot news block.
4. according to the method in claim 2 or 3, wherein, determine that the candidate's hot news in described hot news block comprises:
-according to the focus characteristic information of news in described hot news block, determine described candidate's hot news.
5. method according to claim 4, wherein, described focus characteristic information comprises following at least any one:
The title style information of news in-described hot news block;
The focus identification information of news in-described hot news block.
6. method according to any one of claim 1 to 5, wherein, from described candidate's hot news, determine that hot news comprises:
-according to the access characteristic information of described candidate's hot news, in conjunction with the Aging Characteristic information of described candidate's hot news, from described candidate's hot news, determine hot news.
7. method according to any one of claim 1 to 6, wherein, from described candidate's hot news, determine that hot news comprises:
-according to the access characteristic information of described candidate's hot news, in conjunction with the focus class information of described candidate's hot news, from described candidate's hot news, determine hot news.
8. method according to claim 7, wherein, the method also comprises:
-issue operational ton the information that is published in related news source according to described candidate's hot news, determines described focus class information.
9. method according to any one of claim 1 to 8, wherein, the method also comprises:
According to the hot news determined from multiple news sources, set up or upgrade hot news storehouse.
10. for determining a focus determination equipment for the hot news in targeted news source, wherein, this focus determination equipment comprises:
For determining the device of the candidate's hot news in targeted news source, wherein, described candidate's hot news is arranged in the hot news block in described targeted news source;
For the access characteristic information according to described candidate's hot news, from described candidate's hot news, determine the device of hot news.
11. focus determination equipment according to claim 10, wherein, determine that the device of the candidate's hot news in targeted news source comprises:
-for determining the unit of the hot news block in targeted news source;
-for determining the candidate's hot news in described hot news block, using the unit as the candidate's hot news in described targeted news source.
12. focus determination equipment according to claim 11, wherein, determine the unit of the hot news block in targeted news source for:
-whether meet predetermined focus block judgment rule according to the news block in targeted news source, determine the hot news block in this targeted news source;
Wherein, described predetermined focus block judgment rule comprises following at least any one:
News block described in-Ruo comprises predetermined focus block identification information, then this news block belongs to hot news block;
News block described in-Ruo belongs to the focus block of specifying, then this news block belongs to hot news block.
13. focus determination equipment according to claim 11 or 12, wherein, determine the unit of the candidate's hot news in described hot news block for:
-according to the focus characteristic information of news in described hot news block, determine described candidate's hot news.
14. focus determination equipment according to claim 13, wherein, described focus characteristic information comprises following at least any one:
The title style information of news in-described hot news block;
The focus identification information of news in-described hot news block.
15. according to claim 10 to the focus determination equipment according to any one of 14, wherein, determine from described candidate's hot news the device of hot news for:
-according to the access characteristic information of described candidate's hot news, in conjunction with the Aging Characteristic information of described candidate's hot news, from described candidate's hot news, determine hot news.
16. according to claim 10 to the focus determination equipment according to any one of 15, wherein, determine from described candidate's hot news the device of hot news for:
-according to the access characteristic information of described candidate's hot news, in conjunction with the focus class information of described candidate's hot news, from described candidate's hot news, determine hot news.
17. focus determination equipment according to claim 16, wherein, this focus determination equipment also comprises:
-issue operational ton information for being published in related news source according to described candidate's hot news, determines the device of described focus class information.
18. according to claim 10 to the focus determination equipment according to any one of 17, and wherein, this focus determination equipment also comprises:
For according to the hot news determined from multiple news sources, set up or upgrade the device in hot news storehouse.
CN201510456929.XA 2015-07-29 2015-07-29 Method and device for determining hot news in target news source Pending CN105045890A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510456929.XA CN105045890A (en) 2015-07-29 2015-07-29 Method and device for determining hot news in target news source

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510456929.XA CN105045890A (en) 2015-07-29 2015-07-29 Method and device for determining hot news in target news source

Publications (1)

Publication Number Publication Date
CN105045890A true CN105045890A (en) 2015-11-11

Family

ID=54452437

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510456929.XA Pending CN105045890A (en) 2015-07-29 2015-07-29 Method and device for determining hot news in target news source

Country Status (1)

Country Link
CN (1) CN105045890A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106021425A (en) * 2016-05-13 2016-10-12 北京奇虎科技有限公司 Hot news mining method and device
CN107784010A (en) * 2016-08-29 2018-03-09 上海掌门科技有限公司 A kind of method and apparatus for being used to determine the temperature information of theme of news
CN108897774A (en) * 2018-05-31 2018-11-27 腾讯科技(深圳)有限公司 A kind of method, equipment and storage medium obtaining hot news
US11308164B2 (en) 2018-09-17 2022-04-19 Yandex Europe Ag Method and system for generating push notifications related to digital news

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102004792A (en) * 2010-12-07 2011-04-06 百度在线网络技术(北京)有限公司 Method and system for generating hot-searching word
CN102436601A (en) * 2011-11-09 2012-05-02 江苏联著实业有限公司 Mobile Internet news value evaluation system
CN103020090A (en) * 2011-09-27 2013-04-03 腾讯科技(深圳)有限公司 Method and device for providing link recommendation
CN103164427A (en) * 2011-12-13 2013-06-19 中国移动通信集团公司 Method and device of news aggregation
CN103324637A (en) * 2012-03-23 2013-09-25 腾讯科技(深圳)有限公司 Method and system for mining hotspot message
CN103577501A (en) * 2012-08-10 2014-02-12 深圳市世纪光速信息技术有限公司 Hot topic searching system and hot topic searching method
CN104657496A (en) * 2015-03-09 2015-05-27 杭州朗和科技有限公司 Method and equipment for calculating information hot value

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102004792A (en) * 2010-12-07 2011-04-06 百度在线网络技术(北京)有限公司 Method and system for generating hot-searching word
CN103020090A (en) * 2011-09-27 2013-04-03 腾讯科技(深圳)有限公司 Method and device for providing link recommendation
CN102436601A (en) * 2011-11-09 2012-05-02 江苏联著实业有限公司 Mobile Internet news value evaluation system
CN103164427A (en) * 2011-12-13 2013-06-19 中国移动通信集团公司 Method and device of news aggregation
CN103324637A (en) * 2012-03-23 2013-09-25 腾讯科技(深圳)有限公司 Method and system for mining hotspot message
CN103577501A (en) * 2012-08-10 2014-02-12 深圳市世纪光速信息技术有限公司 Hot topic searching system and hot topic searching method
CN104657496A (en) * 2015-03-09 2015-05-27 杭州朗和科技有限公司 Method and equipment for calculating information hot value

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106021425A (en) * 2016-05-13 2016-10-12 北京奇虎科技有限公司 Hot news mining method and device
CN107784010A (en) * 2016-08-29 2018-03-09 上海掌门科技有限公司 A kind of method and apparatus for being used to determine the temperature information of theme of news
CN108897774A (en) * 2018-05-31 2018-11-27 腾讯科技(深圳)有限公司 A kind of method, equipment and storage medium obtaining hot news
US11308164B2 (en) 2018-09-17 2022-04-19 Yandex Europe Ag Method and system for generating push notifications related to digital news

Similar Documents

Publication Publication Date Title
EP2940557B1 (en) Method and device used for providing input candidate item corresponding to input character string
CN102035883B (en) Method and device for optimizing webpage in network equipment
CN112597182B (en) Optimization method, device, terminal and storage medium of data query statement
CN103699619A (en) Method and device for providing search results
CN106991175B (en) Customer information mining method, device, equipment and storage medium
CN103838754A (en) Information searching device and method
CN101772766A (en) Method and system for user centered information searching
CN105045890A (en) Method and device for determining hot news in target news source
CN105243058A (en) Webpage content translation method and electronic apparatus
CN107908616B (en) Method and device for predicting trend words
CN104090904A (en) Method and equipment for providing target search result
CN104361092A (en) Searching method and device
CN105302461A (en) Method and equipment for providing target page in mobile application
CN103136213A (en) Method and device for providing related words
CN113190741A (en) Searching method, searching device, electronic equipment and storage medium
CN102541282A (en) Method, device and system for reediting completed words and phrases through icon moving
CN102999576A (en) Method and equipment for confirming page description information corresponding to target pages
CN104809207A (en) Search method and device
CN107294905B (en) Method and device for identifying user
CN103631796A (en) Website sort management method and electronic device
CN113407818A (en) Automatic information retrieval
CN105224654A (en) A kind of Web browsing mode changing method and electronic equipment
CN102982135A (en) Method and device used for providing presented information
CN112541645A (en) Data processing method and system along with vehicle product project development and related device
CN105243106A (en) Method and apparatus used for generating inquiry results

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20151111