CN103020126B - The access control method of Web content and device - Google Patents

The access control method of Web content and device Download PDF

Info

Publication number
CN103020126B
CN103020126B CN201210468106.5A CN201210468106A CN103020126B CN 103020126 B CN103020126 B CN 103020126B CN 201210468106 A CN201210468106 A CN 201210468106A CN 103020126 B CN103020126 B CN 103020126B
Authority
CN
China
Prior art keywords
web content
access
end side
network
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210468106.5A
Other languages
Chinese (zh)
Other versions
CN103020126A (en
Inventor
刘鎏
秦吉胜
周浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201210468106.5A priority Critical patent/CN103020126B/en
Publication of CN103020126A publication Critical patent/CN103020126A/en
Application granted granted Critical
Publication of CN103020126B publication Critical patent/CN103020126B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a kind of access control method and device of Web content.The access control method of a kind of Web content that the embodiment of the present invention provides, comprise: the network access information collecting end side access side in predetermined amount of time, this network access information comprises the web page title information of the unique identification of end side access side, the webpage of end side access side access; Obtain the identification information of various Web content; To appear at end side access side network access information web page title information in the identification information of Web content add up, and the occurrence number of identification information is greater than the Web content controlling threshold value and is chosen for the Web content that this end side access side pays close attention to; The Web content chosen is sent to service page, the Web content chosen is shown to corresponding end side access side in service page.

Description

The access control method of Web content and device
Technical field
The present invention relates to Internet technical field, particularly a kind of access control method of Web content and device.
Background technology
Along with the continuous expansion of network size, the data volume carried in webpage is also day by day various, how to make user in webpage, find the information self paid close attention to be that current each developer is devoted to one of problem solved fast.For this problem, the solution of a kind of recommendation mechanisms of usual employing, such as, by website optimize side some popular collection of TV plays be arranged on the recommended location in webpage or look steadily destination locations, when the user accessed the web page, this collection of TV plays is recommended user, thus is convenient to user and finds this collection of TV plays fast.
But website operator adopts the identical way of recommendation to all users in existing scheme, as adopted the identical recommendation page to all users, recommend identical video display collection of drama.But, the quantity of the video display collection of drama recommended in the page and coverage rate are all very limited, and the demand of different user is different, concerning a certain user, the video display collection of drama that existing scheme is recommended is likely what this user did not pay close attention to, existing scheme can not carry out the recommendation of video display collection of drama for each user, user's quick obtaining cannot be made to the information paid close attention to, and Consumer's Experience is poor.
Summary of the invention
In view of the above problems, the present invention is proposed to provide a kind of overcoming the problems referred to above or the access control method of Web content solved the problem at least in part and device.
According to one aspect of the present invention, embodiments provide a kind of access control method of Web content, comprising:
Collect the network access information of end side access side in predetermined amount of time, this network access information comprises the web page title information of the unique identification of end side access side, the webpage of end side access side access;
Obtain the identification information of various Web content;
To appear at end side access side network access information web page title information in the identification information of Web content add up, and the occurrence number of identification information is greater than the Web content controlling threshold value and is chosen for the Web content that this end side access side pays close attention to;
The Web content chosen is sent to service page, the Web content chosen is shown to corresponding end side access side in service page.
Wherein, above-mentioned network access information also comprises the web page address information of the webpage of end side access side access, to appear at end side access side network access information web page title information in the identification information of Web content add up before, said method also comprises:
According to the web page address information in network access information, judge webpage that end side access side accesses whether in focus web page listings, if, the identification information of the Web content appeared in the web page title information of this network access information is added up, if not, the identification information of the Web content appeared in the web page title information of this network access information is not added up.
Wherein, above-mentioned to appear at end side access side network access information info web in the identification information of Web content carry out statistics and comprise:
For in predetermined amount of time, the network access information of different time arranges different weighted values, wherein, when the very first time early than the second time time, the weighted value arranged for the network access information that generates in the very first time is less than the weighted value that the network access information for generating in the second time is arranged;
During the identification information of the Web content occurred in the info web of a statistics network access information, by the result of product of weighted value corresponding with this network access information for the occurrence number of the identification information of Web content, as the number of times that the identification information adding up the Web content obtained occurs in the info web of this network access information.
Wherein, before the Web content chosen is sent to service page, said method also comprises:
When the quantity of the end side access side accessing first network content and second network content is greater than amount threshold simultaneously, confirm that first network content and second network content are similar network content;
When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by second network contents selection by first network contents selection; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by first network contents selection by second network contents selection;
Above-mentioned the Web content chosen is sent to service page, comprises the Web content chosen to be shown to corresponding end side access side in service page:
The Web content that the Web content pay close attention to the end side access side chosen and end side access side may pay close attention to is sent to service page, the Web content that end side access side pays close attention to is shown to this end side access side with the Web content that may pay close attention in service page.
Wherein, the above-mentioned occurrence number by identification information is greater than and controls the Web content of threshold value and be chosen for the Web content that this end side access side pays close attention to and comprise:
The occurrence number of identification information be greater than and control threshold value and the Web content meeting filtering rule is chosen for the Web content that this end side access side pays close attention to, wherein, filtering rule comprises following at least one rule:
The quantity comprising word in the identification information of Web content is more than two;
The assessment grade of Web content is greater than level threshold.
Wherein, above-mentioned the Web content chosen is sent to service page, comprises the Web content chosen to be shown to corresponding end side access side in service page:
According to the occurrence number order from big to small of the identification information of the Web content chosen, obtain the displaying order of the Web content chosen, the Web content chosen and displaying order are sent to service page, the web page contents chosen is shown to corresponding end side access side according to displaying order in service page;
Above-mentioned the Web content chosen is shown to corresponding end side access side in service page after, said method also comprises:
Obtain the access times of end side access side by the Web content of service page access display;
According to the access times order from big to small of the Web content chosen, upgrade the displaying order of the Web content chosen, the Web content chosen and the displaying order after upgrading are sent to service page, the web page contents chosen is shown to corresponding end side access side according to the displaying order after renewal in service page.
Wherein, above-mentioned Web content is the movie and television contents in network.
According to one aspect of the present invention, embodiments provide a kind of access control apparatus of Web content, comprising:
End side access side information collection unit, be suitable for connecting every the schedule time and network, from network, collect the network access information of end side access side in predetermined amount of time, this network access information comprises the web page title information of the unique identification of end side access side, the webpage of end side access side access;
Network-content acquisition unit, is suitable for the identification information obtaining various Web content;
Web content chooses unit, be suitable for adding up the identification information of the Web content in the web page title information of the network access information appearing at end side access side, and the occurrence number of identification information be greater than the Web content that the Web content controlling threshold value is chosen for this end side access side concern;
Web content transmitting element, is suitable for the service page be sent to by the Web content chosen in network, the Web content chosen is shown to corresponding end side access side in service page.
Above-mentioned network access information also comprises the web page address information of the webpage of end side access side access,
Said apparatus also comprises focus webpage judging unit, be suitable for appear at end side access side network access information web page title information in the identification information of Web content add up before, according to the web page address information in network access information, judge webpage that end side access side accesses whether in focus web page listings, if, allow Web content to choose the identification information of unit to the Web content appeared in the web page title information of this network access information to add up, if not, forbid that Web content is chosen the identification information of unit to the Web content appeared in the web page title information of this network access information and added up.
Wherein, above-mentioned Web content chooses unit, be suitable for the network access information of different time in predetermined amount of time and different weighted values is set, wherein, when the very first time early than the second time time, the weighted value arranged for the network access information that generates in the very first time is less than the weighted value that the network access information for generating in the second time is arranged; During the identification information of the Web content occurred in the info web of a statistics network access information, by the result of product of weighted value corresponding with this network access information for the occurrence number of the identification information of Web content, as the number of times that the identification information adding up the Web content obtained occurs in the info web of this network access information.
Wherein, Web content chooses unit, be suitable for before the Web content chosen being sent to the service page in network, when the end side access side quantity of to access first network content and second network content is greater than amount threshold simultaneously, confirm that first network content and second network content are similar network content; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by second network contents selection by first network contents selection; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by first network contents selection by second network contents selection;
Web content transmitting element, be suitable for the service page be sent to by the Web content that the Web content of the end side access side chosen concern and end side access side may be paid close attention in network, the Web content that end side access side pays close attention to is shown to this end side access side with the Web content that may pay close attention in service page.
Wherein, Web content chooses unit, is suitable for the occurrence number of identification information to be greater than to control threshold value and the Web content meeting filtering rule is chosen for the Web content that this end side access side pays close attention to, and wherein, filtering rule comprises following at least one rule:
The quantity comprising word in the identification information of Web content is more than two;
The assessment grade of Web content is greater than level threshold.
Wherein, Web content transmitting element, be suitable for the occurrence number order from big to small according to the identification information of the Web content chosen, obtain the displaying order of the Web content chosen, the Web content chosen and displaying order are sent to the service page in network, the web page contents chosen is shown to corresponding end side access side according to displaying order in service page; And
Web content transmitting element, is also suitable for after the Web content chosen is shown to corresponding end side access side in service page, obtains the access times of end side access side by the Web content of service page access display; According to the access times order from big to small of the Web content chosen, upgrade the displaying order of the Web content chosen, the Web content chosen and the displaying order after upgrading are sent to service page, the web page contents chosen is shown to corresponding end side access side according to the displaying order after renewal in service page.
Wherein, the Web content in said apparatus is the movie and television contents in network.
The embodiment of the present invention is by the network access information of collection terminal side access side, know the network access behavior of end side access side, then the network access information of end side access side and various Web content are mated, when the number of times of the identification information appearing at the Web content in network access information is greater than control threshold value, confirm that this Web content is the network that this end side access side pays close attention to, this Web content be sent to service page and show, thus different Web contents can be provided for different end side access sides.Embodiments provide a kind of access control mechanisms of personalization, can the network access behavior of guiding terminal side access side, ensure the information of end side access side quick obtaining to concern, enhance Consumer's Experience.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to technological means of the present invention can be better understood, and can be implemented according to the content of instructions, and can become apparent, below especially exemplified by the specific embodiment of the present invention to allow above and other objects of the present invention, feature and advantage.
Accompanying drawing explanation
By reading hereafter detailed description of the preferred embodiment, various other advantage and benefit will become cheer and bright for those of ordinary skill in the art.Accompanying drawing only for illustrating the object of preferred implementation, and does not think limitation of the present invention.And in whole accompanying drawing, represent identical parts by identical reference symbol.In the accompanying drawings:
Fig. 1 shows a kind of according to an embodiment of the invention access control method process flow diagram of Web content;
Fig. 2 shows the access control apparatus structural representation of Web content according to an embodiment of the invention;
Fig. 3 shows the structural representation of communication system according to an embodiment of the invention;
Fig. 4 shows the structural representation according to another communication system of one embodiment of the invention.
Fig. 5 shows distributed according to an embodiment of the invention key-value pair query engine system construction drawing;
Fig. 6 shows the schematic diagram utilizing KEY-VALUE mechanism data query from memory node according to an embodiment of the invention.
Embodiment
Below with reference to accompanying drawings exemplary embodiment of the present disclosure is described in more detail.Although show exemplary embodiment of the present disclosure in accompanying drawing, however should be appreciated that can realize the disclosure in a variety of manners and not should limit by the embodiment set forth here.On the contrary, provide these embodiments to be in order to more thoroughly the disclosure can be understood, and complete for the scope of the present disclosure can be conveyed to those skilled in the art.
One embodiment of the invention provides a kind of Web content recommendation mechanisms of personalization, according to the history access record of end side access side, dopes the possible preference of end side access side, thus recommends individualized content to end side access side.One embodiment of the invention provides a kind of access control method of Web content, see Fig. 1, comprising:
S100: the network access information collecting end side access side in predetermined amount of time, this network access information comprises the web page title information of the unique identification of end side access side, the webpage of end side access side access;
S102: the identification information obtaining various Web content;
S104: to appear at end side access side network access information web page title information in the identification information of Web content add up, and the occurrence number of identification information is greater than the Web content controlling threshold value and is chosen for the Web content that this end side access side pays close attention to;
S106: the Web content chosen is sent to service page, to be shown to corresponding end side access side by the Web content chosen in service page.
Above-mentioned steps S100 to S106 can be performed by the server of end side, and this server can connect with network in timing, performs step S100 and S102, the network access information of collection terminal side access side, and obtains the identification information of Web content.After the collection of network access information, this server can disconnect the connection with network, under off-line state, performs step S104 and processes network access information, select the Web content that end side access side pays close attention to.Then, this server connects with network again, performs step S106 and the Web content chosen is sent to service page.
Web content in the present embodiment is including, but not limited to the movie and television contents in network, and the identification information of Web content can be the title of movie and television contents.
By upper, the embodiment of the present invention is by the network access information of collection terminal side access side, know the network access behavior of end side access side, then the network access information of end side access side and various Web content are mated, when the number of times of the identification information appearing at the Web content in network access information is greater than control threshold value, confirm that this Web content is the network that this end side access side pays close attention to, this Web content be sent to service page and show, thus different Web contents can be provided for different end side access sides.Embodiments provide a kind of access control mechanisms of personalization, can the network access behavior of guiding terminal side access side, ensure the information of end side access side quick obtaining to concern, enhance Consumer's Experience.
On basis embodiment illustrated in fig. 1, network access information also comprises the web page address information of the webpage of end side access side access, this web page address information can be the URL(URL(uniform resource locator) of webpage, Uniform/UniversalResourceLocator), then before execution step S104, the present embodiment also comprises: according to the web page address information in network access information, judge webpage that end side access side accesses whether in focus web page listings, if, the identification information of the Web content appeared in the web page title information of this network access information is added up, if not, the identification information of the Web content appeared in the web page title information of this network access information is not added up.The address of above-mentioned focus web page listings record focus webpage, this focus webpage be rate of people logging in higher, be subject to great amount of terminals side access side pay close attention to webpage.When the webpage of end side access side access belongs to focus webpage, the access behavior of this end side access side is added up.
Further, consider that Web content that end side access side pays close attention to can change along with the change of time, in the step S104 of the present embodiment to appear at end side access side network access information web page title information in the identification information of Web content carry out statistics and comprise: in predetermined amount of time, the network access information of different time arranges different weighted values, wherein, when the very first time early than the second time time, the weighted value arranged for the network access information that generates in the very first time is less than the weighted value that the network access information for generating in the second time is arranged; During the identification information of the Web content occurred in the info web of a statistics network access information, by the result of product of weighted value corresponding with this network access information for the occurrence number of the identification information of Web content, as the number of times that the identification information adding up the Web content obtained occurs in the info web of this network access information.This processing mode, consider focus at a specified future date and the current concerns of user, attenuation model is introduced by arranging weighted value for network access information, the time that this attenuation model lower network visit information generates more early, the weighted value arranged is less, thus weakening the focus at a specified future date of end side access side, the current concerns of strengthening end side access side, selects the Web content that end side access side pays close attention to more exactly.
Further, before execution step S104, the present embodiment also comprises: when the quantity of the end side access side accessing first network content and second network content is greater than amount threshold simultaneously, confirms that first network content and second network content are similar network content; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by second network contents selection by first network contents selection; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by first network contents selection by second network contents selection.This processing mode, introduce the processing mode of project-based collaborative filtering, such as, if when above-mentioned amount threshold gets 2, if end side access side A have accessed Web content A and Web content C, end side access side B have accessed Web content A, Web content B and Web content C, end side access side C have accessed Web content C, then think that Web content A and Web content C is similar network content, based on this, Web content Web content C may be able to paid close attention to as end side access side C.Namely the Web content can paid close attention to reference to most of end side access side when choosing the Web content that an end side access side pays close attention to, pick out the similarity between Web content, supplementing of the Web content that other higher for the Web content similarity of accessing with end side access side Web content is paid close attention to as this end side access side, thus improve the coverage rate of the Web content chosen.
Further, in order to avoid by the contents selection of some unexpected winners being the Web content that end side access side pays close attention to, also comprise in step S104: the occurrence number of identification information is greater than and controls threshold value and the Web content meeting filtering rule is chosen for the Web content that this end side access side pays close attention to, wherein, filtering rule comprises following at least one rule:
The quantity comprising word in the identification information of rule one, Web content is more than two;
The assessment grade of rule two, Web content is greater than level threshold.
When the identification information of Web content only comprises a word, as " love ", single word be matched to that power can be significantly higher than multiple word be matched to power, then probably all results matched are all the identification informations of the Web content only comprising individual character, this significantly can reduce the accuracy of the Web content selected, the problem that the matching error rate that the identification information that can effectively solve the Web content only comprising a word by above-mentioned regular brings is higher, improves the accuracy of coupling.
In addition, under some scenes, although the occurrence number of the identification information of Web content is greater than control threshold value, but Web content popularity corresponding to this identification information is not high, assessment grade is on the low side, at this moment there is " overmatching " problem, when this Web content being chosen for the Web content of end side access side concern, less to the navigational significance of the access behavior of end side access side, then effectively can solve " overmatching " problem by above-mentioned rule.Such as, the numerical value of assessment grade is arranged in the scope of 0 to 1, and level threshold is set to 0.4, then the occurrence number of identification information is greater than to the Web content controlling threshold value, when the numerical value of the assessment grade of this Web content is greater than 0.4, this Web content is chosen for the Web content that end side access side pays close attention to, otherwise, this Web content is not chosen for the Web content that end side access side pays close attention to.
Further, according to the occurrence number of the identification information of the Web content chosen order from big to small in step S106, obtain the displaying order of the Web content chosen, the Web content chosen and displaying order are sent to service page, the web page contents chosen is shown to corresponding end side access side according to displaying order in service page.Namely control the displaying order of Web content, what the Web content that end side access side pays close attention to most is placed in service page recommended location looks steadily destination locations (as top) most, is convenient to the Web content that end side access side finds concern fast.
Further, after step s 106, the present embodiment also comprises the access times obtaining the Web content that end side access side is shown by service page access;
According to the access times order from big to small of the Web content chosen, upgrade the displaying order of the Web content chosen, the Web content chosen and the displaying order after upgrading are sent to service page, the web page contents chosen is shown to corresponding end side access side according to the displaying order after renewal in service page.This processing mode, after showing Web content, according to the access of end side access side to this Web content, the displaying order of adjustment Web content, facilitates the Web content that end side access side finds concern fast further.
Another embodiment of the present invention provides a kind of access control apparatus of Web content, see Fig. 2, comprising:
End side access side information collection unit 200, be suitable for connecting every the schedule time and network, from network, collect the network access information of end side access side in predetermined amount of time, this network access information comprises the web page title information of the unique identification of end side access side, the webpage of end side access side access;
Network-content acquisition unit 202, is suitable for the identification information obtaining various Web content;
Web content chooses unit 204, be suitable for adding up the identification information of the Web content in the web page title information of the network access information appearing at end side access side, and the occurrence number of identification information be greater than the Web content that the Web content controlling threshold value is chosen for this end side access side concern;
Web content transmitting element 206, is suitable for the service page be sent to by the Web content chosen in network, the Web content chosen is shown to corresponding end side access side in service page.
Above-mentioned each unit can realize on the server of network side.
Web content in the present embodiment is including, but not limited to the movie and television contents in network, and the identification information of Web content can be the title of movie and television contents.
On basis embodiment illustrated in fig. 2, further, above-mentioned network access information also comprises the web page address information of the webpage of end side access side access, said apparatus also comprises focus webpage judging unit, be suitable for appear at end side access side network access information web page title information in the identification information of Web content add up before, according to the web page address information in network access information, judge webpage that end side access side accesses whether in focus web page listings, if, allow Web content to choose the identification information of unit to the Web content appeared in the web page title information of this network access information to add up, if not, forbid that Web content is chosen the identification information of unit to the Web content appeared in the web page title information of this network access information and added up.
Wherein, above-mentioned Web content chooses unit, be suitable for the network access information of different time in predetermined amount of time and different weighted values is set, wherein, when the very first time early than the second time time, the weighted value arranged for the network access information that generates in the very first time is less than the weighted value that the network access information for generating in the second time is arranged; During the identification information of the Web content occurred in the info web of a statistics network access information, by the result of product of weighted value corresponding with this network access information for the occurrence number of the identification information of Web content, as the number of times that the identification information adding up the Web content obtained occurs in the info web of this network access information.
Wherein, Web content chooses unit, be suitable for before the Web content chosen being sent to the service page in network, when the end side access side quantity of to access first network content and second network content is greater than amount threshold simultaneously, confirm that first network content and second network content are similar network content; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by second network contents selection by first network contents selection; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by first network contents selection by second network contents selection; At this moment, Web content transmitting element, the Web content be suitable for the Web content of the end side access side chosen concern and end side access side may be paid close attention to is sent to service page, the Web content that end side access side pays close attention to is shown to this end side access side with the Web content that may pay close attention in service page.
Wherein, Web content chooses unit, is suitable for the occurrence number of identification information to be greater than to control threshold value and the Web content meeting filtering rule is chosen for the Web content that this end side access side pays close attention to, and wherein, filtering rule comprises following at least one rule:
The quantity comprising word in the identification information of Web content is more than two;
The assessment grade of Web content is greater than level threshold.
Wherein, Web content transmitting element, be suitable for the occurrence number order from big to small according to the identification information of the Web content chosen, obtain the displaying order of the Web content chosen, the Web content chosen and displaying order are sent to service page, the web page contents chosen is shown to corresponding end side access side according to displaying order in service page; And Web content transmitting element, is also suitable for after the Web content chosen is shown to corresponding end side access side in service page, obtain the access times of end side access side by the Web content of service page access display; According to the access times order from big to small of the Web content chosen, upgrade the displaying order of the Web content chosen, the Web content chosen and the displaying order after upgrading are sent to service page, the web page contents chosen is shown to corresponding end side access side according to the displaying order after renewal in service page.
In apparatus of the present invention embodiment, the specific works mode of each unit can see method and system embodiment of the present invention.
From the above mentioned, the embodiment of the present invention is by the network access information of collection terminal side access side, know the network access behavior of end side access side, then the network access information of end side access side and various Web content are mated, when the number of times of the identification information appearing at the Web content in network access information is greater than control threshold value, confirm that this Web content is the network that this end side access side pays close attention to, this Web content be sent to service page and show, thus different Web contents can be provided for different end side access sides.Embodiments provide a kind of access control mechanisms of personalization, can the network access behavior of guiding terminal side access side, ensure the information of end side access side quick obtaining to concern, enhance Consumer's Experience.
Further embodiment of this invention additionally provides a kind of communication system, see Fig. 3, this communication system comprises first server 300 and second server 302, second server 302 comprises the access control apparatus 304 of Web content, and the access control apparatus 304 of Web content comprises end side access side information collection unit, network-content acquisition unit, Web content choose unit and Web content transmitting element.
See Fig. 4, show the structural drawing of the another kind of communication system that the present embodiment provides.First server 300 is set up with network all the time and is connected, be in online state, first server 300 provides the interactive interface with end side access side, the request of access of receiving terminal side access side, and shows Web content etc. according to request of access to end side access side.First server 300 can adopt the mode of server cluster to realize.
Second server 302 can connect with network in timing, by the network access information of end side access side information collection unit collection terminal side access side, and is obtained the identification information of various Web content by network-content acquisition unit.After the collection of network access information, second server 302 can disconnect the connection with network, under off-line state, chooses unit process network access information by Web content, selects the Web content that end side access side pays close attention to.Then, second server 302 connects with network again, and the Web content chosen is sent to the service page in first server 300 by Web content transmitting element.
For the ease of the network access information of second server 302 collection terminal side access side, network can carry out record to the daily web page access behavior of end side access side, as utilized first server 300 by the unique identification MID(MachineID of the webpage URL of the daily access of net shield client collection terminal side access side, web page title (TITLE) information and end side access side).Exemplary, can by these information with the format record of MID:URL:TITLE in daily record.In addition, to Web content with after upgrading the assessment grade of Web content, the information that network can also upgrade is recorded in daily record, such as, first server 300 is utilized to upgrade a video display episode data VIDEO_DATA every day by reptile, record format in daily record is ID:NAME:CATE, wherein NAME represents the title (as movie and television play name) of movie and television contents, ID is to should the unique number of movie and television contents, CATE represents the classification (film, TV, animation, variety) that this movie and television contents belongs to, and the identification information of movie and television contents can adopt movie and television play name; And upgraded the score data VIDEO_SCORE of a movie and television contents every day by reptile, record format NAME:SCORE, wherein NAME represents movie and television play name, and SCORE represents assessment grade.Second server 302 can obtain the network access information of end side access side and the identification information of various Web content by collector journal.
Focus webpage judging unit is also comprised in the access control apparatus 304 of Web content, this focus webpage judging unit Web content choose unit to appear at end side access side network access information web page title information in the identification information of Web content add up before, the focus web page listings of collecting is utilized to screen network access information, such as the network access information of the form of MID:URL:TITLE, the URL of focus webpage is recorded in focus web page listings, when focus webpage judging unit confirms that the URL in network access information is present in focus web page listings, allow Web content to choose the identification information of unit to the Web content appeared in the web page title information of this network access information to add up, otherwise, forbid that Web content is chosen the identification information of unit to the Web content appeared in the web page title information of this network access information and added up.
Exemplary, when Web content is movie and television contents, Web content chooses unit using the identification information of the title (as movie and television play name) of movie and television contents as Web content, according to the mode of string matching, the movie and television play name appeared in TITLE is added up, obtains the movie and television contents that each end side access side pays close attention to.
Such as, form is an object lesson of the video display episode data VIDEO_DATA of ID:NAME:CATE is { 10001: commit suicide malicious teacher: TV 10002: new three states: TV 10004: leave: film };
Form is two object lessons of the network access information of the end side access side of MID:URL:TITLE:
MID000:http: //zhidao.baidu.com/question/454345118.html: how many days malicious teacher the 5th season of committing suicide upgrades a collection
MID000:http: //baike.baidu.com/view/1408185.htm: commit suicide malicious teacher
Then Web content chooses the statistics of unit to the web page access behavior of this end side access side, adopt MID: video display ID: during the form of occurrence number, can MID000:10001:2 be expressed as, the movie and television contents list of preferences for each end side access side can be obtained by this statistics.
But Web content chooses unit when choosing the Web content that end side access side pays close attention to, and is obtained, therefore likely can there is the problem of " overmatching ", by string matching such as, in following example:
Movie and television contents data
VIDEO_DATA:{10001: commit suicide malicious teacher: TV 10002: new three states: TV 10004: leave: film };
The network access information of end side access side
MID001:http: //zhidao.baidu.com/question/98133945.html: what meaning of leaving
MID002:http: //zhidao.baidu.com/question/289506183.html: we think to leave from Shanghai in Xiangshan, Ningbo
MID003:http: //zhidao.baidu.com/question/118891690.html: cannot opening program
Have " leaving " character string in the network access information of three above-mentioned end side access sides, according to the method for string matching, { 10004: leave: film } can be added in the movie and television contents list of preferences of three end side access sides; But " leaving " is not high as video display popularity, opinion rating is also on the low side, is a unexpected winner movie and television contents, releases poor effect at service page, in order to weaken the impact of this " overmatching ", needs to filter.For solving " overmatching " problem, it is the occurrence number of identification information be greater than to control threshold value and the Web content that the assessment grade meeting Web content is greater than level threshold is chosen for the Web content that this end side access side pays close attention to that Web content chooses filtering rule that unit adopts.
Further, the access control apparatus of Web content can also utilize Web content transmitting element to sort according to the height of attention rate to the Web content that each end side access side pays close attention to, the movie and television play name movie and television contents that occurrence number is maximum in web page title information TITLE is the movie and television contents that attention rate is the highest, and the displaying order of video display collection of drama is the attention rate order from high in the end of video display collection of drama.
The Web content that each end side access side obtained pays close attention to by second server and displaying order are sent to the service page of first server by Web content transmitting element, these data can be stored to online storage engines by first server, the form that storage format can adopt key assignments (KEY-VALUE) right, Key is the unique identification of end side access side, Value comprises the Web content (as movie and television contents VIDEO) of this end side access side concern and the displaying order of Web content, and form can be expressed as:
MID:VIDEO1+VIDEO2+VIDEO3…+VIDEON。
When service page receives the request of access of end side access side transmission, second server, according to the MID in this request of access, extracts the Value that MID is corresponding, is displayed by Value on service page from storage engines.
See Fig. 5, show the one distributed KEY-VALUE query engine system construction drawing that the embodiment of the present invention provides.This system comprises client (Client), Nginx/UDP (engine/User Datagram Protoco (UDP)) server, agent node (StorageProxy), meta data server/backup server (ConfigServer) and memory node (StorageNode).
Wherein, Fig. 5 shows the client of the multiple concurrent requests end side access side, as client 1 and client 2.Client is mainly used for initiating network access request, and native system can support multilingual client (as C/C++/Python/PHP etc.).
Between client and memory node, comprise Nginx/UDP server and agent node.Nginx/UDP server is a transmission equipment required in distributing communication system usually, and those skilled in the art also can adopt other equipment, even do not adopt.Also not necessarily relation one to one between Nginx/UDP server and agent node.
Agent node is responsible for the request of access of customer in response end and request of access is forwarded to memory node.Agent node can obtain routing table information from meta data server/backup server, know the routing table between client Key and memory node address, according to this routing table by the request forward of client to the memory node in downstream, and the response bag of memory node is passed to client.
Meta data server/backup server is responsible for safeguarding overall routing table information, and monitors the existing state of all memory nodes, and in memory node inefficacy with when increasing memory node newly, meta data server/backup server plays crucial coordinative role.
Memory node is responsible for the actual storage of data, the data stored can adopt the form of block, see Fig. 6, show the schematic diagram according to KEY-VALUE mechanism data query from memory node, Hash operation is adopted to obtain block corresponding to this client (Value) according to client Key, then the mapping table searching block and memory node finds the memory node storing this block, from the memory node found, corresponding data block is extracted, obtain this Query Result (responding bag).
Wherein, memory node a is the redundant node of the redundant node of the host node of the host node of block _ 0, block _ 1, block _ 6, block _ 7; Memory node b is the host node of the host node of the redundant node of the redundant node of block _ 0, block _ 1, block _ 2, block _ 3; Memory node c is the host node of the host node of the redundant node of the redundant node of block _ 2, block _ 3, block _ 4, block _ 5; Memory node d is the host node of the host node of the redundant node of the redundant node of block _ 4, block _ 5, block _ 6, block _ 7; Such storage mode, comprises and ensures in equally distributed situation, and each block can be stored on a host node, is also stored in a redundant node simultaneously.
From the above mentioned, the embodiment of the present invention is by the network access information of collection terminal side access side, know the network access behavior of end side access side, then the network access information of end side access side and various Web content are mated, when the number of times of the identification information appearing at the Web content in network access information is greater than control threshold value, confirm that this Web content is the network that this end side access side pays close attention to, this Web content be sent to service page and show, thus different Web contents can be provided for different end side access sides.Embodiments provide a kind of access control mechanisms of personalization, can the network access behavior of guiding terminal side access side, ensure the information of end side access side quick obtaining to concern, enhance Consumer's Experience.
Intrinsic not relevant to any certain computer, virtual system or miscellaneous equipment with display at this algorithm provided.Various general-purpose system also can with use based on together with this teaching.According to description above, the structure constructed required by this type systematic is apparent.In addition, the present invention is not also for any certain programmed language.It should be understood that and various programming language can be utilized to realize content of the present invention described here, and the description done language-specific is above to disclose preferred forms of the present invention.
In instructions provided herein, describe a large amount of detail.But can understand, embodiments of the invention can be put into practice when not having these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand in each inventive aspect one or more, in the description above to exemplary embodiment of the present invention, each feature of the present invention is grouped together in single embodiment, figure or the description to it sometimes.But, the method for the disclosure should be construed to the following intention of reflection: namely the present invention for required protection requires feature more more than the feature clearly recorded in each claim.Or rather, as claims below reflect, all features of disclosed single embodiment before inventive aspect is to be less than.Therefore, the claims following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and adaptively can change the module in the equipment in embodiment and they are arranged in one or more equipment different from this embodiment.Module in embodiment or unit or assembly can be combined into a module or unit or assembly, and multiple submodule or subelement or sub-component can be put them in addition.Except at least some in such feature and/or process or unit be mutually repel except, any combination can be adopted to combine all processes of all features disclosed in this instructions (comprising adjoint claim, summary and accompanying drawing) and so disclosed any method or equipment or unit.Unless expressly stated otherwise, each feature disclosed in this instructions (comprising adjoint claim, summary and accompanying drawing) can by providing identical, alternative features that is equivalent or similar object replaces.
In addition, those skilled in the art can understand, although embodiments more described herein to comprise in other embodiment some included feature instead of further feature, the combination of the feature of different embodiment means and to be within scope of the present invention and to form different embodiments.Such as, in the following claims, the one of any of embodiment required for protection can use with arbitrary array mode.
All parts embodiment of the present invention with hardware implementing, or can realize with the software module run on one or more processor, or realizes with their combination.It will be understood by those of skill in the art that the some or all functions that microprocessor or digital signal processor (DSP) can be used in practice to realize according to the some or all parts in the access control apparatus of the Web content of the embodiment of the present invention.The present invention can also be embodied as part or all equipment for performing method as described herein or device program (such as, computer program and computer program).Realizing program of the present invention and can store on a computer-readable medium like this, or the form of one or more signal can be had.Such signal can be downloaded from internet website and obtain, or provides on carrier signal, or provides with any other form.
The present invention will be described instead of limit the invention to it should be noted above-described embodiment, and those skilled in the art can design alternative embodiment when not departing from the scope of claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and does not arrange element in the claims or step.Word "a" or "an" before being positioned at element is not got rid of and be there is multiple such element.The present invention can by means of including the hardware of some different elements and realizing by means of the computing machine of suitably programming.In the unit claim listing some devices, several in these devices can be carry out imbody by same hardware branch.Word first, second and third-class use do not represent any order.Can be title by these word explanations.

Claims (12)

1. an access control method for Web content, comprising:
Collect the network access information of end side access side in predetermined amount of time, described network access information comprises the web page title information of the unique identification of end side access side, the webpage of end side access side access;
Obtain the identification information of various Web content;
To appear at end side access side network access information web page title information in the identification information of Web content add up, and the occurrence number of identification information be greater than control threshold value and the Web content meeting filtering rule is chosen for the Web content that this end side access side pays close attention to;
The Web content chosen is sent to service page, the Web content chosen is shown to corresponding end side access side in service page;
Wherein, described to appear at end side access side network access information info web in the identification information of Web content carry out statistics and comprise:
For in predetermined amount of time, the network access information of different time arranges different weighted values, wherein, when the very first time early than the second time time, the weighted value arranged for the network access information that generates in the very first time is less than the weighted value that the network access information for generating in the second time is arranged;
During the identification information of the Web content occurred in the info web of a statistics network access information, by the result of product of weighted value corresponding with this network access information for the occurrence number of the identification information of Web content, as the number of times that the identification information adding up the Web content obtained occurs in the info web of this network access information;
Described the Web content chosen is sent to service page, comprises the Web content chosen to be shown to corresponding end side access side in service page:
According to the occurrence number order from big to small of the identification information of the Web content chosen, obtain the displaying order of the Web content chosen, the Web content chosen and described displaying order are sent to service page, the web page contents chosen is shown to corresponding end side access side according to displaying order in service page.
2. method according to claim 1, wherein, described network access information also comprises the web page address information of the webpage of end side access side access, described to appear at end side access side network access information web page title information in the identification information of Web content add up before, described method also comprises:
According to the web page address information in network access information, judge webpage that end side access side accesses whether in focus web page listings, if, the identification information of the Web content appeared in the web page title information of this network access information is added up, if not, the identification information of the Web content appeared in the web page title information of this network access information is not added up.
3. method according to claim 1, wherein, described the Web content chosen is sent to service page before, described method also comprises:
When the quantity of the end side access side accessing first network content and second network content is greater than amount threshold simultaneously, confirm that first network content and second network content are similar network content;
When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by second network contents selection by first network contents selection; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by first network contents selection by second network contents selection;
Described the Web content chosen is sent to service page, comprises the Web content chosen to be shown to corresponding end side access side in service page:
The Web content that the Web content pay close attention to the end side access side chosen and end side access side may pay close attention to is sent to service page, the Web content that end side access side pays close attention to is shown to this end side access side with the Web content that may pay close attention in service page.
4. method according to claim 1, wherein, described filtering rule comprises following at least one rule:
The quantity comprising word in the identification information of Web content is more than two;
The assessment grade of Web content is greater than level threshold.
5. method according to claim 1, wherein, described the Web content chosen is shown to corresponding end side access side in service page after, described method also comprises:
Obtain the access times of end side access side by the Web content of service page access display;
According to the access times order from big to small of the Web content chosen, upgrade the displaying order of the Web content chosen, the Web content chosen and the displaying order after upgrading are sent to service page, the web page contents chosen is shown to corresponding end side access side according to the displaying order after renewal in service page.
6. the method according to any one of claim 1 to 5, wherein, described Web content is the movie and television contents in network, and the identification information of described Web content is the title of movie and television contents.
7. an access control apparatus for Web content, comprising:
End side access side information collection unit, be suitable for connecting every the schedule time and network, from network, collect the network access information of end side access side in predetermined amount of time, described network access information comprises the web page title information of the unique identification of end side access side, the webpage of end side access side access;
Network-content acquisition unit, is suitable for the identification information obtaining various Web content;
Web content chooses unit, be suitable for adding up the identification information of the Web content in the web page title information of the network access information appearing at end side access side, and the occurrence number of identification information is greater than controls threshold value and the Web content meeting filtering rule is chosen for the Web content that this end side access side pays close attention to;
Web content transmitting element, is suitable for the service page be sent to by the Web content chosen in network, the Web content chosen is shown to corresponding end side access side in service page;
Wherein, described Web content chooses unit, be suitable for the network access information of different time in predetermined amount of time and different weighted values is set, wherein, when the very first time early than the second time time, the weighted value arranged for the network access information that generates in the very first time is less than the weighted value that the network access information for generating in the second time is arranged; During the identification information of the Web content occurred in the info web of a statistics network access information, by the result of product of weighted value corresponding with this network access information for the occurrence number of the identification information of Web content, as the number of times that the identification information adding up the Web content obtained occurs in the info web of this network access information;
Described Web content transmitting element, be suitable for the occurrence number order from big to small according to the identification information of the Web content chosen, obtain the displaying order of the Web content chosen, the Web content chosen and described displaying order are sent to the service page in network, the web page contents chosen is shown to corresponding end side access side according to displaying order in service page.
8. access control apparatus according to claim 7, wherein, described network access information also comprises the web page address information of the webpage of end side access side access,
Described device also comprises focus webpage judging unit, be suitable for appear at end side access side network access information web page title information in the identification information of Web content add up before, according to the web page address information in network access information, judge webpage that end side access side accesses whether in focus web page listings, if, allow Web content to choose the identification information of unit to the Web content appeared in the web page title information of this network access information to add up, if not, forbid that Web content is chosen the identification information of unit to the Web content appeared in the web page title information of this network access information and added up.
9. access control apparatus according to claim 7, wherein,
Web content chooses unit, be suitable for before the described service page Web content chosen is sent in network, when the end side access side quantity of to access first network content and second network content is greater than amount threshold simultaneously, confirm that first network content and second network content are similar network content; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by second network contents selection by first network contents selection; When being only the Web content of end side access side concern, be the Web content that this end side access side may pay close attention to by first network contents selection by second network contents selection;
Web content transmitting element, be suitable for the service page be sent to by the Web content that the Web content of the end side access side chosen concern and end side access side may be paid close attention in network, the Web content that end side access side pays close attention to is shown to this end side access side with the Web content that may pay close attention in service page.
10. access control apparatus according to claim 7, wherein, described filtering rule comprises following at least one rule:
The quantity comprising word in the identification information of Web content is more than two;
The assessment grade of Web content is greater than level threshold.
11. access control apparatus according to claim 7, wherein,
Web content transmitting element, is also suitable for after the Web content chosen is shown to corresponding end side access side in service page, obtains the access times of end side access side by the Web content of service page access display; According to the access times order from big to small of the Web content chosen, upgrade the displaying order of the Web content chosen, the Web content chosen and the displaying order after upgrading are sent to service page, the web page contents chosen is shown to corresponding end side access side according to the displaying order after renewal in service page.
12. access control apparatus according to any one of claim 7 to 11, wherein,
Described Web content is the movie and television contents in network.
CN201210468106.5A 2012-11-19 2012-11-19 The access control method of Web content and device Expired - Fee Related CN103020126B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210468106.5A CN103020126B (en) 2012-11-19 2012-11-19 The access control method of Web content and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210468106.5A CN103020126B (en) 2012-11-19 2012-11-19 The access control method of Web content and device

Publications (2)

Publication Number Publication Date
CN103020126A CN103020126A (en) 2013-04-03
CN103020126B true CN103020126B (en) 2016-01-13

Family

ID=47968730

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210468106.5A Expired - Fee Related CN103020126B (en) 2012-11-19 2012-11-19 The access control method of Web content and device

Country Status (1)

Country Link
CN (1) CN103020126B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102984234B (en) * 2012-11-19 2016-06-01 北京奇虎科技有限公司 The access control method of a kind of communication system and Web content
CN105337931B (en) * 2014-06-30 2019-08-20 北京新媒传信科技有限公司 A kind of limit control method and distributed limit control system
CN105450696A (en) * 2014-08-22 2016-03-30 鸿富锦精密工业(深圳)有限公司 Data backup control method and system based on cloud computing
CN105450695A (en) * 2014-08-22 2016-03-30 鸿富锦精密工业(深圳)有限公司 Data backup control system and method based on cloud computing
CN112148957A (en) * 2019-06-26 2020-12-29 北京百度网讯科技有限公司 Webpage access data analysis method, device and equipment and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101493832A (en) * 2009-03-06 2009-07-29 辽宁般若网络科技有限公司 Website content combine recommendation system and method
CN101833570A (en) * 2010-03-23 2010-09-15 深圳市五巨科技有限公司 Method and device for optimizing page push of mobile terminal
CN102364468A (en) * 2011-09-29 2012-02-29 北京亿赞普网络技术有限公司 User network behavior analysis method, device and system
CN102609474A (en) * 2012-01-18 2012-07-25 北京搜狗信息服务有限公司 Access information providing method and system
CN102752288A (en) * 2012-06-06 2012-10-24 华为技术有限公司 Method and device for identifying network access action

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101493832A (en) * 2009-03-06 2009-07-29 辽宁般若网络科技有限公司 Website content combine recommendation system and method
CN101833570A (en) * 2010-03-23 2010-09-15 深圳市五巨科技有限公司 Method and device for optimizing page push of mobile terminal
CN102364468A (en) * 2011-09-29 2012-02-29 北京亿赞普网络技术有限公司 User network behavior analysis method, device and system
CN102609474A (en) * 2012-01-18 2012-07-25 北京搜狗信息服务有限公司 Access information providing method and system
CN102752288A (en) * 2012-06-06 2012-10-24 华为技术有限公司 Method and device for identifying network access action

Also Published As

Publication number Publication date
CN103020126A (en) 2013-04-03

Similar Documents

Publication Publication Date Title
US11461380B2 (en) System and method for tagging a region within a distributed video file
US10275433B2 (en) Remote browsing and searching
CN103428525B (en) Internet video and the online query of TV programme and control method for playing back and system
US8903863B2 (en) User interface with available multimedia content from multiple multimedia websites
CN104025084B (en) Historical viewings session management
US20130138674A1 (en) System and method for recommending application by using keyword
CN103020126B (en) The access control method of Web content and device
US9336321B1 (en) Remote browsing and searching
SG190645A1 (en) System and method for tracking usage
CN104321743A (en) Method and system for developing applications for consulting content and services on a telecommunications network
CN103699669A (en) Method for message pushing in browser and browser terminal
US20170193059A1 (en) Searching For Applications Based On Application Usage
CN102521257A (en) Method and device for providing corresponding on-line picture according to thumbnail
CN111737449B (en) Method and device for determining similar problems, storage medium and electronic device
CN112052420A (en) Page sharing picture generation method and device and page sharing method and device
CN102955847B (en) The browser form page loads the system of website data
US20070214103A1 (en) System and method for providing content over a communications network
CN108470057A (en) Integrate generation, method for pushing, device, terminal, server and the medium of information
CN103957460A (en) Method and device for generating television receiving terminal desktop application
CN102984234B (en) The access control method of a kind of communication system and Web content
CN103748586A (en) Intelligent television
US20170192978A1 (en) Searching For Applications Based On Application Usage
CN109792452A (en) The adaptive user interface of payload with reduction
CN110474991A (en) Data push method, data-pushing device, electronic equipment and storage medium
CN113569089A (en) Information processing method, device, server, equipment, system and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160113

Termination date: 20211119

CF01 Termination of patent right due to non-payment of annual fee