CN104281680A - Data processing system, method and device for acquiring website resources - Google Patents

Data processing system, method and device for acquiring website resources Download PDF

Info

Publication number
CN104281680A
CN104281680A CN201410521135.2A CN201410521135A CN104281680A CN 104281680 A CN104281680 A CN 104281680A CN 201410521135 A CN201410521135 A CN 201410521135A CN 104281680 A CN104281680 A CN 104281680A
Authority
CN
China
Prior art keywords
data
picture
web
link
appointed website
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410521135.2A
Other languages
Chinese (zh)
Other versions
CN104281680B (en
Inventor
鲁晓莹
李进
刘世戟
刘鸿宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201410521135.2A priority Critical patent/CN104281680B/en
Publication of CN104281680A publication Critical patent/CN104281680A/en
Application granted granted Critical
Publication of CN104281680B publication Critical patent/CN104281680B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a data processing system, method and device for acquiring website resources. The system comprises a data screening device, a webpage parsing server and a database, wherein the data screening device is used for receiving webpage data captured by web crawlers, screening the received data during receiving, and transmitting the screened webpage data related to appointed websites to the webpage parsing server, the webpage parsing server is used for parsing the webpage data related to the appointed websites according to preset parsing strategies to obtain first structuralized data and saving the first structuralized data to the database, and the database is used for performing data fusion according to the first structuralized data received in a preset time period to obtain second structuralized data for describing the resources of the appointed websites. By the system, the updating cycle of the website resources can be shortened, timeliness of the website resources is increased, the image rate of video resources of video websites can be increased, and user experience is increased.

Description

For obtaining the data handling system of site resource, method and device
Technical field
The present invention relates to data processing field, specifically, relating to a kind of data handling system, method and device for obtaining site resource.
Background technology
Search engine based on the site resource of including in database (site resource usually with structural data describe) for user provides search service.The Search Results of search engine is directly related with the site resource of including in database, therefore, in order to improve Consumer's Experience, needs the site resource that upgrades in time.
In the prior art, usually upgrade site resource in the following way: first, wait for that web crawlers (spider) captures the webpage of magnanimity, by the webpage of crawl stored in setting up index in the first database; Then, the full dose webpage in the first database is screened and structural data parsing (this operation is usually by manual activation), by analysis result stored in the second database; Finally, by the second database, several data is carried out to data fusion, set up the process such as index line to be shown.
Owing to waiting for that web crawlers captures the chronic of webpage and the data access process related to the first database, this causes assess the cost (comprising database cost and time cost) for single webpage larger; Because whole data screening, resolving are batch processing off-line, this causes the integral cycle of Data Update longer.
Above defect causes prior art cannot include up-to-date site resource in time, and this affects the search experience of user greatly.And the site resource stronger for ageing demand and structural data calculate comparatively complicated site resource, adopt prior art cannot include in time especially.For the video resource that ageing requirement is higher, its data processing is comparatively complicated, needs on the one hand just can reach good data cover from list of videos page and video playback page common analytic structure data; Need to merge the picture relevant to webpage on the other hand and could improve Consumer's Experience when follow-up displaying, but the mass picture resource that massive video is brought is difficult to complete crawl at short notice, store the process such as conversion at all.When adopting prior art to upgrade video resource, not only the update cycle is long, cannot meet the ageing requirement of video resource, and is difficult to control due to the progress of picture processing and structural data process, being easy to occur cannot the situation of exhibiting pictures, has a strong impact on Consumer's Experience.
Summary of the invention
In order to solve the defect existing for prior art, embodiment of the present invention provides a kind of data handling system, method and device for obtaining site resource, can overcome the defects such as the poor in timeliness of prior art Data Update cycle long, resource.
First aspect, embodiments providing a kind of data handling system for obtaining site resource, comprising:
Data screening device, for receiving the web data captured by web crawlers, and carries out Screening Treatment to the web data received in receiving course, and the web data relevant to appointed website filtered out is sent to web analysis server;
Web analysis server, for carrying out dissection process according to the parses policy pair web data relevant to described appointed website preset, obtaining first structural data relevant to described appointed website, and described first structural data is saved to database;
Described database, for carrying out Data Fusion according to described first structural data received within a predetermined period of time, obtains the second structural data of the resource for describing described appointed website.
Alternatively, in a kind of implementation of the present embodiment, described data screening device specifically for, in the process receiving web data, URL (Uniform Resoure Locator: the uniform resource locator) regular expression according to described appointed website carries out Screening Treatment to the web data received.
Alternatively, in the another kind of implementation of the present embodiment, when described appointed website is video website, described web analysis server specifically for: when the web data that described web analysis server receives is the web data relevant to the video playback page of described appointed website, carry out dissection process according to the first parses policy; When the web data that described web analysis server receives is the web data relevant to the list of videos page of described appointed website, carry out dissection process according to the second parses policy different from described first parses policy.
Alternatively, in another implementation of the present embodiment, described data handling system also comprises picture processing subsystem; Described web analysis server also for, the image link parsed in described dissection process is sent to described picture processing subsystem; Described picture processing subsystem, new picture is obtained for capturing original image according to described image link and carry out process according to picture processing strategy to described original image, preserving described new picture is also described new picture generating pictures link, and the pictorial information of the image link of the image link and described new picture that comprise described original image is sent to described database.
Further alternatively, described picture processing subsystem comprises picture and captures server, picture processing server and picture database, wherein, described picture captures server, for capturing described original image according to described image link, and described original image and image link thereof are sent to described picture processing server; Described picture processing server, obtains described new picture for carrying out process according to picture processing strategy to described original image, and the image link of described original image and described new picture is saved to described picture database; Described picture database, for being the link of described new picture generating pictures, and is sent to described database by described pictorial information.
Or further alternatively, shown database is also for carrying out Data Fusion according to described second structural data and described pictorial information.
Second aspect, embodiments providing a kind of data processing method for obtaining site resource, comprising:
Receive the web data captured by web crawlers, and in receiving course, Screening Treatment is carried out to the web data received, obtain the web data relevant to appointed website;
The parses policy pair web data relevant to described appointed website according to presetting carries out dissection process, obtains first structural data relevant to described appointed website;
Carrying out Data Fusion according to resolving described first structural data obtained within a predetermined period of time, obtaining the second structural data of the resource for describing described appointed website.
Alternatively, in a kind of implementation of the present embodiment, describedly in receiving course, Screening Treatment is carried out to the web data received and comprise: in the process receiving web data, the URL regular expression according to described appointed website carries out Screening Treatment to the web data received.
Alternatively, in the another kind of implementation of the present embodiment, when described appointed website is video website, the parses policy pair web data relevant to described appointed website that described basis is preset carries out dissection process and comprises: when the web data of being correlated with described appointed website is with the web data that the video playback page of described appointed website is relevant, carry out described dissection process according to the first parses policy; When the web data relevant to described appointed website is with the web data that the list of videos page of described appointed website is relevant, carry out described dissection process according to the second parses policy different from described first parses policy.
Alternatively, in another implementation of the present embodiment, described method also comprises: the image link parsed in described dissection process process is sent to picture processing subsystem, described picture processing subsystem is for performing following process: capture original image according to described image link and carry out process according to picture processing strategy to described original image and obtain new picture, preserves described new picture and is described new picture generating pictures link; Receive the pictorial information sent by described picture processing subsystem, described pictorial information comprises the image link of described original image and the image link of described new picture; Data Fusion is carried out according to described second structural data and described pictorial information.
Alternatively, in another implementation of the present embodiment, described method also comprises: capture original image according to the image link parsed in described dissection process; According to picture processing strategy, process is carried out to described original image and obtain new picture, preserve described new picture and preserve and be the link of described new picture generating pictures; Carry out Data Fusion according to described second structural data and pictorial information, described pictorial information comprises the image link of described original image and the image link of described new picture.
The third aspect, embodiments providing a kind of data processing equipment for obtaining site resource, comprising:
Data screening module, for receiving the web data captured by web crawlers, and carrying out Screening Treatment to the web data received, obtaining the web data relevant to appointed website in receiving course;
Data resolution module, for carrying out dissection process according to the parses policy pair web data relevant to described appointed website preset, obtains first structural data relevant to described appointed website;
Data fusion module, for carrying out Data Fusion according to resolving described first structural data obtained within a predetermined period of time, obtains the second structural data of the resource for describing described appointed website.
Alternatively, in first implementation of the present embodiment, described data screening module specifically for, receive web data process in, the URL regular expression according to described appointed website carries out Screening Treatment to the web data received.
Alternatively, in second implementation of the present embodiment, described data resolution module comprises:
First analyzing sub-module, for be video website in described appointed website and the web data relevant to described appointed website is the web data relevant with the video playback page of described appointed website time, carry out dissection process according to the first parses policy; Second analyzing sub-module, for be video website in described appointed website and the web data relevant to described appointed website is the web data relevant with the list of videos page of described appointed website time, carry out dissection process according to the second parses policy different from described first parses policy.
Alternatively, in the 3rd implementation of the present embodiment, described data processing equipment also comprises: image link sending module, and the image link for being parsed in described dissection process by described data resolution module is sent to picture processing subsystem; Wherein, described picture processing subsystem is for performing following process: capture original image according to described image link and carry out process according to picture processing strategy to described original image and obtain new picture, preserving described new picture is also described new picture generating pictures link, and the pictorial information of the image link of the image link and described new picture that comprise described original image is sent to described data processing equipment; Pictorial information receiver module, for receiving described pictorial information; Described data fusion module also for, carry out Data Fusion according to described second structural data and described pictorial information, obtain the structural data comprising described pictorial information.
Alternatively, in the 4th implementation of the present embodiment, described data processing equipment also comprises: picture handling module, image link for parsing in described dissection process according to described data resolution module captures original image, with picture processing module, obtain new picture for original image according to the process of picture processing strategy, preserving described new picture is also described new picture generating pictures link; Described data fusion module also for, Data Fusion is carried out according to described second structural data and pictorial information, obtain the structural data comprising described pictorial information, described pictorial information comprises the image link of described original image and the image link of described new picture.
Fourth aspect, embodiments providing a kind of data handling system for obtaining site resource, comprising: according to the data processing equipment of the third aspect of the embodiment of the present invention or first of the third aspect or the second implementation; With, for preserving the database of described second structural data.
5th aspect, embodiments providing a kind of data handling system for obtaining site resource, comprising: according to the data processing equipment of the 3rd implementation of the third aspect of the embodiment of the present invention; With, for preserving the database of the structural data comprising described pictorial information.
6th aspect, embodiments providing a kind of data handling system for obtaining site resource, comprising: according to the data processing equipment of the 4th implementation of the third aspect of the embodiment of the present invention, picture processing subsystem and the database for preserving the structural data comprising described pictorial information.Wherein, picture processing subsystem, new picture is obtained for capturing original image according to described image link and carry out process according to picture processing strategy to described original image, preserving described new picture is also described new picture generating pictures link, and the pictorial information of the image link of the image link and described new picture that comprise described original image is sent to described data processing equipment.
Various embodiment of the present invention is adopted to have following beneficial effect:
On the one hand, by carrying out Screening Treatment and dissection process to web data in the process receiving web data, thus (such as per hour) Data Fusion can be carried out to reach the object upgrading site resource at set intervals, this effectively overcomes prior art off-line batch processing and causes the defects such as the poor in timeliness of Data Update cycle length, resource.On the other hand, in the process of computation structure data, carry out picture crawl and follow-up picture processing according to resolving the image link address obtained in dissection process, the rate of publishing picture of video resource can be improved, for user provides better search experience.
Accompanying drawing explanation
Fig. 1 is the calcspar of a kind of data handling system for obtaining site resource according to the embodiment of the present invention;
Fig. 2 A is the calcspar of a kind of data handling system for obtaining video website resource according to the embodiment of the present invention;
Fig. 2 B is a kind of calcspar of the picture processing subsystem in Fig. 2 A illustrated embodiment;
Fig. 3 is the schematic flow sheet of a kind of data processing method for obtaining site resource according to the embodiment of the present invention;
Fig. 4 is the schematic flow sheet of a kind of data processing method for obtaining video website resource according to the embodiment of the present invention;
Fig. 5 is the schematic flow sheet of a kind of data processing method for obtaining video website resource according to the embodiment of the present invention;
Fig. 6 is the calcspar of a kind of data processing equipment for obtaining site resource according to the embodiment of the present invention;
Fig. 7 is the calcspar of a kind of data processing equipment for obtaining site resource according to the embodiment of the present invention;
Fig. 8 is the calcspar of a kind of data processing equipment for obtaining site resource according to the embodiment of the present invention;
Fig. 9 A-9C is the calcspar of a kind of data handling system for obtaining site resource according to the embodiment of the present invention.
Embodiment
Be described in detail to various aspects of the present invention below in conjunction with the drawings and specific embodiments.Wherein, well-known module, unit and connection each other, link, communication or operation do not illustrate or do not elaborate.Further, described feature, framework or function can combine by any way in one or more embodiments.It will be appreciated by those skilled in the art that following various embodiments are only for illustrating, but not for limiting the scope of the invention.Can also easy understand, the module in each embodiment described herein and shown in the drawings or unit or step can be undertaken combining and designing by various different configuration.
[the first embodiment]
Fig. 1 is the calcspar of a kind of data handling system for obtaining site resource according to the embodiment of the present invention, and with reference to Fig. 1, data handling system 1 comprises data screening device 10, web analysis server 20 and database 30, is described respectively below.
Data screening device 10, for receiving the web data captured by web crawlers, and carries out Screening Treatment to the web data received in receiving course, and the web data relevant to appointed website filtered out is sent to web analysis server 20.
Alternatively, in a kind of implementation of the present embodiment, data screening device 10 can directly communicate with web crawlers and continuous reception web data, also with the database communication for preserving the web data that web crawlers captures and continuous reception web data, can also can communicate with the data transfer equipment of the web data captured for transmission network reptile and continuous reception web data.
Alternatively, in a kind of implementation of the present embodiment, data screening device 10 can carry out Screening Treatment according to the URL regular expression of appointed website to the web data received, and obtains the web data relevant to appointed website.
Web analysis server 20, for carrying out dissection process according to the parses policy pair web data relevant to appointed website preset, obtaining first structural data relevant to appointed website, and the first structural data is saved to database 30.
Alternatively, in a kind of implementation of the present embodiment, web analysis server 20 receives the web data that data screening device 10 sends constantly, and carry out dissection process after receiving web data at every turn, or, periodically (such as, every one minute) carries out dissection process to the web data received.
Alternatively, in a kind of implementation of the present embodiment, take appointed website as video website be example, web analysis server 20 when receiving the web data relevant to the video playback page of appointed website, can be resolved according to the first parses policy; When receiving the web data relevant to the list of videos page of appointed website, resolve according to the second parses policy different from the first parses policy.That is, parses policy in the present embodiment can comprise multiple parses policy corresponding with resolved data respectively and be not limited to a kind of parses policy.
Database 30, for carrying out Data Fusion according to the first structural data received within a predetermined period of time, obtains the second structural data for describing appointed website resource.It should be noted that, " first " and " second " mentioned in " the first structural data " and " the second structural data " is only used as name and is referred to as not, in addition, not form any restriction to structural data.
Alternatively, in a kind of application scenarios of the present embodiment, when carry out for multiple appointed website the Screening Treatment of web data, dissection process and Data Fusion time, database 30 receives the first relevant to different web sites respectively structural data, and can carry out Data Fusion in the following ways:
Mode one, database 30 periodically carries out Data Fusion, comprising: first structural data with identical URL received in current period is carried out data fusion, obtains the second structural data of corresponding different web sites respectively.
Mode two, database 30 periodically carries out Data Fusion, comprise: in each cycle, first structural data with identical URL received within this cycle is carried out fusion and obtains fusion results, then the fusion results with identical URL obtained in two or more nearest cycles is merged mutually, obtain the second structural data of corresponding different web sites respectively.
Mode three, database 30 periodically carries out Data Fusion, comprise: first structural data with identical URL received at current period is carried out data fusion, result after merging is merged mutually with second structural data with identical URL calculated in the last cycle, obtains the second structural data at the corresponding different web sites of current period difference.
Alternatively, in a kind of implementation of the present embodiment, database 30, after calculating the second structural data, is that the second structural data is set up index and retrieved on line.
Adopt the data handling system 1 that the embodiment of the present invention provides, can screen and dissection process the web data that web crawlers captures in real time or in time, thus the object that Data Fusion reaches renewal site resource can be carried out at set intervals, this effectively overcomes prior art off-line batch processing and causes the defects such as the poor in timeliness of Data Update cycle length, resource.In addition, the data handling system 1 that the embodiment of the present invention provides has built a complete flow chart of data processing, can continuous operation and avoid manpower intervention.
[the second embodiment]
Data handling system 1 shown in Fig. 1 is applicable to the resource obtaining various types of website (such as: news website, video website, education and scientific research website, military website etc.).With regard to acquisition video website resource, consider that representing video resource with graphic form can improve Consumer's Experience, the present invention still further provides a kind of preferred data handling system for obtaining video website resource, as shown in Figure 2 A, data handling system 2, except comprising data screening device 10, web analysis server 20 and database 30, also comprises picture processing subsystem 40.Be described respectively below, wherein, although be not described in detail to data screening plant 10, web analysis server 20 and database 30, three can have all features in the embodiment shown in fig. 1, does not repeat herein.
In the present embodiment, data screening device 10 is for receiving the web data captured by web crawlers, and in receiving course, Screening Treatment is carried out to the web data received, the web data relevant to designated website filtered out is sent to web analysis server 20.
Web analysis server 20, for carrying out dissection process according to the parses policy pair web data relevant to designated website preset, obtain first structural data relevant to designated website, and the first structural data is saved to database 30, and, for the image link parsed in dissection process is sent to picture processing subsystem 40.
Alternatively, in a kind of implementation of the present embodiment, whether web analysis server 10 comprises player according to the web data place page, judge that web data is the web data relevant to video playback page or the web data relevant with list of videos page, if the former, then carry out resolving (analysis result belongs to the first structural data) according to the first parses policy; If the latter, then carry out resolving (analysis result belongs to the first structural data) according to the second parses policy different from the first parses policy.Wherein, image link is comprised to the analysis result of the web data relevant to list of videos page, such as, comprise the image link parsed from web page source code.
In the present embodiment, picture processing subsystem 40 is for performing following process: capture original image according to image link and carry out process according to picture processing strategy to original image and obtain new picture; Preserving new picture is also new picture generating pictures link; And the pictorial information of the image link of the image link and new picture that comprise original image is sent to database 30.
Adopt the data handling system 2 that the present embodiment provides, picture processing is carried out by picture processing subsystem 40, the image data relevant to designated website can be obtained, be convenient to follow-up needing to carry out calling or carrying out Data Fusion when showing the picture of video resource.
Alternatively, in a kind of implementation of the present embodiment, as shown in Figure 2 B, picture processing subsystem 40 can comprise picture crawl server 41, picture processing server 42 and picture database 43.
Picture captures the image link (i.e. the image link of original image) that server 41 sends for receiving web analysis server 20, captures original image, and original image and image link thereof are sent to picture processing server 42 according to image link.
Picture processing server 42, obtains new picture for carrying out process according to picture processing strategy to original image, and the image link of original image and new picture is saved to picture database 43.
Exemplarily, picture processing server 42 can process original image in the following ways: first analyze original image, identifies the Two-Dimensional Moment system of battle formations of its pixel thus the length and width information of acquisition picture; Then, compress original image according to pre-set rule, cutting etc. operates and obtains new picture, the new picture after process is met and represents requirement.
Picture database 43, for being the link of new picture generating pictures, and is sent to database 30 by the pictorial information of the image link of the image link and new picture that comprise original image.
Alternatively, in a kind of implementation of the present embodiment, database 30, except for calculating except the second structural data according to the first structural data received within a predetermined period of time, can also carry out Data Fusion according to the second structural data and the pictorial information received.Such as, for the second structural data and the pictorial information that receives in described predetermined amount of time, the data wherein with identical URL are carried out Data Fusion.Adopt this implementation, with the different calculating of process execution architecture data and the calculating of pictorial information, picture processing efficiency can be improved, thus improve the rate of publishing picture of video resource, for user provides better search experience.
[the 3rd embodiment]
Below by reference to the accompanying drawings the data handling system according to the embodiment of the present invention is illustrated, below in conjunction with accompanying drawing, the data processing method according to the embodiment of the present invention is described.
Fig. 3 is the schematic flow sheet of a kind of data processing method for obtaining site resource according to the embodiment of the present invention, and with reference to Fig. 3, described method comprises:
300: receive the web data captured by web crawlers, and in receiving course, Screening Treatment is carried out to the web data received, obtain the web data relevant to appointed website.
Alternatively, in a kind of implementation of the present embodiment, in the process receiving web data, the URL regular expression according to appointed website carries out Screening Treatment to the web data received.
302: the parses policy pair web data relevant to appointed website according to presetting carries out dissection process, obtains first structural data relevant to appointed website.
Wherein, a kind of parses policy may be adopted for web data that is same or same class website, also may adopt multiple parses policy.Such as, for the web data of news website, a kind of parses policy can be adopted to resolve; For the web data of video website, can be correlated with from video playback page according to web data or be correlated with list of videos page adopts different parses policy to resolve.
304: carrying out Data Fusion according to resolving the first structural data obtained within a predetermined period of time, obtaining the second structural data of the resource for describing appointed website.
Alternatively, in a kind of application scenarios of the present embodiment, periodically can carry out Data Fusion in 304, previously described three kinds of modes that concrete mode please refer to (but being not limited to), do not repeat herein.
Alternatively, in a kind of implementation of the present embodiment, after 304, be that the second structural data is set up index and retrieved on line.
In a kind of specific implementation of the present embodiment, perform 300 by data screening device 10 and execution result is sent to web analysis server 20, perform 302 by web analysis server 20 and execution result is sent to database 30, then performing 304 by database 30.Wherein, the detailed process that various piece performs each step refers to description above, does not repeat herein.
Adopt the data processing method that the embodiment of the present invention provides, by screening and dissection process the web data that web crawlers captures in real time or in time, can carry out Data Fusion at set intervals and reach the object upgrading site resource, this effectively overcomes prior art off-line batch processing and causes the defects such as the poor in timeliness of Data Update cycle length, resource.
[the 4th embodiment]
Fig. 4 is the schematic flow sheet of a kind of data processing method for obtaining video website resource according to the embodiment of the present invention, and with reference to Fig. 4, described method comprises:
400: receive the web data captured by web crawlers, and in receiving course, Screening Treatment is carried out to the web data received, obtain the web data relevant to designated website.
402: the parses policy pair web data relevant to designated website according to presetting carries out dissection process, obtains first structural data relevant to designated website.
Alternatively, in a kind of implementation of the present embodiment, when the web data relevant to designated website is with the web data that the video playback page of designated website is relevant, carry out dissection process according to the first parses policy; When the web data relevant to designated website is with the web data that the list of videos page of designated website is relevant, carry out dissection process according to the second parses policy different from the first parses policy.
404: carrying out Data Fusion according to resolving the first structural data obtained within a predetermined period of time, obtaining the second structural data of the resource for describing appointed website.
406: the image link parsed in dissection process process is sent to picture processing subsystem.Described picture processing subsystem is for performing following process: capture original image according to image link and carry out process according to picture processing strategy to original image and obtain new picture, preserves new picture and is new picture generating pictures link.
Wherein, the explanation for picture processing subsystem see the explanation in Fig. 2 A and Fig. 2 B illustrated embodiment, can not repeat herein.
408: receive the pictorial information that picture processing subsystem sends, described pictorial information comprises the image link of original image and the image link of new picture.
410: carry out Data Fusion according to the second structural data and pictorial information.Alternatively, the pictorial information carrying out merging with the second structural data is the pictorial information received in described predetermined amount of time.
In the present embodiment, do not limit the execution sequence of 404 and 406-408, even in a kind of variation of the present embodiment, 404 and 410 can realize in the following manner simultaneously: according to resolving the first structural data obtained within a predetermined period of time and the pictorial information received carries out Data Fusion, obtain the structural data comprising pictorial information.
Identical implementation can be had with same or analogous step embodiment illustrated in fig. 3 in the present embodiment, not repeat herein.
In a kind of specific implementation of the present embodiment, perform 400 by data screening device 10 and execution result is sent to web analysis server 20, perform 402 by web analysis server 20 and execution result is sent to database 30,404 are performed so that image link is sent to picture processing subsystem 40 by web analysis server 20, by picture processing subsystem 40, (namely pictorial information is sent to database 30,408 are performed by database 30), perform 406 and 410 by database 30.Wherein, the detailed process that various piece performs each step refers to description above, does not repeat herein.
The data processing method adopting the embodiment of the present invention to provide, except having advantage embodiment illustrated in fig. 3, can also improve the rate of publishing picture of video resource, improves Consumer's Experience.
[the 5th embodiment]
Fig. 5 is the schematic flow sheet of a kind of data processing method for obtaining video website resource according to the embodiment of the present invention, and with reference to Fig. 5, described method comprises:
500: receive the web data captured by web crawlers, and in receiving course, Screening Treatment is carried out to the web data received, obtain the web data relevant to designated website.
502: the parses policy pair web data relevant to designated website according to presetting carries out dissection process, obtains first structural data relevant to designated website.
504: carrying out Data Fusion according to resolving the first structural data obtained within a predetermined period of time, obtaining the second structural data of the resource for describing appointed website.
506: capture original image according to the image link parsed in dissection process.
508: carry out process according to picture processing strategy to original image and obtain new picture, preserving new picture is also new picture generating pictures link.
510: carry out Data Fusion according to the second structural data and pictorial information, obtain the structural data comprising pictorial information.Pictorial information comprises the image link of original image and the image link of new picture.
In the present embodiment, do not limit the execution sequence of 504 and 506-508, even in a kind of variation of the present embodiment, 504 and 510 can realize in the following manner simultaneously: according to resolving the first structural data obtained within a predetermined period of time and the pictorial information received carries out Data Fusion, obtain the structural data comprising pictorial information.
In the present embodiment with Fig. 3 and same or analogous step embodiment illustrated in fig. 4 can have identical implementation, do not repeat herein.
In a kind of specific implementation of the present embodiment, perform 500 by data screening device 10 and execution result is sent to web analysis server 20, perform 502 by web analysis server 20 and execution result is sent to database 30, send to picture to capture server 41 image link by web analysis server 20 and perform 506 to capture server 41 by picture, perform 508 by picture processing server 42 and picture database 43 and pictorial information is sent to database 30, performing 504 and 510 by database 30.Wherein, the detailed process that various piece performs each step refers to description above, does not repeat herein.
The data processing method adopting the embodiment of the present invention to provide, except having advantage embodiment illustrated in fig. 3, can also improve the rate of publishing picture of video resource, improves Consumer's Experience.
[the 6th embodiment]
To obtain " http://www.bugaboo.tv " this video website resource, the present invention will be described below, and the features such as the Rule of judgment mentioned in following citing, concrete processing mode all may be used for Fig. 1 in embodiment illustrated in fig. 5.
First, data screening device 10 receives the web data that web crawlers returns, by bugaboo.tv/ (watch|video)/.*, URL is screened, get the video playback page A of bugaboo.tv website and the web data of list of videos page B, and the web data of acquisition is sent to web analysis server 20.
Then, web analysis server 20 is loaded into the parses policy preset, by judging whether the page exists player and identify that A is video playback page, B is list of videos page.Apply mechanically the extraction that the web data of corresponding parses policy to the web data of the A page and the B page carries out structured message respectively, the A page can extract data C (as table one), comprises title, summary, time etc.; The B page can extract data D, and comprise 21 list factors (as table two), each list factor comprises URL, title, image link, the broadcasting time of corresponding resource.
Table one: data C exemplary plot
Table two: data D first factor schematic diagram
Web analysis server 20 obtains the image link (amounting to 21) of 21 factors in data D, sends it to picture and captures server 41.Meanwhile, web analysis server 20 is the type mark that data C and data D stamps for distinguishing the two, and data C and data D is sent to database 30.
Data C, after receiving data C and data D, is entered library storage according to type mark by database 30, carries out decomposing and format data D, to be 21 data (as table three) by 21 Factorizations, is stored by 21 data loadings afterwards.
Example after table three: data D first Factorization
Database 30 merges the data that full storehouse has identical URL at set intervals, the result after fusion as shown in Table 4:
Example after table four: data A and data D first factor (after decomposing) merge
Picture captures server 41 after receiving image link, according to image link (such as, http://i.bug-a-boo.tv/images/xxx.jpg) carry out picture crawl, get jpg picture file (i.e. original image), the jpg picture file of acquisition and image link are sent to picture processing server 42.
The process such as picture processing server 42 pairs of jpg picture files are resolved, cutting obtain new picture file, and the image link of new picture file, original image file and original image file are sent and be saved to picture database 43.
Picture database 43 generates new image link NEW_URL for new picture file, and the pictorial information (as table five) of the image link of the image link and original image that comprise new picture is sent to database 30.
Original image URL New picture URL
http://i.bug-a-boo.tv/images/xxx.jpg NEW_URL
Table five: pictorial information
After database 30 receives pictorial information, carry out data fusion according to URL, obtain the fusion results (as table six) of data A, data D and pictorial information.
Table six: the few examples of the data after fusion
[the 7th embodiment]
Fig. 6 is the calcspar of a kind of data processing equipment for obtaining site resource according to the embodiment of the present invention, and with reference to Fig. 6, data processing equipment 6 comprises data screening module 61, data resolution module 62 and data fusion module 63.Be described respectively below.
Data screening module 61, for receiving the web data captured by web crawlers, and carrying out Screening Treatment to the web data received, obtaining the web data relevant to appointed website in receiving course.Such as, according to the URL regular expression of appointed website, Screening Treatment is carried out to obtain the web data relevant to appointed website to the web data received.
Data resolution module 62, for carrying out dissection process according to the parses policy pair web data relevant to appointed website preset, obtains first structural data relevant to appointed website.
Alternatively, in a kind of implementation of the present embodiment, as illustrated with the dotted box, data resolution module comprises:
First analyzing sub-module 621, for be video website in appointed website and the web data relevant to appointed website is the web data relevant with the video playback page of appointed website time, carry out dissection process according to the first parses policy; With the second analyzing sub-module 622, for be video website in appointed website and the web data relevant to appointed website is the web data relevant with the list of videos page of appointed website time, carry out dissection process according to the second parses policy different from the first parses policy.
Data fusion module 63, for carrying out Data Fusion according to resolving the first structural data obtained within a predetermined period of time, obtains the second structural data of the resource for describing appointed website.
In the present embodiment, modules may be used for performing the optional implementation of Fig. 3 to the corresponding steps in embodiment illustrated in fig. 5 or corresponding steps, is equally applicable to explanation to the process in the present embodiment performed by modules and restriction for the explanation of corresponding steps in Fig. 3 to Fig. 5 and restriction.
The data processing equipment 6 adopting the present embodiment to provide, can shorten the update cycle of site resource, improves the ageing of site resource.
[the 8th embodiment]
Fig. 7 is the calcspar of a kind of data processing equipment for obtaining site resource according to the embodiment of the present invention, with reference to Fig. 7, data processing equipment 7, except comprising data screening module 61, data resolution module 62 and data fusion module 63, also comprises image link sending module 71 and pictorial information receiver module 72.
Image link sending module 71 is sent to picture processing subsystem (such as, picture processing subsystem 40) for the image link parsed in dissection process by data resolution module 62.Wherein, picture processing subsystem is for performing following process: capture original image according to image link and carry out process according to picture processing strategy to original image and obtain new picture, preserving new picture is also new picture generating pictures link, and the pictorial information of the image link of the image link and new picture that comprise original image is sent to data processing equipment 7.
Pictorial information receiver module 72, for receiving the pictorial information that picture processing subsystem sends.
Data fusion module 63, except performing process performed in the embodiment shown in fig. 6, also for carrying out Data Fusion according to the second structural data and pictorial information, obtains the structural data comprising pictorial information.
In the present embodiment, modules may be used for performing embodiment illustrated in fig. 4 in corresponding steps or the optional implementation of corresponding steps, explanation to the process in the present embodiment performed by modules and restriction are equally applicable to for the explanation of corresponding steps in Fig. 4 and restriction.
The data processing equipment 7 that theres is provided of the present embodiment is provided, except can improve site resource ageing except, for video website, the rate of publishing picture of its video resource can also be improved, improve Consumer's Experience.
[the 9th embodiment]
Fig. 8 is the calcspar of a kind of data processing equipment for obtaining site resource according to the embodiment of the present invention, with reference to Fig. 8, data processing equipment 8, except comprising data screening module 61, data resolution module 62 and data fusion module 63, also comprises picture handling module 81 and picture processing module 82.
Picture handling module 81 captures original image for the image link parsed in dissection process according to data resolution module 62.
Picture processing module 82, for obtaining new picture according to picture processing strategy process original image, preserving new picture is also new picture generating pictures link.Such as, be saved to by new picture in database is also new picture generating pictures link.
Data fusion module 63, except performing process performed in the embodiment shown in fig. 6, also for carrying out Data Fusion according to the second structural data and pictorial information, obtains the structural data comprising pictorial information.Described pictorial information comprises the image link of original image and the image link of new picture.
In the present embodiment, modules may be used for performing embodiment illustrated in fig. 5 in corresponding steps or the optional implementation of corresponding steps, explanation to the process in the present embodiment performed by modules and restriction are equally applicable to for the explanation of corresponding steps in Fig. 5 and restriction.
The data processing equipment 8 that theres is provided of the present embodiment is provided, except can improve site resource ageing except, for video website, the rate of publishing picture of its video resource can also be improved, improve Consumer's Experience.
[the tenth embodiment]
Fig. 9 A-9C is the calcspar of a kind of data handling system for obtaining site resource according to the embodiment of the present invention.
In an embodiment of the present invention, as shown in Figure 9 A, data handling system comprises previously described data processing equipment 6 and the database for preserving the second structural data.Wherein, the second structural data calculated sends and is saved to database by data processing equipment 6, thus more relevant to site resource in new database data.
In another embodiment of the invention, as shown in Figure 9 B, data handling system comprises previously described data processing equipment 7, for preserving the database of the structural data comprising pictorial information, and previously described picture processing subsystem 40.Wherein, the structural data comprising pictorial information calculated sends and is saved to database by data processing equipment 7, thus more relevant to site resource in new database data.
In another embodiment of the present invention, as shown in Figure 9 C, data handling system comprises previously described data processing equipment 8 and the database for preserving the structural data comprising pictorial information.Wherein, the structural data comprising pictorial information calculated sends and is saved to database by data processing equipment 8, thus more relevant to site resource in new database data.
Through the above description of the embodiments, those skilled in the art can be well understood to the present invention and can realize by the mode of software combined with hardware platform.Based on such understanding, what technical scheme of the present invention contributed to background technology can embody with the form of software product in whole or in part, this computer software product can be stored in storage medium, as ROM/RAM, magnetic disc, CD etc., comprising some instructions in order to make a computer equipment (can be personal computer, server, smart mobile phone or the network equipment etc.) perform the method described in some part of each embodiment of the present invention or embodiment.
The term used in instructions of the present invention and wording, just to illustrating, are not meaned and are formed restriction.It will be appreciated by those skilled in the art that under the prerequisite of the ultimate principle not departing from disclosed embodiment, can various change be carried out to each details in above-mentioned embodiment.Therefore, scope of the present invention is only determined by claim, and in the claims, except as otherwise noted, all terms should be understood by the most wide in range rational meaning.

Claims (17)

1. for obtaining a data handling system for site resource, it is characterized in that, comprising:
Data screening device, for receiving the web data captured by web crawlers, and carries out Screening Treatment to the web data received in receiving course, and the web data relevant to appointed website filtered out is sent to web analysis server;
Web analysis server, for carrying out dissection process according to the parses policy pair web data relevant to described appointed website preset, obtaining first structural data relevant to described appointed website, and described first structural data is saved to database;
Described database, for carrying out Data Fusion according to described first structural data received within a predetermined period of time, obtains the second structural data of the resource for describing described appointed website.
2. data handling system as claimed in claim 1, is characterized in that,
Described data screening device specifically for, receive web data process in, the uniform resource position mark URL regular expression according to described appointed website carries out Screening Treatment to the web data received.
3. data handling system as claimed in claim 1, is characterized in that, when described appointed website is video website, described web analysis server specifically for:
When the web data that described web analysis server receives is the web data relevant to the video playback page of described appointed website, carry out dissection process according to the first parses policy;
When the web data that described web analysis server receives is the web data relevant to the list of videos page of described appointed website, carry out dissection process according to the second parses policy different from described first parses policy.
4. the data handling system according to any one of claim 1-3, is characterized in that,
Described data handling system also comprises picture processing subsystem;
Described web analysis server also for, the image link parsed in described dissection process is sent to described picture processing subsystem;
Described picture processing subsystem, new picture is obtained for capturing original image according to described image link and carry out process according to picture processing strategy to described original image, preserving described new picture is also described new picture generating pictures link, and the pictorial information of the image link of the image link and described new picture that comprise described original image is sent to described database.
5. data handling system as claimed in claim 4, is characterized in that, described picture processing subsystem comprises picture and captures server, picture processing server and picture database, wherein,
Described picture captures server, for capturing described original image according to described image link, and described original image and image link thereof is sent to described picture processing server;
Described picture processing server, obtains described new picture for carrying out process according to picture processing strategy to described original image, and the image link of described original image and described new picture is saved to described picture database;
Described picture database, for being the link of described new picture generating pictures, and is sent to described database by described pictorial information.
6. data handling system as claimed in claim 4, is characterized in that,
Shown in database also for, carry out Data Fusion according to described second structural data and described pictorial information.
7. for obtaining a data processing method for site resource, it is characterized in that, comprising:
Receive the web data captured by web crawlers, and in receiving course, Screening Treatment is carried out to the web data received, obtain the web data relevant to appointed website;
The parses policy pair web data relevant to described appointed website according to presetting carries out dissection process, obtains first structural data relevant to described appointed website;
Carrying out Data Fusion according to resolving described first structural data obtained within a predetermined period of time, obtaining the second structural data of the resource for describing described appointed website.
8. data processing method as claimed in claim 7, is characterized in that, describedly in receiving course, carries out Screening Treatment to the web data received comprise:
In the process receiving web data, the URL regular expression according to described appointed website carries out Screening Treatment to the web data received.
9. data processing method as claimed in claim 7, it is characterized in that, when described appointed website is video website, the parses policy pair web data relevant to described appointed website that described basis is preset carries out dissection process and comprises:
When the web data relevant to described appointed website is with the web data that the video playback page of described appointed website is relevant, carry out described dissection process according to the first parses policy;
When the web data relevant to described appointed website is with the web data that the list of videos page of described appointed website is relevant, carry out described dissection process according to the second parses policy different from described first parses policy.
10. data processing method as claimed in any one of claims 7-9, it is characterized in that, described method also comprises:
The image link parsed in described dissection process process is sent to picture processing subsystem, described picture processing subsystem is for performing following process: capture original image according to described image link and carry out process according to picture processing strategy to described original image and obtain new picture, preserves described new picture and is described new picture generating pictures link;
Receive the pictorial information sent by described picture processing subsystem, described pictorial information comprises the image link of described original image and the image link of described new picture;
Data Fusion is carried out according to described second structural data and described pictorial information.
11. data processing methods as claimed in any one of claims 7-9, it is characterized in that, described method also comprises:
Original image is captured according to the image link parsed in described dissection process;
According to picture processing strategy, process is carried out to described original image and obtain new picture, preserve described new picture and preserve and be the link of described new picture generating pictures;
Carry out Data Fusion according to described second structural data and pictorial information, described pictorial information comprises the image link of described original image and the image link of described new picture.
12. 1 kinds for obtaining the data processing equipment of site resource, is characterized in that, comprising:
Data screening module, for receiving the web data captured by web crawlers, and carrying out Screening Treatment to the web data received, obtaining the web data relevant to appointed website in receiving course;
Data resolution module, for carrying out dissection process according to the parses policy pair web data relevant to described appointed website preset, obtains first structural data relevant to described appointed website;
Data fusion module, for carrying out Data Fusion according to resolving described first structural data obtained within a predetermined period of time, obtains the second structural data of the resource for describing described appointed website.
13. data processing equipments as claimed in claim 12, is characterized in that,
Described data screening module specifically for, receive web data process in, the URL regular expression according to described appointed website carries out Screening Treatment to the web data received.
14. data processing equipments as claimed in claim 12, it is characterized in that, described data resolution module comprises:
First analyzing sub-module, for be video website in described appointed website and the web data relevant to described appointed website is the web data relevant with the video playback page of described appointed website time, carry out dissection process according to the first parses policy;
Second analyzing sub-module, for be video website in described appointed website and the web data relevant to described appointed website is the web data relevant with the list of videos page of described appointed website time, carry out dissection process according to the second parses policy different from described first parses policy.
15. data processing equipments according to any one of claim 12-14, is characterized in that,
Described data processing equipment also comprises:
Image link sending module, the image link for being parsed in described dissection process by described data resolution module is sent to picture processing subsystem; Wherein, described picture processing subsystem is for performing following process: capture original image according to described image link and carry out process according to picture processing strategy to described original image and obtain new picture, preserving described new picture is also described new picture generating pictures link, and the pictorial information of the image link of the image link and described new picture that comprise described original image is sent to described data processing equipment
Pictorial information receiver module, for receiving described pictorial information;
Described data fusion module also for, carry out Data Fusion according to described second structural data and described pictorial information, obtain the structural data comprising described pictorial information.
16. data processing equipments according to any one of claim 12-14, is characterized in that,
Described data processing equipment also comprises:
Picture handling module, captures original image for the image link parsed in described dissection process according to described data resolution module, and
Picture processing module, obtains new picture for original image according to the process of picture processing strategy, and preserving described new picture is also described new picture generating pictures link;
Described data fusion module also for, Data Fusion is carried out according to described second structural data and pictorial information, obtain the structural data comprising described pictorial information, described pictorial information comprises the image link of described original image and the image link of described new picture.
17. 1 kinds, for obtaining the data handling system of site resource, is characterized in that,
Described data handling system comprises:
Data processing equipment according to any one of claim 12-14, and,
For preserving the database of described second structural data;
Or described data handling system comprises:
Data processing equipment as claimed in claim 15,
For preserving the database of the structural data comprising described pictorial information, and
Picture processing subsystem, new picture is obtained for capturing original image according to described image link and carry out process according to picture processing strategy to described original image, preserving described new picture is also described new picture generating pictures link, and the pictorial information of the image link of the image link and described new picture that comprise described original image is sent to described data processing equipment;
Or described data handling system comprises:
Data processing equipment as claimed in claim 16, and
For preserving the database of the structural data comprising described pictorial information.
CN201410521135.2A 2014-09-30 2014-09-30 Data processing system, method and device for obtaining site resource Active CN104281680B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410521135.2A CN104281680B (en) 2014-09-30 2014-09-30 Data processing system, method and device for obtaining site resource

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410521135.2A CN104281680B (en) 2014-09-30 2014-09-30 Data processing system, method and device for obtaining site resource

Publications (2)

Publication Number Publication Date
CN104281680A true CN104281680A (en) 2015-01-14
CN104281680B CN104281680B (en) 2018-08-21

Family

ID=52256553

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410521135.2A Active CN104281680B (en) 2014-09-30 2014-09-30 Data processing system, method and device for obtaining site resource

Country Status (1)

Country Link
CN (1) CN104281680B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016206395A1 (en) * 2015-06-25 2016-12-29 中兴通讯股份有限公司 Weekly report information processing method and device
CN107180054A (en) * 2016-03-11 2017-09-19 阿里巴巴集团控股有限公司 A kind of method and apparatus of data processing
CN108228667A (en) * 2016-12-22 2018-06-29 钢钢网电子商务(上海)股份有限公司 A kind of integration method and system of iron and steel resource data information
CN115221453A (en) * 2022-09-20 2022-10-21 太平金融科技服务(上海)有限公司深圳分公司 Media information management method, apparatus, device, medium, and computer program product

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102298622A (en) * 2011-08-11 2011-12-28 中国科学院自动化研究所 Search method for focused web crawler based on anchor text and system thereof
CN102622443A (en) * 2012-03-13 2012-08-01 北京邮电大学 Customized screening system and method for microblog
CN102930059A (en) * 2012-11-26 2013-02-13 电子科技大学 Method for designing focused crawler

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102298622A (en) * 2011-08-11 2011-12-28 中国科学院自动化研究所 Search method for focused web crawler based on anchor text and system thereof
CN102622443A (en) * 2012-03-13 2012-08-01 北京邮电大学 Customized screening system and method for microblog
CN102930059A (en) * 2012-11-26 2013-02-13 电子科技大学 Method for designing focused crawler

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016206395A1 (en) * 2015-06-25 2016-12-29 中兴通讯股份有限公司 Weekly report information processing method and device
CN107180054A (en) * 2016-03-11 2017-09-19 阿里巴巴集团控股有限公司 A kind of method and apparatus of data processing
CN107180054B (en) * 2016-03-11 2020-05-12 阿里巴巴集团控股有限公司 Data processing method and device
CN108228667A (en) * 2016-12-22 2018-06-29 钢钢网电子商务(上海)股份有限公司 A kind of integration method and system of iron and steel resource data information
CN115221453A (en) * 2022-09-20 2022-10-21 太平金融科技服务(上海)有限公司深圳分公司 Media information management method, apparatus, device, medium, and computer program product

Also Published As

Publication number Publication date
CN104281680B (en) 2018-08-21

Similar Documents

Publication Publication Date Title
CN105451087A (en) Pushing method, terminals, historical data server and system for barrage information
CN104618806A (en) Method, device and system for acquiring comment information of video
CN102420813B (en) Method and device for providing target information according to terminal attributes of user equipment
CN103416073B (en) For providing the method and apparatus of the feedback about the process to video content
CN102779167A (en) Method and system for displaying webpage in mobile terminal
CN104281680A (en) Data processing system, method and device for acquiring website resources
CN101441629A (en) Automatic acquiring method of non-structured web page information
CN104394475A (en) Streaming media file playing method and media player
CN105373554A (en) Mobile device webpage based screen popup method and system
CN101808114A (en) Method and system for realizing website access and front-end server
CN103678372A (en) Method and equipment for obtaining application performance of page
CN102904765A (en) Method and equipment for data reporting
CN105469381A (en) Information processing method and terminal
CN104572084A (en) Method and device for user interface generating and data issuing in card business
CN103116645A (en) Method and device for browsing webpage with mobile device
US20130138770A1 (en) Apparatus and method for sharing web contents using inspector script
CN101354706A (en) Method and apparatus for collecting web page information
CN103458065A (en) Method for extracting video address based on Webkit kernel under HTML5 standard
CN108280228A (en) A kind of processing method and relevant device of webpage
CN103577426A (en) Method, device and system for providing additional application messages of searching suggestion
CN111680799A (en) Method and apparatus for processing model parameters
CN107454080A (en) One kind is based on internet data security method and system
CN104580127B (en) Method for processing business, server and client
CN104503983A (en) Method and device for providing website certification data for search engine
CN105812839B (en) Video stream data acquisition, page data transmission method, system and network server

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20150114

Assignee: Beijing Intellectual Property Management Co.,Ltd.

Assignor: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY Co.,Ltd.

Contract record no.: X2023110000095

Denomination of invention: Data processing system, method, and device for obtaining website resources

Granted publication date: 20180821

License type: Common License

Record date: 20230821

EE01 Entry into force of recordation of patent licensing contract