CN102035883B - Method and device for optimizing webpage in network equipment - Google Patents

Method and device for optimizing webpage in network equipment Download PDF

Info

Publication number
CN102035883B
CN102035883B CN201010569782.2A CN201010569782A CN102035883B CN 102035883 B CN102035883 B CN 102035883B CN 201010569782 A CN201010569782 A CN 201010569782A CN 102035883 B CN102035883 B CN 102035883B
Authority
CN
China
Prior art keywords
information
user
web page
unit
category
Prior art date
Application number
CN201010569782.2A
Other languages
Chinese (zh)
Other versions
CN102035883A (en
Inventor
朱晋良
邢皖甲
Original Assignee
百度在线网络技术(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 百度在线网络技术(北京)有限公司 filed Critical 百度在线网络技术(北京)有限公司
Priority to CN201010569782.2A priority Critical patent/CN102035883B/en
Publication of CN102035883A publication Critical patent/CN102035883A/en
Application granted granted Critical
Publication of CN102035883B publication Critical patent/CN102035883B/en

Links

Abstract

本发明提供一种在网络设备中用于优化网页的方法及设备,本发明中,通过获取待处理的第一网页信息;分析所述第一网页信息所包含的各个信息单元,以确定所述各个信息单元所属的类别;及基于第一预定规则,结合所述各个信息单元的类别,来实现将所述第一网页信息转换为用于提供给所述用户设备的第二网页信息的目的。 The present invention provides a method and apparatus for optimizing a web page in a network device, according to the present invention, by acquiring a first page of information to be processed; each unit analyzes the information included in a first web page information, in order to determine the respective category information unit belongs; and based on a first predefined rule, in conjunction with the respective category information unit, to achieve the first object of the web page information into a second web page to provide information to the user equipment is used. 与现有技术相比,本发明具有以下优点:1)能够突显用户关注的内容,减少用户查找的时间;2)能够屏蔽广告内容及用户不关注的内容,带来更好的网页浏览体验;3)能够去除网页中的冗余内容,减少网页的加载时间;4)能够调整网页结构,加快网页的排版速度。 Compared with the prior art, the invention has the following advantages: 1) be able to highlight the contents of the user's attention, reduce the time users are looking for; 2) capable of shielding the content of advertising content and the user does not concern a better web browsing experience; 3) can be removed redundant content in web pages, web page load time reduction; 4) page structure can be adjusted to accelerate the speed of the page layout.

Description

一种在网络设备中用于优化网页的方法和设备 A method and apparatus for optimizing a web page in the network device

技术领域 FIELD

[0001] 本发明涉及计算机网络技术,尤其涉及一种在网络设备中用于优化网页的方法和设备。 [0001] The present invention relates to computer network technology, particularly to a method and apparatus for optimizing a web page in the network device.

背景技术 Background technique

[0002] 现如今,通过各种用户设备浏览网页,已成为多数人生活中的一部分,然而,随着互联网的发展,网页中包含的信息越来越多,使得用户不得不花费精力在网页中查找自己需要的信息,并且,为了创造盈利,网站提供的各种网页中往往夹杂着较多的广告,影响了用户的浏览。 [0002] Now, through a variety of user devices browsing the Web, has become part of most people's lives, however, with the development of the Internet, information pages include more and more, so that users have to spend energy in a web page to find the information they need, and, in order to create profit, various web sites provided often mixed with more advertising, affecting the user's browser. 此外,由于部分网站的网页编写不得当,还会造成用户网页加载量偏大、网页生成速度较慢的问题。 In addition, since part of the page to write website shall not be treated, it will result in the user page load amount is too large, slow page generation problem.

[0003] 现有技术中,已提供了屏蔽广告信息的方法,然而,该类方法往往仅通过屏蔽浮动元素、拦截弹出窗口等简单的手段来进行广告屏蔽,不仅屏蔽效果较弱,还有可能屏蔽用户需要的信息。 [0003] prior art has provided a method of shielding advertising information, however, such methods often only to advertising shielded by shielding the floating elements, simple means to intercept pop-up windows, etc., not only shielding effect is weak, there may be shield information required by the user. 而对于网页编写不当造成的网页加载量偏大、网页排版速度较慢等问题,现有技术尚未提供有效的解决方案。 The web page for page loading caused by improper preparation of larger, slower speed web publishing and other issues, the prior art has not provided an effective solution.

发明内容 SUMMARY

[0004] 本发明的目的是提供一种在网络设备中用于优化网页的方法和设备。 [0004] The object of the present invention is to provide a method and apparatus for optimizing a web page in the network device.

[0005] 根据本发明的一个方面,提供一种网络设备中用于优化网页的方法,其中,该方法包括以下步骤: [0005] In accordance with one aspect of the present invention, there is provided a method for optimizing a network device web page, wherein the method comprises the steps of:

[0006] a获取待处理的第一网页信息; [0006] a first obtaining page information to be processed;

[0007] b分析所述第一网页信息所包含的各个信息单元,以确定所述各个信息单元所属的类别; [0007] b each analysis unit of the first information included in the web page information to determine the respective category information unit belongs;

[0008] c基于第一预定规则,结合所述各个信息单元的类别,来将所述第一网页信息转换为用于提供给所述用户设备的第二网页信息。 [0008] c based on a first predefined rule, in conjunction with the respective category information unit, the first web page to second web page information into information to be provided to the user equipment.

[0009] 根据本发明的另一个方面,还提供了一种用于优化网页的网络设备,其中,该网络设备包括: [0009] According to another aspect of the present invention, there is provided a network apparatus for optimizing a web page, wherein, the network device comprising:

[0010] 获取装置、用于获取所述待处理的第一网页信息; [0010] acquiring means for acquiring information of a first page to be processed;

[0011] 类别分析装置、用于分析所述第一网页信息所包含的各个信息单元,以确定所述各个信息单元所属的类别; [0011] category analysis means for analyzing each of the information units included in the first web page information, to determine the category of each information unit belongs;

[0012] 转换装置、用于基于第一预定规则,结合所述各个信息单元的类别,来将所述第一网页信息转换为用于提供给所述用户设备的第二网页信息。 [0012] converter means for a first predetermined rule based on the combination of the various categories of information units, the first web page to second web page information into information for providing to the user equipment.

[0013] 与现有技术相比,本发明具有以下优点:1)能够突显用户关注的内容,减少用户查找的时间;2)能够屏蔽广告内容及用户不关注的内容,带来更好的网页浏览体验;3)能够去除网页中的冗余内容,减少网页的加载时间;4)能够调整网页结构,加快网页的排版速度。 [0013] Compared with the prior art, the present invention has the following advantages: 1) the ability to depict the contents of the user's interest, to reduce the time the user is looking; 2) content and advertisement content capable of shielding the user does not concern a better page browsing experience; 3) be able to remove redundant content in web pages, to reduce page load time; 4) be able to adjust the page structure, accelerate the speed of the page layout.

附图说明 BRIEF DESCRIPTION

[0014] 通过阅读参照以下附图所作的对非限制性实施例所作的详细描述,本发明的其它特征、目的和优点将会变得更明显: [0014] By reading the following detailed description of the accompanying drawings of non-limiting embodiments, and other features, objects and advantages of the invention will become more apparent:

[0015]图1为本发明一个方面的用于优化网页的系统网络拓扑图; [0015] FIG 1 aspect of the present invention, a system for optimizing network topology page;

[0016]图2为本发明一个优选实施例的用于优化网页的系统网络拓扑图; [0016] FIG 2 a preferred network topology optimization system for page embodiments of the present invention embodiment;

[0017] 图3为本发明一个方面的用于优化网页的方法的流程图; [0017] FIG. 3 is a flowchart of a method for optimizing a web page for aspect;

[0018] 图4为本发明一个优选实施例的用于优化网页的方法的流程图; [0018] FIG 4 is a flowchart of a method embodiment for optimizing a web page with a preferred embodiment of the present invention;

[0019] 图5为本发明另一优选实施例的用于优化网页的方法的流程图; Another preferred embodiment of a flowchart of a method for optimizing a web page Embodiment [0019] FIG. 5 of the present invention;

[0020] 图6为本发明再一个优选实施例的用于优化网页的方法的流程图; [0020] FIG 6 is a flowchart of a method for optimizing a web page further embodiment of a preferred embodiment of the present invention;

[0021] 图7为本发明一个方面的用于优化网页的网络设备结构图; [0021] FIG. 7 apparatus for optimizing a network configuration diagram of a web page with an aspect of the present invention;

[0022] 图8为本发明一个优选实施例的用于优化网页的网络设备结构图; [0022] FIG. 8 is preferably a device optimized network configuration diagram of a web page for the embodiment of the embodiment of the present invention;

[0023] 图9为本发明另一优选实施例的用于优化网页的网络设备结构图; [0023] FIG. 9 optimize network configuration diagram of apparatus of the present invention for the page to another preferred embodiment;

[0024] 图10为本发明再一个优选实施例的用于优化网页的网络设备结构图; [0024] FIG. 10 of the present invention is a further optimization of the network device configuration diagram of a preferred embodiment for the web embodiment;

[0025] 附图中相同或相似的附图标记代表相同或相似的部件。 [0025] In the drawings the same or similar to the same or like reference numerals refer to the components.

具体实施方式 Detailed ways

[0026] 下面结合附图对本发明作进一步详细描述。 [0026] The following figures of the present invention will be further described in detail with.

[0027] 图1为本发明一个方面的用于优化网页的系统网络拓扑图。 [0027] FIG. 1 system network topology for optimizing a web page with an aspect of the present invention. 用户通过用户设备I与网络设备2进行交互,网络设备2根据用户的交互行为,获取网页信息,并将该获取的网页信息优化后,经由用户设备I提供给用户。 User by a user equipment and network equipment 2 I interactive network device according to a user's interactions, obtain web page information, and after the optimization of the acquired web page information, via the user equipment provides a user I. 其中,用户设备I包括但不限于:计算机、智能手机、PDA或IPTV。 Wherein the user device I include but are not limited to: a computer, a smart phone, PDA or IPTV. 网络设备2包括但不限于:单个网络服务器、多个网络服务器组成的服务器组或基于云计算(Cloud Computing)的由大量计算机或网络服务器构成的云,其中,云计算是分布式计算的一种,由一群松散耦合的计算机集组成的一个超级虚拟计算机。 2 network device including but not limited to: a single network server group servers, or a plurality of network servers is composed of a large number of cloud-based computer or network server cloud computing (Cloud Computing), wherein the Cloud computing is a distributed computing , a virtual super computer by a computer set consisting of a group of loosely coupled.

[0028] 图2为本发明一个优选实施例的用于优化网页的系统网络拓扑图。 [0028] FIG 2 a preferred network topology optimization system for the embodiment of web embodiment of the present invention. 本实施例中,网络设备2进一步分为web设备及优化设备。 Embodiment, the device further into the network equipment and web optimization apparatus 2 embodiment. 用户通过用户设备I与web设备进行交互,web设备根据用户的交互行为,获取网页信息,并将该网页信息发送给优化设备,优化设备对该网页信息进行优化后,反馈给web设备,web设备再将该优化后的网页信息提供给用户设备1,以使用户设备I根据该网页信息,将网页呈现给用户。 I user by a user equipment device interaction with the web, a web appliance according to a user's interactions, acquires the web page information and the web page information to the optimization device, after the device is optimized to optimize the web page information, back to the web appliance, a web appliance then supplies the optimized web page information to the user equipment 1, so that the user apparatus I according to the information page, the page will be presented to the user. 其中,用户设备I包括但不限于:计算机、智能手机、PDA或IPTV。 Wherein the user device I include but are not limited to: a computer, a smart phone, PDA or IPTV. Web设备及优化涉笔均包括但不限于:单个网络服务器、多个网络服务器组成的服务器组或基于云计算(Cloud Computing)的由大量计算机或网络服务器构成的云,其中,云计算是分布式计算的一种,由一群松散耦合的计算机集组成的一个超级虚拟计算机。 Web optimization involving pen device and includes but is not limited to: a single network server group servers, or a plurality of network servers is composed of a large number of cloud-based computer or network server cloud computing (Cloud Computing), wherein, distributed computing cloud a calculation of a virtual super computer by a computer set consisting of a group of loosely coupled.

[0029] 请参阅图1及图3,图3为本发明一个方面的用于优化网页的方法的流程图。 [0029] Please refer to FIG. 1 and FIG. 3, FIG. 3 is a method for optimizing a web aspect of the present invention. FIG.

[0030] 在步骤SI中,用户通过任何一种可与用户设备I进行人机交互的交互设备来输入第一请求,其中,该第一请求用于为用户设备I请求处理待处理的第一网页信息,例如,用于为用户设备I请求用户希望浏览的网页信息,或者,用于为用户设备I请求已存储在用户设备I上但需要优化的网页信息等。 [0030] In step SI, the user may be by any of the user devices I to interact with interactive apparatus to request a first input, wherein the first request for a first device to a user request processing pending I web page information, for example, for a user requesting the user device I want to browse web page information, or a request for a user device I have been stored on the user device I but requires optimized web page information. 其中,该交互设备可以是键盘、鼠标、遥控器、触摸板或声控设备等,用户可以通过执行预设的操作方式告知用户设备I发送所述请求。 Wherein, the interaction device may be a keyboard, a mouse, a remote controller, a touch panel or a voice-activated device, etc., the user can instruct the user equipment by performing a predetermined operation to transmit the request I. 例如,以触摸板式人机交互设备为例,用户通过触控触摸板,选择用户设备I所显示的某个网页链接,再例如,用户I通过在触摸板上以预设的轨迹滑动,以打开与该预设的轨迹相应的网页,例如,主页等。 For example, the touch-panel human interface devices, for example, the user through touch touchpad, select a page that links the user device I show, another example, a user on the touch pad I by a preset trajectory slide to open corresponding to the preset track web page, for example, home and so on. 当然,本领域技术人员应该理解,上述交互设备仅仅只是例举,而非用于限制本发明,事实上,其他可供用户用于输入请求的交互设备或方式也均适用于本发明,并以引用方式包含于此,而不做赘述。 Of course, those skilled in the art will appreciate, the above-described interaction device merely exemplified and not intended to limit the present invention, in fact, available to other user interaction device or a mode input request is also suitable for use in the present invention, and to incorporated here by reference, do not repeat them.

[0031] 接着,在步骤S2中,所述用户设备I将所述用户输入的第一请求发送至所述网络设备2。 [0031] Next, in step S2, the user apparatus I to the first request input by the user is transmitted to the network device 2. 其中,用户设备I和网络设备2之间的信息收发通过网络进行,该网络包括但不限于:1)有线网络;2)无线网络;3)局域网;4)广域网;5)VPN网络;6)无线自组织网络(AdHoc网络)等。 Wherein the messaging between the user device I and a network device via a network, the network including, but not limited to: 1) a wired network; 2) a wireless network; 3) local area network; 4) a wide area network; 5) VPN network; 6) wireless ad hoc network (AdHoc network) and so on.

[0032] 接着,在步骤S3中,网络设备2获取待处理的第一网页信息。 [0032] Next, in step S3, the network device 2 acquires the first page of information to be processed. 网络设备2获取待处理的第一网页信息的方式包括多种: Obtaining a first embodiment to be processed web page information comprises a plurality of network device 2:

[0033] I)当用户设备I发送的第一请求中包括第一网页信息的全部内容,则网络设备2获取该第一请求后,从该第一请求中直接提取第一网页信息; After [0033] I) when the first user device I request sent by a first web page includes the entire contents of information, the network device 2 acquires the first request, extracting the first web page information directly from the first request;

[0034] 2)当用户设备I发送的第一请求中仅包括第一网页信息的链接地址,则网络设备2获取该第一请求后,从所述第一请求中提取所述待处理的第一网页信息的链接地址,再根据所述链接地址,从相应的网站中获取所述待处理的第一网页信息。 After [0034] 2) includes only information on a first web page link address, the network device 2 acquires the first request when a first request sent from the user device I, the extraction of the first transaction from the request a link address of the web page information, and then, acquires the first page of the information to be processed from the corresponding site according to the link address.

[0035] 接着,在步骤S4中,网络设备2分析所述第一网页信息所包含的各个信息单元,以确定所述各个信息单元所属的类别。 [0035] Next, in step S4, the information of each unit of the network device 2 analyzes the information contained in the first web page to determine the category information of the respective unit belongs.

[0036] 具体地,网络设备2对所述第一网页信息进行分析,识别出第一网页信息中需要处理的信息单元,并通过分析与信息单元相关的因素,来确定各个信息单元所属的类别。 [0036] In particular, the two pairs of the first network device to analyze web page information, the identification information of the web page information in the first unit to be processed, by analyzing factors associated with the information unit, to determine the category of each information unit belongs .

[0037] 其中,网络设备2可根据以下至少一项因素来确定所述信息单元的类别: [0037] wherein, the network device 2 may determine the type of the information unit according to at least one of the following factors:

[0038] I)所述信息单元的标识符; [0038] I) the information element identifier;

[0039] 具体地,网络设备2根据第一网页信息中所包含的标识符,来区分信息单元,并判断信息单元所属的类别。 [0039] Specifically, the first network device 2 based on the identifier included in the web page information, to distinguish the information unit, and determines the category information unit belongs.

[0040] 例如,若网络设备2检测到标识符“〈title〉”,则网络设备2判断两个标识符“〈title〉”之间的内容为一个信息单元,该信息单元为标题;又例如,若网络设备2检测到标识符“ /* ”或者“ // ”,则网络设备2判断“ /* ”或者“ // ”至“;”之间的内容为一个信息单元,该信息单元为注释单元等。 Content between [0040] For example, if the network device 2 detects the identifier "<title>", the network device 2 determines two identifiers "<title>" as an information unit, the header information unit; and e.g. If the network device 2 detects the identifier "/ *" or "//", the network device 2 determines "/ *" or "//" to ";" the contents of one information unit among the information units comments unit and so on.

[0041] 2)所述信息单元的文本内容; [0041] 2) the text content of the information unit;

[0042] 具体地,网络设备2根据第一网页信息中所包含的标识符,来区分信息单元,随后,根据该信息单元的文本内容,来判断信息单元所属的类别。 [0042] Specifically, the first network device 2 based on the identifier included in the web page information, to distinguish the information unit, then, according to the text content of the information unit, to determine the category information unit belongs.

[0043] 例如,网络设备2将信息单元中的文本内容与预设的广告词库中包含的广告词匹配,若匹配成功,例如匹配得到“欢迎选购”等,则判断该信息单元为广告单元。 [0043] For example, ad text information with the preset unit 2 ad network device comprising matching lexicon, if the matching succeeds, for example, to obtain matching "welcome" and the like, it is judged that the information unit ads unit.

[0044] 3)所述信息单元在所述第一网页信息中的位置; [0044] 3) the location information of the first unit of the web page information;

[0045] 具体地,网络设备2根据第一网页信息中所包含的标识符,来区分信息单元,随后,网络设备2通过该信息单元在第一网页信息中的位置,来判断信息单元所属的类别; [0045] Specifically, the first network device 2 based on the identifier included in the web page information, to distinguish the information unit, then the network device 2 by the position information unit in the first page information to determine the information unit belongs category;

[0046] 例如,网络设备2分析得到超过一定数量的,结构相近的信息单元位于第一网页信息后1/5位置内,则判断该信息单元为广告单元。 [0046] For example, the network device 2 obtained over a certain number of analysis, the unit is located close structural information of a first page information in the 1/5 position, it is judged that the information unit for the ad unit.

[0047] 4)与所述单元相关的信息单元的信息; [0047] 4) the cell information related with the information unit;

[0048] 具体地,网络设备2根据第一网页信息中所包含的标识符来区分信息单元,随后,网络设备2通过查找与该信息单元具有相同标识符的信息单元的类别或查找与该信息单元位置相近且结构相似的信息单元的类别或包含的内容,来判断该信息单元所属的类别。 [0048] Specifically, the network device 2 to distinguish a first information element based on the identifier information included in the web page, then the network device 2 having the same identifier type information element to the information unit by looking to the information or to find categories or content includes cell location information of similar structure and similar means to determine the category of the information unit belongs. 其中,所述结构相似是指两个信息单元中相同的部分超过一预定阈值,例如,超过50%等。 Wherein the structure is similar to refer to like parts in the two information elements exceeds a predetermined threshold, e.g., more than 50% and the like. 在此,本领域技术人员应可根据实际需求来确定一个合理的预定阈值。 Here, the skilled artisan may determine a reasonable predetermined threshold value according to the actual needs.

[0049] 例如,网络设备2在判断一信息单元时,首先查找到其前一信息单元;随后,将其与待判断的信息单元进行对比,当两者标识符相同,且两者的文本匹配度高于一预定阈值,则判断该信息单元的类别与前一信息单元的类别相同。 [0049] For example, network device 2 when an information determination unit first finds the previous one information unit; then, it is compared with the information unit to be determined when both the same identifier, and matching both the text is higher than a predetermined threshold value, it is judged that the category information unit is the same as the previous category information element.

[0050] 需要说明的是,网络设备2在判断信息单元的类别的过程中,可综合上述因素进行判断,例如,当网络设备2检索到信息单元的文本内容与广告词库中的广告词相匹配,则再进一步判断该信息单元在第一网页中的位置及该信息单元是否具有结构相似的相邻的信息单元,若该信息单元位于第一网页信息后1/5位置内,且具有结构相似的信息单元,则判断该信息单元为广告单元,若该信息单元位于第一网页信息1/3-2/3的中间位置内,且该信息单元没有与其结构相似的相邻的信息单元,则判断该信息单元不是广告单元等。 [0050] Incidentally, the category determination process of the network device 2 in the information unit, the determination may be above factors, e.g., when the network device 2 ad text retrieved thesaurus information in ad units with match, then further determines the position of the information unit in the first page and the information unit is has a similar structure adjacent information units, if the 1/5 position of the first units of information pages of information, and having the structure similar information unit, it is judged that the information unit for the ad unit, if the intermediate position of the information unit located at a first web page information 1 / 3-2 / 3, and this information unit is not similar with the structure of the adjacent information units, this information element is judged not the ad unit and the like.

[0051] 需要进一步说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明所做的限制,本领域技术人员应该理解,任何通过分析要素来确定信息单元类别的方法,均应包含在本发明的范围内。 [0051] It is further noted that the above example is only to better illustrate the technical solutions of the present invention, does not limit the present invention, those skilled in the art will appreciate that any method for determining the type of the information unit by analyzing the elements It should be included within the scope of the present invention.

[0052] 接着,在步骤S5中,网络设备2基于第一预定规则,结合所述各个信息单元的类另IJ,来将所述第一网页信息转换为用于提供给所述用户设备的第二网页信息。 [0052] Next, in step S5, the network device 2 based on a first predetermined rule, each of the information units in conjunction with other classes IJ, to convert the first web page for providing information to the user of the device two pages of information.

[0053] 具体地,网络设备2根据第一预定规则中所记录的信息单元的类别与可执行操作之间的对应关系,来执行相应操作,以将所述第一网页信息转换为第二网页信息。 [0053] Specifically, the network device 2 according to the correspondence between the type of operation and may perform a predetermined rule recorded in a first information element to perform a corresponding operation to the first web page into a second web page information information.

[0054] 例如,第一预定规则中设定对于css单元,当其位于第一网页信息的起始位置时,不对其进行操作;当其位于第一网页信息的其他位置时,将其移至第一网页信息的起始位置。 [0054] For example, a first predefined rule for css setting means when it is in the starting position of the first web page information, without its operation; when it is in another position of the first web page information will move the starting position of the first web page information. 则当网络设备2检测到信息单元的类别为css单元时,根据第一预定规则中的规则,结合CSS单元的当前位置,确定是否执行将CSS单元移动至起始位置的操作。 When the network device 2 detects the category information unit is css unit, according to a first predetermined rule in the rule, with the current position of the CSS unit, the operation determines whether the CSS unit is moved to the starting position. 由于CSS单元影响网页的结构,而浏览器在生成网页的过程中通常是按照第一网页信息的内容从头至尾生成,因此,通过将CSS单元前置,能够避免浏览器在生成一部分网页后,由于检测到CSS单元,因此需要重新生成网页的问题,加快了浏览器生成网页的速度。 Since the CSS unit affect the structure of the page and the process of generating the browser page from start to finish is usually generated as the content of the first web page information, therefore, by the CSS unit front can be generated as part of the browser on the page to avoid, Since the detection unit to CSS, so the need to regenerate the page in question, speeds up browser generated web pages.

[0055] 需要说明的是,根据信息单元的类别来调整信息单元位置的方式,不以上述举例为限,本领域技术人员应该理解,只要是根据信息单元的类别,将影响网页结构的信息单元前置的方案,均应包含在本发明的范围内。 [0055] Note that, by adjusting the information unit location based on the type information element way, not in the above example is limited, those skilled in the art will appreciate, as long as according to the category information unit, the influence of information units on a page structure pre-program should be included within the scope of the present invention.

[0056] 又例如,第一预定规则中设定,删除注释单元,则当网络设备2检测到信息单元的类别为注释单元时,将注释单元删除。 [0056] As another example, a first predefined rule set, delete comment unit, when the network device 2 detects the information unit as a comment category unit, the unit deletes the comment. 由于注释不影响网页生成,因此,将注释删除,能够减少浏览器加载网页内容的时间,也减少了用户需要下载的流量,加快了网页呈现的速度。 Since the comment does not affect the Web page generation, therefore, to delete the comment, you can reduce the time browser loads the page content, also reduces the need for users to download traffic, accelerate the speed of page rendering.

[0057] 需要说明的是,根据信息单元的类别来删除信息单元的方式,不以上述举例为限,本领域技术人员应该理解,只要是根据信息单元的类别,删除不影响网页生成的信息单元的方案,均应包含在本发明的范围内。 [0057] Incidentally, to remove the information unit based on the type information element way, not in the above example is limited, those skilled in the art will appreciate, as long as according to the type of information units deleted without affecting the information element generated page program, should be included within the scope of the present invention.

[0058] 当网络设备2完成对第一网页信息的所有处理后,将该处理后的第一网页信息作为第二网页信息。 [0058] When the network device 2 to complete all the information of the first page processing, the first page of the information processing as a second web page information.

[0059] 需要进一步说明的是,根据第一预定规则,结合信息单元类别,来将所述第一网页信息转换为用于提供给所述用户设备的第二网页信息的处理方法,并不以上述举例为限,例如,处理方法还可包括屏蔽垃圾信息单元、凸显正文单元和标题单元等等。 [0059] It is further noted that, according to a first predetermined rule, in conjunction with the category information unit, to the first page of the second page information conversion processing method for providing information to the user equipment, not to limited to the example described above, e.g., a processing method may further include a shield spam unit, and highlights the text header units like unit.

[0060] 需要更进一步说明的是,步骤S4与步骤S5之间并无先后顺序,网络设备2可在每判断一个信息单元类别后,即执行相应的操作,也可判断所有信息单元的类别后,再执行相应的操作。 [0060] The further explanation is required, there is no order between S5, the network device 2 may be determined in each category after one information unit, i.e., performs a corresponding operation in step S4 and steps may be determined for all the categories of information units , then the appropriate action.

[0061] 在步骤S6中,网络设备2将第二网页信息发送给用户设备I。 [0061] In step S6, the network device 2 transmits the second page information to a user device I.

[0062] 在步骤S7中,用户设备I根据第二网页信息,生成网页以呈现给用户。 [0062] In step S7, the user apparatus I according to a second web page information, generates a web page for presentation to the user.

[0063] 请参阅图2及图3,作为本发明的一个优选实施例,网络设备2可进一步包括web设备及优化设备。 [0063] Please refer to FIGS. 2 and 3, as a preferred embodiment of the present invention, the network device may further include a web 2 and device optimization devices.

[0064] 本实施例中,步骤SI已在参照图1及图3所示的实施例中详述,并以引用的方式包含于此,不再赘述。 [0064] In the present embodiment, in the step SI has been described in detail with reference to the embodiment shown in FIGS. 1 and 3, and is incorporated herein by reference, not repeated.

[0065] 在步骤S2中,用户设备I将第一请求发送至web设备。 [0065] In step S2, the user device I first request to the web device. 其发送方式与以上参照图1及图3所述实施例中的相应步骤S2相同或相似,并以引用的方式包含于此,不再赘述。 Transmits a manner described above with reference to FIG. 1 and FIG. 3 the corresponding steps in the same or similar to S2, and is incorporated herein by reference, not repeated.

[0066] 在步骤S3中,web设备根据第一请求获取第一网页信息。 [0066] In step S3, web page information according to the first device acquires the first request. 其获取方式与参照图1及图3所述实施例中的相应步骤S3相同或相似,并以引用的方式包含于此,在此不再赘述。 Acquiring a manner described with reference to FIG 1 and FIG 3 embodiments of the respective steps in the same or similar to S3, and is incorporated herein by reference, are not repeated here.

[0067] 随后,web设备将第一网页信息发送给优化设备,优化设备获取该待处理的第一网页信息。 [0067] Subsequently, web page information of the first device transmits to the optimization apparatus optimizing apparatus acquires the web page information to be processed first.

[0068] 接着,优化设备执行前述参照图1及图3所示的实施例中的步骤S4及步骤S5,将第一网页信息处理为第二网页信息。 [0068] Next, the optimization device performs Referring to FIG 1 and FIG 3 embodiments shown in the embodiment in step S4 and step S5, the first page of the second information processing web page information.

[0069] 接着,优化设备将第二网页信息发送给web设备,web设备再执行步骤S6,将第二网页信息提供给用户设备I。 [0069] Next, a second optimization device transmits the web page information to a web appliance, a web appliance then performs step S6, the second web page to provide information to the user device I.

[0070] 最后,用户设备I执行步骤S7,根据第二网页信息,生成网页以呈现给用户。 [0070] Finally, the user equipment performs I step S7, according to a second web page information, generates a web page for presentation to the user.

[0071] 图4为本发明一个优选实施例的用于优化网页的方法的流程图。 [0071] FIG 4 a flow chart for the preferred method for optimizing a web page according to an embodiment of the present invention. 本实施例中。 The present embodiment. 本实施例中,步骤S4可由网络设备2或包含于网络设备2中的优化设备完成,其中,步骤S4进一步包括步骤S41及步骤S42。 In this embodiment, the step S4 may be a network device 2 or 2 complete optimization apparatus comprises a network device, wherein, further comprising the step S4 and step S41 to step S42.

[0072] 步骤SI至步骤S3已在参照图1及图3或图2及图3所示的实施例中予以详述,并以引用的方式包含于此,不再赘述。 [0072] Step SI to step S3 already described in detail in reference to FIG. 1 and the embodiment shown in FIG. 3 or FIG. 2 and FIG. 3, and is incorporated herein by reference, not repeated.

[0073] 在步骤S41中,网络设备2根据所述第一网页信息的链接地址在模板库中进行匹配查询,以获取相应的类别识别模板。 [0073] In step S41, the network device 2 in the template match query database according to the link address of the first page information, to obtain the corresponding class identification template.

[0074] 具体地,模板库中包含了各个类别识别模板及与该各个类别识别模板对应的链接地址,网络设备2将第一网页信息的链接地址与模板库中的链接地址进行匹配,得到能够成功匹配的类别识别模板。 [0074] Specifically, the template library contains the recognition template for each category and with the respective template category identification address corresponding to the link, the link address of the network device 2 and the information of the first web page template gallery link address matching can be obtained successful matching category recognition template. 其中,当网络设备2能够成功匹配到多个链接地址时,选择匹配度最高的链接地址所对应的类别识别模板。 Wherein, when the network device 2 can be successfully matched to the plurality of link address, select a category template matching to identify the highest address corresponding to the link.

[0075] 其中,匹配度可根据两条链接地址的表现形式之间的相似程度来计算,该表现形式包括但不限于基于http,https,ftp, tencent协议的URL地址或IP地址,MAC地址等。 [0075] wherein, the matching degree may be calculated according to the degree of similarity between the two link address manifestations, manifestations of which include but are not limited based on http, https, ftp, URL address or IP address tencent protocol, MAC address, etc. . 例如,第一网页信息的链接地址表现为如下URL地址:http://news.sina.com, cn/society,网络设备2在模板库中成功匹配到多个链接: For example, the first information on the website link address showed the following URL address: http: //news.sina.com, cn / society, network equipment 2 successfully matched to multiple links in the template gallery:

[0076] www.sina.com, cn: [0076] www.sina.com, cn:

[0077] http://finance, sina.com, cn/stock/: [0077] http: // finance, sina.com, cn / stock /:

[0078] http: //mobile, sina.com, cn/: [0078] http: // mobile, sina.com, cn /:

[0079] http: //news, sina.com, cn/s/sd/:及. [0079] http: // news, sina.com, cn / s / sd /: and.

[0080] http: //news, sina.com, cn/society: [0080] http: // news, sina.com, cn / society:

[0081] 其中,根据字符串相似度可以确定与第一网页信息的链接地址表现形式匹配度最高的链接为http://news, sina.com, cn/society,该链接对应“类别识别模板一”,则网络设备2选择“类别识别模板一”作为与第一网页信息相对应的类别识别模板。 [0081] wherein, similarity can be determined in accordance with the string form of links exhibit the highest degree of matching with a link address information of the first page is http: // news, sina.com, cn / society, the link corresponding to "category identifying a template ", the network device 2 to select" a recognition template category "as the first web page information corresponding to the type identification template.

[0082] 在步骤S42中,网络设备2根据第一网页信息所包含的各个信息单元,并结合所述类别识别模板,来确定所述各个信息单元所属的类别。 [0082] In step S42, the network device 2 in accordance with various information units included in the first page information, and combining the template category identification, to determine the category information of the respective unit belongs.

[0083] 具体地,在结合前述实施例中第一预定规则所参考因素的基础上,网络设备2进一步根据类别识别模板所提供的信息,来对信息单元进行针对性更强的识别操作,以下将结合前述参考因素,予以详述: [0083] Specifically, on the basis of the foregoing embodiments in conjunction with the first predetermined rule on the factors referred embodiment, network device 2 is further category identification information provided by the template, to carry out the operation more targeted identification information units, the following the combination of the aforementioned reference factor, to be detailed:

[0084] I)所述信息单元的标识符; [0084] I) the information element identifier;

[0085] 网络设备2结合类别识别模板中记录的标识符所表示的含义,来判断信息单元所属的类别。 [0085] The network device 2 binding recognition template category identifier recorded meanings indicated, to determine the category information unit belongs.

[0086] 例如,“类别识别模板一”中记录,标识符“ [ad] ”表示广告,则网络设备2判断标识符为“ [ad] ”的信息单元为广告单元。 [0086] For example, "a recognition template category" is recorded, the identifier "[AD]" indicates that the ad, the network device 2 determines identifier "[AD]" information element for the ad unit.

[0087] 2)所述信息单元的文本内容; [0087] 2) the text content of the information unit;

[0088] 网络设备2结合类别识别模板中记录的文本内容的相关信息,判断信息单元所属的类别。 [0088] 2 binding text information recognition template categories recorded network devices, the category determination information unit belongs.

[0089] 例如,“类别识别模板一”中记录,当一个信息单元所包含的文本字数超过一预设阈值时,该信息单元为重要信息单元,则网络设备2判该信息单元为突显单元。 [0089] For example, the "category recognition template a" recording, when the text words of an information unit contains more than a predetermined threshold value, the information unit as an important information element, the network device 2 determination of the information unit is a highlight cell.

[0090] 3)所述信息单元在所述第一网页信息中的位置; [0090] 3) the location information of the first unit of the web page information;

[0091]网络设备2结合类别识别模板中记录的信息单元的位置与其所属类别的对应关系,来判断信息单元所属的类别。 [0091] The corresponding relationship between the network device 2 combined with its location Category type identification information element template record, to determine the category information unit belongs.

[0092] 例如,“类别识别模板一”中记录,位于第一网页信息后1/3位置内的内容为广告信息,则网络设备2判断位于第一网页信息后1/3位置内的信息单元为广告单元。 After [0092] For example, "a recognition template category" in the record, the first rear third of the web page information contents position information for an ad, the network device 2 is located in the first page information determination unit in the third position for the ad unit.

[0093] 4)与所述信息单元相关的信息单元的信息; [0093] 4) information relating to the information element of the information unit;

[0094] 例如,“类别识别模板一”中记录,当存在超过4个结构相似且位置相近的信息单元时,该信息单元为用于内容推荐的信息单元,则网络设备2判断该类信息单元为推荐单元。 [0094] For example, "a type identification template" recording, when there are more than four of similar structure and similar to the position of the information unit, the information unit for the recommended content information unit, the network device determines the kind of information means 2 the recommended unit.

[0095] 需要说明的是,网络设备2在判断信息单元的类别的过程中,可综合上述因素进行判断,例如,“类别识别模板一”中记录,当存在超过4个结构相似且位置相近的信息单元时,需进一步根据信息单元所处的位置进行判断,若信息单元所处的位置为第一网页信息中靠前1/2至3/4的位置内,则该信息单元为推荐单元;若信息单元所处的位置为第一网页信息中靠后1/5的位置内,则该信息单元为广告单元等。 [0095] Incidentally, the category of process network device 2 determines that the information unit may be above factors for determining, for example, "category recognition template a" recording, when there are more than four similar structure and a similar position when the information unit, need further determination unit is located according to the location information, if the position of a first information unit is located within the web page information in the forward position of 1/2 to 3/4, then the information unit is a recommendation unit; If the position of a first information unit is located on the web page information after the position of 1/5, the information unit for the ad unit and the like.

[0096] 需要进一步说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明所做的限制,本领域技术人员应该理解,任何通过结合类别识别模板及要素分析来确定信息单元的类别的方法,均应包含在本发明的范围内。 [0096] It is further noted that the above example is only to better illustrate the technical solutions of the present invention, it does not limit the present invention, those skilled in the art will appreciate, any type identification determined by binding template and factor analysis the method of the category information element shall be included within the scope of the present invention.

[0097] 步骤S5至步骤S7已在参照图1及图3所示的实施例中予以详述,在此以引用的方式包含,不再赘述。 [0097] Step S5 to step S7 in the embodiment to be had with reference to FIG. 1 and detailed in FIG. 3, herein by reference contains not repeated.

[0098] 优选地,本实施例还包括根据用户经由所述用户设备发送的反馈信息和/或所述第二网页信息,来确定待更新或待建立的类别识别模板的步骤。 [0098] Preferably, according to the present embodiment further comprises a feedback information transmitted by the user via the user device and / or the second web page information, the step of determining to be established or to be updated template type identification.

[0099] 具体地,当用户设备I将基于第二网页信息生成的网页呈现给用户后,用户可再次通过人机交互,经由用户设备I向网络设备2发送反馈信息,该反馈信息包括用户对于网页优化的满意度。 [0099] Specifically, when the user device I will be presented to the user based on a second web page information generated by the user can be interactive, user equipment via the I 2 network device transmits feedback information, the feedback information comprises a user again to page optimization satisfaction. 网络设备2记录用户的反馈信息,并选择用户评价值低于一预定阈值的第二网页信息所采用的类别识别模板,以作为待更新的类别识别模板;或者,若该第二网页信息未采用类别识别模板,则网络设备2记录该第二网页信息的链接地址,以确定在模板库中建立与该链接地址相对应的类别识别模板。 2 records the user's feedback network device, and select the category recognition template user evaluation value is below a second predetermined threshold web page information used to identify the type as a template to be updated; Alternatively, if the second web page information is not used recognition template categories, the network device 2 to record the second information on the website link address to determine and establish the link address corresponding to the recognition template categories in the template library.

[0100] 图5为根据本发明另一优选实施例的用于优化网页的方法的流程图。 [0100] FIG. 5 is a flowchart of a method for optimizing a web page according to another preferred embodiment of the present invention. 本实施例中,步骤S4进一步包括步骤S4',步骤S4'可由网络设备2或包含于网络设备2中的优化设备完成。 In this embodiment, step S4 further comprises the step S4 ', step S4' can be 2 network device or a device 2 for optimizing the network device is completed.

[0101] 步骤SI至步骤S3已在参照图1及图3或图2及图3所示的实施例中予以详述,并以引用的方式包含于此,不再赘述。 [0101] Step SI to step S3 already described in detail in reference to FIG. 1 and the embodiment shown in FIG. 3 or FIG. 2 and FIG. 3, and is incorporated herein by reference, not repeated.

[0102] 在步骤S4,中,网络设备2通过结合用户相关信息对所述第一网页信息所包含的各个信息单元进行分析,以确定所述各个信息单元所属的类别。 [0102] In step S4, the network device 2 of each analysis unit of the first information contained in the web page information by combining the user-related information, to determine the category information of the respective unit belongs. 其中,网络设备2通过识别用户身份,来获取该用户的用户相关信息,网络设备2可根据以下方式识别用户身份:1)用户设备I的唯一识别码,例如,手机号、用户设备的硬件识别码等;2)用户的注册信息;3)记录在用户设备cookie中的信息等。 Wherein, the network device 2, to obtain user related information of the user by identifying the user, the network device 2 according to the following manner to identify the user: 1) a unique identifier of the user device I, e.g., phone number, the hardware to identify the user equipment code and the like; 2) the user registration information; 3) the information recorded in the cookie of the user equipment and the like. 用户相关信息可保存在网络设备2中,或者,用户相关信息保存在用户设备I中,并由网络设备2获取,或者,网络设备2综合保存在用户设备I及网络设备2中的信息,得到用户相关信息。 User-related information can be stored in the network device 2, or user-related information stored in the user device I, acquired by the network device 2, or 2 integrated network device information stored in the user device I and 2 network equipment, get user-related information.

[0103] 其中,所述用户相关信息可由用户主动提供,或网络设备根据记录的用户行为推测得到。 [0103] wherein the user-related information by a user unsolicited or the network user devices obtained according to the estimation of the behavior record. 网络设备2可结合以下至少一项用户相关信息,来分析信息单元的类别: 2 network device may incorporate at least one user-related information, category information analyzing unit:

[0104] I)用户的个人属性,包括用户的年龄、性别、身份、收入、教育程度等; [0104] I) the user's personal attributes, including the user's age, gender, identity, income, education and so on;

[0105] 2)用户的偏好设置,包括屏蔽网页内容的偏好设置,突显网页内容的偏好设置等; [0105] 2) user preferences, including web content preferences screen, highlighting the web content preferences, etc.;

[0106] 3)用户的历史行为,包括用户浏览、点击网页的行为记录等; [0106] 3) the historical behavior of the user, including the user's browser, click on the page's behavior records;

[0107] 4)用户的环境信息,包括用户所在的位置信息、用户当前的时间信息及用户设备相关信息等,其中,用户设备相关信息包括但不限于:网络运营商、用户设备类型,IMEI,用户设备操作系统信息、屏幕分辨率、软件信息等。 [0107] 4) environment information of a user, including the location information of the user is located, the user's current time information and the user device-related information, wherein the user information about the device, including but not limited to: the network operator, user equipment type, IMEI, user device operating system information, screen resolution, and software information.

[0108] 例如,当用户相关信息包含该用户为女性,则网络设备2判断包含“服装”、“购物”等词汇的信息单元为突显单元。 Information element [0108] For example, when the user information including the female user, the network device 2 is determined with "clothing", "shopping" and other words of highlight cell.

[0109] 又例如,当用户在偏好设置中设置突显标题,则网络设备2将检测到的标题单元判断为突显单元。 [0109] As another example, when the user sets highlighted title preferences, the network device 2 will detect the header unit determines that the highlight cell.

[0110] 又例如,当在一预设时间长度内所记录的用户行为仅包括该用户通过新浪网的新闻页面主页点击打开网页的行为,而无该用户进一步在打开的网页上进行点击的行为,则网络设备2可基于所记录的用户行为判断该用户仅浏览网页中的正文,故可将正文以外的其他信息单元确定为可忽略单元。 [0110] In another example, when a user behavior within a predetermined length of time recorded only includes the user clicks to open a web page by Sina news page homepage behavior, behavior without the user further clicks on the open pages , the network device 2 may be recorded based on user behavior is determined that the user to view only the text page, it means other than text information may be determined to be negligible unit.

[0111] 再例如,网络设备2根据用户设备I当前的IP地址,判断用户所在位置为上海,则当信息单元的文本内容中包括“上海”时,网络设备2可确定该信息单元为突显单元。 [0111] For another example, the network device 2 according to a user device I the current IP address, determines user location in Shanghai, when the information unit is text including "Shanghai" when, network device 2 may determine that the information unit is a highlight cell .

[0112] 步骤S5至步骤S7已在参照图1及图3所示的实施例中予以详述,以引用的方式包含于此,不再赘述。 [0112] Step S5 to step S7 in the embodiment to be had with reference to Fig. 1 and detailed in Figure 3, is incorporated herein by reference, not repeated.

[0113] 需要说明的是,在步骤S4'中,还可进一步包括前述步骤S41及S42,以结合类别识别模板及用户相关信息,来确定信息单元所属的类别。 [0113] Note that, at step S4 ', the may further comprise the step S41 and S42, the category identification to bind template and user-related information, to determine the category information unit belongs.

[0114] 需要进一步说明的是,上述举例仅为更好地说明本发明的方案,而非对本发明的限制,本领域技术人员应该理解,根据任何其他的用户相关信息以及基于用户相关信息来判断信息单元所属类别的任何其他方式,均应包含在本发明的范围内。 [0114] It is further noted that the above example is only to better illustrate the present invention, not limitation of the invention, those skilled in the art will appreciate, based on the user information and related information is determined according to any other user any other way of category information element shall be included within the scope of the present invention.

[0115] 图6为本发明再一个优选实施例的用于优化网页的方法的流程图。 [0115] FIG 6 is a flowchart of another method of the invention for optimizing a web page of the preferred embodiment. 本实施例中,步骤S5进一步包括步骤S5',步骤S5'可由网络设备2或包含于网络设备2中的优化设备完成。 In this embodiment, further comprising the step S5 step S5 ', step S5' may be complete or a network device 2 for optimizing the network device 2.

[0116] 步骤SI至步骤S4已在参照图1和图3、图2和图3、图4或图5所示的实施例中予以详述,并以引用的方式包含于此,不再赘述。 [0116] Step 3 is a step to SI, FIG. 2 and FIG. 3, the embodiment shown in FIG. 4 or FIG. 5 to be S4 in FIG. 1 and described in detail, and is incorporated herein by reference, not repeated .

[0117] 在步骤S5'中,网络设备2根据所述第一预定规则,并基于所述各个信息单元的类另IJ,来对所述各个信息单元执行相应的操作,以将所述第一网页信息转换为第二网页信息。 [0117] In step S5 ', the network device 2 according to said predetermined first rule, based on the respective class information of another IJ unit, performs a corresponding operation on the respective information unit to the first web page information into the second information.

[0118] 其中,所述第一预定规则包括参考以下至少一项因素来确定所述相应的操作: [0118] wherein said first predetermined rule comprises at least one reference to the following factors to determine the appropriate action:

[0119] I)预设的所述类别与可执行操作之间的对应关系; [0119] I) a preset correspondence relationship between the categories and executable operations;

[0120] 具体地,在第一预定规则中,规定了每一种信息单元类别所对应的可执行操作,网络设备2根据信息单元类别与可执行操作之间的对应关系,来对各个信息单元执行相应的操作,当所有操作完成后,则将处理后的第一网页信息作为第二网页信息。 [0120] Specifically, at a first predetermined rule, each of the predetermined information element corresponding to the category of executable operations, the network device 2 the correspondence between the category information and perform the operation unit to the respective information units the appropriate action, after completion of all the operations, the first web page is being processed, the information as the second web page information.

[0121] 例如,第一预定规则规定了注释单元及广告单元所对应的可执行操作为删除操作,则当网络设备2检测到注释单元,将该注释单元删除; [0121] For example, a first predetermined unit annotation rules and advertising units corresponding executable operations for the delete operation, when the network device 2 detects the annotation means, the annotation unit deletes;

[0122] 又例如,第一预定规则规定了当css单元未处于网页信息的起始位置时,将其置于起始位置,则当网络设备2检测到css单元时,检测css单元所处的位置,当其位置不为起始位置时,将其移至起始位置; [0122] As another example, a first predetermined rules when the unit is not in the starting position css web page information, which is placed at the beginning, when the network device detects css unit 2, the detection unit is located css position, when the position is not the starting position, to move the starting position;

[0123] 又例如,第一预定规则规定了以红色字体来对突显单元中的文本内容进行突显,则当网络设备2检测到突显单元时,将突显单元的文本内容的色彩格式更改为红色; [0123] As another example, a first predetermined rules specify highlighted in red to highlight the textual content unit, when the network device 2 detects the highlight section, change the format of the color highlight text unit is red;

[0124] 再例如,第一预定规则规定了标记可忽略单元,则当网络设备2检测到可忽略单元时,对可忽略单元进行标记,以供用户设备I识别可忽略单元,则用户设备I可根据用户的选择,将所述可忽略单元生成在网页中,呈现给用户;或者,屏蔽该可忽略单元,不将其呈现给用户。 [0124] As another example, a first predetermined rules specify marking means negligible, when the network device detects a negligible unit 2, marked on the unit can be ignored for the user equipment unit I recognition negligible, the user equipment I according to user's selection, the page can be ignored in the generation unit, presented to the user; alternatively, the shield unit can be ignored, it is not presented to the user.

[0125] 2)用户相关信息; [0125] 2) information related to the user;

[0126] 具体地,第一预定规则中包含:根据用户相关信息及信息单元的类别,来对信息单元执行相应操作的规则。 [0126] Specifically, a first predefined rule comprises: the user related information and the category information unit, the rule performs corresponding operations information units.

[0127] 例如,若用户在用户偏好设置中规定以灰色背景的方式,对突显单元进行突显,则网络设备2将突显单元的背景更改为灰色; [0127] For example, if the user specifies a gray background in the manner of highlighting unit highlights the user preferences, the network device 2 will highlight gray background change unit;

[0128] 又例如,若用户在超过一预定次数中,从未选择呈现可忽略单元,则网络设备2将可忽略单元的透明度调整为59%,以对可忽略单元进行淡化处理。 [0128] As another example, if the user exceeds a predetermined number of times, the nonselected negligible presentation unit, the network device 2 will be negligible transparency adjustment means 59% to desalination treatment unit can be ignored.

[0129] 需要说明的是,网络设备2还可根据第一预定规则,结合上述两者,来将第一网页信息转换为第二网页信息。 [0129] Incidentally, the network device 2 according to a first predetermined rule may be, a combination of both, to convert the first information to a second web page web page information. 例如,第一预定规则中规定,可屏蔽单元所对应的可执行操作包括标记、删除及淡化,需要结合用户相关信息来选择一项操作,则当检测到可屏蔽单元时,网络设备2根据用户相关信息,来选择屏蔽、删除或者淡化操作。 For example, a first predetermined rules provide that a shielding unit comprises a corresponding flag executable operations, delete, and desalination, requires a combination of user-related information to select an operation, when the shielding unit can be detected, the network device 2 according to a user related information, to choose to block, delete or fading operations.

[0130] 需要进一步说明的是,上述举例仅为更好地说明本发明的内容,而非对本发明的限制,本领域技术人员应该理解,根据第一预定规则来对所述各个信息单元执行相应的操作,以将所述第一网页信息转换为第二网页信息的方案,均应包含在本发明的范围内。 [0130] It is further noted that the above example is only to better illustrate the present invention, not limitation of the invention, those skilled in the art will appreciate, to perform the respective information units according to respective first predetermined rule operation, to convert the first information to the second web page web page information of the program, should be included within the scope of the present invention.

[0131] 图7为本发明一个方面的用于优化网页的网络设备结构图。 [0131] FIG. 7 configuration diagram of a network apparatus for optimizing web page according to an aspect of the present invention. 本实施例中,网络设备2包括获取装置21、类别分析装置22及转换装置23。 In this embodiment, network device 2 includes an acquisition unit 21, category analysis device 22 and switching means 23.

[0132] 用户通过任何一种可与用户设备I进行人机交互的交互设备来输入第一请求,其中,该第一请求用于为用户设备I请求处理待处理的第一网页信息,例如,用于为用户设备I请求用户希望浏览的网页信息,或者,用于为用户设备I请求已存储在用户设备I上但需要优化的网页信息等。 [0132] User input may be by any of a request for a first user device I to interact with a human-computer interaction device, wherein the first request for a user device I first request processing web page information to be processed, e.g., web page for information for a user device I wish to request the user to browse, or, for a user equipment I request has been stored on the user's device, but I need to optimize the web page information. 其中,该交互设备可以是键盘、鼠标、遥控器、触摸板或声控设备等,用户可以通过执行预设的操作方式告知用户设备I发送所述请求。 Wherein, the interaction device may be a keyboard, a mouse, a remote controller, a touch panel or a voice-activated device, etc., the user can instruct the user equipment by performing a predetermined operation to transmit the request I. 例如,以触摸板式人机交互设备为例,用户通过触控触摸板,选择用户设备I所显示的某个网页链接,再例如,用户I通过在触摸板上以预设的轨迹滑动,以打开与该预设的轨迹相应的网页,例如,主页等。 For example, the touch-panel human interface devices, for example, the user through touch touchpad, select a page that links the user device I show, another example, a user on the touch pad I by a preset trajectory slide to open corresponding to the preset track web page, for example, home and so on. 当然,本领域技术人员应该理解,上述交互设备仅仅只是举例,而非用于限制本发明,事实上,其他可供用户用于输入请求的交互设备或方式也均适用于本发明,并以引用方式包含于此,而不做赘述。 Of course, those skilled in the art will appreciate, the above-described example only interaction device only, not intended to limit the present invention, in fact, available to other user interaction device or a mode input request is also suitable for use in the present invention, and by reference incorporated here, do not go into details.

[0133] 所述用户设备I将所述用户输入的第一请求发送至所述网络设备2。 [0133] The first user device I to the user input request is sent to the network device 2. 其中,用户设备I和网络设备2之间的信息收发通过网络进行,该网络包括但不限于:1)有线网络;2)无线网络;3)局域网;4)广域网;5)VPN网络;6)无线自组织网络(Ad Hoc网络)等。 Wherein the messaging between the user device I and a network device via a network, the network including, but not limited to: 1) a wired network; 2) a wireless network; 3) local area network; 4) a wide area network; 5) VPN network; 6) wireless ad hoc network (Ad Hoc network) and so on.

[0134] 获取装置21获取待处理的第一网页信息。 [0134] The acquisition means acquires the first page of information to be processed 21. 获取装置21获取待处理的第一网页信息的方式包括多种: Obtaining a first embodiment to be processed web page information acquisition apparatus 21 includes a plurality of:

[0135] I)获取装置21包括第一子获取装置(图未示)及第二子获取装置(图未示)。 [0135] I) comprises a first sub-acquisition apparatus 21 acquiring means (not shown) and a second sub-acquiring means (not shown). 当用户设备I发送的第一请求中包括第一网页信息的全部内容,则第一子获取装置获取该第一请求后,第二子获取装置从该第一请求中直接提取第一网页信息; When the entire content of the first request sent by a user device I comprises a first web page information, the first sub-after acquiring means acquires the first request, a second sub-web page information acquisition means extracts the first request from the first;

[0136] 2)获取装置21包括第一子获取装置(图未示)及第二子获取装置(图未示),且第二子获取装置还进一步包括提取装置(图未示)及第三子获取装置(图未示)。 [0136] 2) comprises a first sub-acquisition apparatus 21 acquiring means (not shown) and a second sub-acquiring means (not shown), and the second acquiring means further comprises a sub-extracting device (not shown), and a third obtaining sub-means (not shown). 当用户设备I发送的第一请求中仅包括第一网页信息的链接地址,则第一子获取装置获取该第一请求后,提取装置从所述第一请求中提取所述待处理的第一网页信息的链接地址,第三子获取装置再根据所述链接地址,从相应的网站中获取所述待处理的第一网页信息。 After the requesting user when the first transmission device I comprises a first web page in only the link destination information, the first acquisition means acquires the first sub-request, said extracting means extracts from the first request to be processed in a first web page link address information, and then the third sub-acquisition means acquires the first page of the information to be processed from the corresponding site according to the link address.

[0137] 类别分析装置22分析所述第一网页信息所包含的各个信息单元,以确定所述各个信息单元所属的类别。 [0137] each category analysis means 22 analyzes the information units of information included in a first web page to determine the category information of the respective unit belongs.

[0138] 具体地,类别分析装置22对所述第一网页信息进行分析,识别出第一网页信息中需要处理的信息单元,并通过分析与信息单元相关的因素,来确定各个信息单元所属的类别。 [0138] In particular, the first web page category analysis means performs analysis information 22, the identification information of the web page information in the first unit to be processed, by analyzing factors associated with information units, each information unit belongs to is determined category.

[0139] 其中,类别分析装置22可根据以下至少一项因素来确定所述信息单元的类别: [0139] wherein category analysis device 22 may determine the category information unit according to at least one of the following factors:

[0140] I)所述信息单元的标识符; [0140] I) the information element identifier;

[0141] 具体地,类别分析装置22根据第一网页信息中所包含的标识符,来区分信息单元,并判断信息单元所属的类别。 [0141] Specifically, the analysis device 22 according to the category identifier of the first web page information contained to distinguish the information unit, and the category determination information unit belongs.

[0142] 例如,若类别分析装置22检测到标识符“〈title〉”,则判断两个标识符“〈title〉”之间的内容为一个信息单元,该信息单元为标题;又例如,若类别分析装置22检测到标识符“/*”或者“//”,则判断“/*”或者“//”至“;”之间的内容为一个信息单元,该信息单元为注释单元等。 SUMMARY [0142] For example, if the category analysis means 22 detects the identifier "<title>", it is determined that two identifiers "<title>" as between one information unit, the header information unit; As another example, if category analysis means 22 detects the identifier "/ *" or "//", the judgment "/ *" or "//" to ";" the contents of one information unit among the information units like unit as a comment.

[0143] 2)所述信息单元的文本内容; [0143] 2) the text content of the information unit;

[0144] 具体地,类别分析装置22根据第一网页信息中所包含的标识符,来区分信息单元,随后,根据该信息单元的文本内容,来判断信息单元所属的类别。 [0144] Specifically, the analysis device 22 according to the category identifier of the first web page information contained to distinguish the information unit, then, according to the text content of the information unit, to determine the category information unit belongs.

[0145] 例如,类别分析装置22将信息单元中的文本内容与预设的广告词库中包含的广告词匹配,若匹配成功,例如匹配得到“欢迎选购”等,则判断该信息单元为广告单元。 [0145] For example, ad text with a preset category analysis unit of the device 22 to the information contained in the lexicon ad matching, if the matching succeeds, for example, to obtain matching "welcome" and the like, it is judged that the information unit ad units.

[0146] 3)所述信息单元在所述第一网页信息中的位置; [0146] 3) the location information of the first unit of the web page information;

[0147] 具体地,类别分析装置22根据第一网页信息中所包含的标识符,来区分信息单元,随后,类别分析装置22通过该信息单元在第一网页信息中的位置,来判断信息单元所属的类别; [0147] Specifically, the analysis device 22 according to the category identifier of the first web page information contained to distinguish the information unit, and then, category analysis device 22 through the information element in a first position of the page information to determine the information unit It belongs to the category;

[0148] 例如,类别分析装置22分析得到超过一定数量的,结构相近的信息单元位于第一网页信息后1/5位置内,则判断该信息单元为广告单元。 [0148] For example, the category analysis means 22 analyzes obtained over a certain number of, after the configuration information unit is located close to a first page information in the fifth position, it is judged that the information unit for the ad unit.

[0149] 4)与所述单元相关的信息单元的信息; [0149] 4) the cell information related with the information unit;

[0150] 具体地,类别分析装置22根据第一网页信息中所包含的标识符,来区分信息单元,随后,类别分析装置22通过查找与该信息单元具有相同标识符的信息单元的类别或查找与该信息单元位置相近且结构相似的信息单元的类别或包含的内容,来判断该信息单元所属的类别。 [0150] Specifically, the analysis device 22 according to the category identifier of the first web page information contained to distinguish the information unit, then the category information category analysis apparatus 22 having the same identifier by searching the lookup unit or the information unit categories or content contains information similar to the unit location information and similar structural element, to determine the category of the information unit belongs. 其中,所述结构相似是指两个信息单元中相同的部分超过一预定比例阈值,例如,超过50%等。 Wherein the structure is similar to refer to like parts in the two information elements exceeds a predetermined threshold proportion, e.g., more than 50% and the like. 在此,本领域技术人员应可根据实际需求来确定一个合理的预定阈值。 Here, the skilled artisan may determine a reasonable predetermined threshold value according to the actual needs.

[0151] 例如,类别分析装置22在判断一信息单元时,首先查找到其前一信息单元;随后,将其与待判断的信息单元进行对比,当两者标识符相同,且两者的文本匹配度高于一预定阈值,则判断该信息单元的类别与前一信息单元的类别相同。 [0151] For example, when the category analysis means 22 determines an information unit, first finds the previous one information unit; then, it is compared with the information unit to be determined when the two identifiers identical, and both text the degree of matching is higher than a predetermined threshold value, it is judged that the category information unit is the same as the previous category information element.

[0152] 需要说明的是,类别分析装置22在判断信息单元的类别的过程中,可综合上述因素进行判断,例如,当类别分析装置22检索到信息单元的文本内容与广告词库中的广告词相匹配,则再进一步判断该信息单元在第一网页中的位置及该信息单元是否具有结构相似的相邻的信息单元,若该信息单元位于第一网页信息后1/5位置内,且具有结构相似的信息单元,则判断该信息单元为广告单元,若该信息单元位于第一网页信息1/3-2/3的中间位置内,且该信息单元没有与其结构相似的相邻的信息单元,则判断该信息单元不是广告单元等。 [0152] Incidentally, the process 22 determines that the category information unit category analysis device, can be determined above factors, for example, when the type of the analysis apparatus 22 to the retrieved text information unit thesaurus ad ad the word match, then further determining whether the information element in the first page and the location information has a structure similar unit is adjacent information units, if the 1/5 position of the first units of information pages of information, and information element has a similar structure, it is judged that the information unit for the ad unit, if the intermediate position of the information unit located at a first web page information 1 / 3-2 / 3, and this information unit is not similar to its adjacent structure of the information unit, it is judged that the information unit is not the ad unit and the like.

[0153] 需要进一步说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明所做的限制,本领域技术人员应该理解,任何通过分析要素来确定信息单元的类别的方法,均应包含在本发明的范围内。 [0153] It is further noted that the above example is only to better illustrate the technical solutions of the present invention, does not limit the present invention, those skilled in the art will appreciate, any unit is determined by analyzing information Feature Category the method should be included within the scope of the present invention.

[0154] 转换装置23基于第一预定规则,结合所述各个信息单元的类别,来将所述第一网页信息转换为用于提供给所述用户设备的第二网页信息。 [0154] a first switching means 23 based on a predetermined rule, in conjunction with the respective category information unit, the first web page to second web page information into information to be provided to the user equipment.

[0155] 具体地,转换装置23根据第一预定规则中所记录的信息单元的类别与可执行操作之间的对应关系,来执行相应操作,以将所述第一网页信息转换为第二网页信息。 [0155] Specifically, the converting means 23 according to a correspondence relationship between the type and perform a predetermined operation rule in the first recorded information unit, to perform a corresponding operation to the first web page into a second web page information information.

[0156] 例如,第一预定规则中设定对于css单元,当其位于第一网页信息的起始位置时,不对其进行操作;当其位于第一网页信息的其他位置时,将其移至第一网页信息的起始位置。 [0156] For example, a first predefined rule for css setting means when it is in the starting position of the first web page information, without its operation; when it is in another position of the first web page information will move the starting position of the first web page information. 则当类别分析装置22判断得到信息单元的类别为css单元时,根据第一预定规则中的规则,结合CSS单元的当前位置,确定是否执行将CSS单元移动至起始位置的操作。 When the analyzing means 22 judges the category classification information obtained when the cell is css unit, according to a first predetermined rule in the rule, with the current position of the CSS unit, the operation determines whether the CSS unit is moved to the starting position. 由于CSS单元影响网页的结构,而浏览器在生成网页的过程中通常是按照第一网页信息的内容从头至尾生成,因此,通过将CSS单元前置,能够避免浏览器在生成一部分网页后,由于检测到CSS单元,因此需要重新生成网页的问题,加快了浏览器生成网页的速度。 Since the CSS unit affect the structure of the page and the process of generating the browser page from start to finish is usually generated as the content of the first web page information, therefore, by the CSS unit front can be generated as part of the browser on the page to avoid, Since the detection unit to CSS, so the need to regenerate the page in question, speeds up browser generated web pages.

[0157] 需要说明的是,根据信息单元的类别来调整信息单元位置的方式,不以上述举例为限,本领域技术人员应该理解,只要是根据信息单元的类别,将影响网页结构的信息单元前置的方案,均应包含在本发明的范围内。 [0157] Note that, by adjusting the information unit location based on the type information element way, not in the above example is limited, those skilled in the art will appreciate, as long as according to the category information unit, the influence of information units on a page structure pre-program should be included within the scope of the present invention.

[0158] 又例如,第一预定规则中设定,删除注释单元,则当类别分析装置22判断得到信息单元的类别为注释单元时,将注释单元删除。 [0158] As another example, a first predefined rule set, delete comment unit, the classification information obtained when the unit cell is a comment, the comment determination unit deletes category analysis device 22. 由于注释不影响网页生成,因此,将注释删除,能够减少浏览器加载网页内容的时间,也减少了用户需要下载的流量,加快了网页呈现的速度。 Since the comment does not affect the Web page generation, therefore, to delete the comment, you can reduce the time browser loads the page content, also reduces the need for users to download traffic, accelerate the speed of page rendering.

[0159] 需要说明的是,根据信息单元的类别来删除信息单元的方式,不以上述举例为限,本领域技术人员应该理解,只要是根据信息单元的类别,删除不影响网页生成的信息单元的方案,均应包含在本发明的范围内。 [0159] Incidentally, to remove the information unit based on the type information element way, not in the above example is limited, those skilled in the art will appreciate, as long as according to the type of information units deleted without affecting the information element generated page program, should be included within the scope of the present invention.

[0160] 当转换装置23完成对第一网页信息的所有处理后,将处理后的第一网页信息作为第二网页信息。 [0160] When the information converting means 23 to complete all processing of the first page, the first page after the page information as the second processing information.

[0161] 需要进一步说明的是,根据第一预定规则,结合信息单元类别,来将所述第一网页信息转换为用于提供给所述用户设备的第二网页信息的处理方法,并不以上述举例为限,例如,处理方法还可包括屏蔽垃圾信息单元、凸显正文单元和标题单元等等。 [0161] It is further noted that, according to a first predetermined rule, in conjunction with the category information unit, to the first page of the second page information conversion processing method for providing information to the user equipment, not to limited to the example described above, e.g., a processing method may further include a shield spam unit, and highlights the text header units like unit.

[0162] 需要更进一步说明的是,类别分析装置22与转换装置23各自所执行的操作并无绝对的先后顺序,类别分析装置22在每判断一个信息单元类别后,转换装置23即可执行相应的操作,也可当类别分析装置22判断所有信息单元的类别后,转换装置23再执行相应的操作。 [0162] further explanation is required, the operating category analysis means 22 and the conversion means 23 is not executed by the respective absolute order, in each category analysis means 22 determines a category information unit, the switching means 23 to perform a corresponding operation, also when all category analysis means 22 determines the category information unit, the converting means 23 then performs a corresponding operation.

[0163] 网络设备2将转换装置23生成的第二网页信息发送给用户设备1,用户设备I根据第二网页信息,生成网页以呈现给用户。 The second web page information [0163] The network device 2 generated by the converting means 23 1 to the user equipment, a user device I according to a second web page information, generates a web page for presentation to the user.

[0164] 作为本发明的一个优选实施例,网络设备2可进一步包括web设备及优化设备。 [0164] As a preferred embodiment of the present invention, the network device may further include a web 2 and device optimization apparatus embodiment. 则获取装置22包含在web设备中,类别分析装置22及转换装置23包含在优化设备中。 The obtaining means 22 comprises a web in the apparatus, the category analysis means 22 and switching means 23 includes the optimization apparatus.

[0165] 用户设备I将第一请求发送至web设备。 [0165] The user device I first request to the web device. 其发送方式已在参照图7所示的实施例中详述,并以引用的方式包含于此,不再赘述。 Which has been transmitted in a manner described in detail in reference to the embodiment shown in FIG. 7, and is incorporated herein by reference, not repeated. 获取装置22根据第一请求获取第一网页信息。 A first acquisition means 22 acquires the web page information according to the first request. 其获取方式已在与参照图7所示的实施例中详述,并以引用的方式包含于此,不再赘述。 Obtaining which has been detailed with reference to the embodiment shown in FIG. 7, and is incorporated herein by reference, not repeated.

[0166] 随后,web设备将第一网页信息发送给优化设备,优化设备获取该待处理的第一网页信息。 [0166] Subsequently, web page information of the first device transmits to the optimization apparatus optimizing apparatus acquires the web page information to be processed first.

[0167] 接着,类别分析装置22及转换装置23将第一网页信息处理为第二网页信息。 [0167] Next, category analysis device 22 and the conversion means 23 of the first page of the second information processing web page information. 类别分析装置22及转换装置23将第一网页信息处理为第二网页信息的方式已在参照图7所示的实施例中详述,并以引用的方式包含于此,不再赘述。 Category analysis device 22 and the mode switching means 23 to the first page of the second information processing web page information have been described in detail in reference to the embodiment shown in FIG. 7, and is incorporated herein by reference, not repeated.

[0168] 接着,优化设备将第二网页信息发送给web设备,web设备再将第二网页信息提供给用户设备1,用户设备I根据第二网页信息,生成网页以呈现给用户。 [0168] Next, a second optimization device transmits the web page information to the web device, the device and then a second web page information to the user equipment 1, the user device I according to a second web page information, generates a web page for presentation to the user.

[0169] 图8为本发明一个优选实施例的用于优化网页的网络设备结构图。 [0169] FIG. 8 is preferably a device optimized network configuration diagram of a web page for the embodiment of the embodiment of the present invention. 本实施例中,类别分析装置22可包含于网络设备2或包含于网络设备2的优化设备中,其中,类别分析装置22还进一步包括匹配查询装置221及确定装置222。 In this embodiment, the category may be included in the analysis device 22 or a network device 2 for optimizing the network device 2, wherein the category analysis apparatus 22 further comprising a match query further determining means 221 and means 222.

[0170] 获取装置21及转换装置23已在参照图7所示的实施例中予以详述,并以引用的方式包含于此,不再赘述。 [0170] means 21 and switching means 23 has been acquired in the illustrated embodiment with reference to FIG. 7 to be described in detail in, and incorporated herein by reference, not repeated.

[0171] 匹配查询装置221根据所述第一网页信息的链接地址在模板库24中进行匹配查询,以获取相应的类别识别模板。 [0171] The match query 221 to the first link address matches the query web page information in the template gallery 24, to obtain the corresponding class identification template.

[0172] 具体地,模板库24中包含了各个类别识别模板及与该各个类别识别模板对应的链接地址,匹配查询装置221将第一网页信息的链接地址与模板库中的链接地址进行匹配,得到能够成功匹配的类别识别模板。 [0172] Specifically, the template database 24 contains various categories of the recognition template, and the template corresponding to each category identified link address, the match query 221 to the first link address to the web page template library information in the link address match, been able to successfully identify the template matching category. 其中,当匹配查询装置221能够成功匹配到多个链接地址时,选择匹配度最高的链接地址所对应的类别识别模板。 Wherein, when the apparatus 221 can match the query successfully matched to a plurality of link address, select a category template matching to identify the highest address corresponding to the link.

[0173] 其中,匹配度可根据两条链接地址的表现形式之间的相似程度来计算,该表现形式包括但不限于基于http,https, ftp,tencent协议的URL地址或IP地址,MAC地址等。 [0173] wherein, the matching degree may be calculated according to the degree of similarity between the two link address manifestations, manifestations of which include but are not limited based on http, https, ftp, URL address or IP address tencent protocol, MAC address, etc. . 例如,第一网页信息的链接地址表现为如下URL地址http://news, sina.com, cn/society,匹配查询装置221在模板库24中成功匹配到多个链接: For example, the first information on the website link address showed the following URL address http: // news, sina.com, cn / society, match the query device 221 successfully matched to multiple links in the template library 24:

[0174] www.sina.com, cn: [0174] www.sina.com, cn:

[0175] http: //finance, sina.com, cn/stock/: [0175] http: // finance, sina.com, cn / stock /:

[0176] http: //mobile, sina.com, cn/: [0176] http: // mobile, sina.com, cn /:

[0177] http: //news, sina.com, cn/s/sd/:及. [0177] http: // news, sina.com, cn / s / sd /: and.

[0178] http: //news, sina.com, cn/society: [0178] http: // news, sina.com, cn / society:

[0179] 其中,根据字符串相似度可以确定与第一网页信息的链接地址表现形式匹配度最高的链接为http://news, sina.com, cn/society,该链接对应“类别识别模板一”,则匹配查询装置221选择“类别识别模板一”作为与第一网页信息相对应的类别识别模板。 [0179] wherein, similarity can be determined in accordance with the string form of links exhibit the highest degree of matching with a link address information of the first page is http: // news, sina.com, cn / society, the link corresponding to "category identifying a template ", the matching means 221 selects the query" category a recognition template "as the first web page information corresponding to the type identification template.

[0180] 确定装置222根据第一网页信息所包含的各个信息单元,并结合所述类别识别模板,来确定所述各个信息单元所属的类别。 [0180] determining apparatus 222 according to various information units included in the first page information, combined with the template category identification, to determine the category information of the respective unit belongs.

[0181] 具体地,在结合前述实施例中第一预定规则所参考因素的基础上,确定装置222进一步根据类别识别模板所提供的信息,来信息单元进行针对性更强的识别操作,以下将结合前述参考因素,予以详述: [0181] Specifically, on the basis of the foregoing embodiments in conjunction with the first embodiment the predetermined rule with reference to factors, information type determining means 222 is further provided in accordance with the recognition template to more targeted information unit identification operation, will be combined with the aforementioned reference factor, to be detailed:

[0182] I)所述信息单元的标识符; [0182] I) the information element identifier;

[0183] 确定装置222结合类别识别模板中记录的标识符所表示的含义,来判断信息单元所属的类别。 [0183] determining means 222 in conjunction with the meaning of the recognition template category identifier recorded represented determines the category information unit belongs.

[0184] 例如,“类别识别模板一”中记录,标识符“ [ad] ”表示广告,则确定装置222判断标识符为“ [ad] ”的信息单元为广告单元。 [0184] For example, "a recognition template category" is recorded, the identifier "[AD]" indicates that the ad, determining means 222 determines the identifier "[AD]" information element for the ad unit.

[0185] 2)所述信息单元的文本内容; [0185] 2) the text content of the information unit;

[0186] 确定装置222结合类别识别模板中记录的文本内容的相关信息,判断信息单元所属的类别。 [0186] determining means 222 in conjunction with the text information recorded in the recognition template category, the category determination information unit belongs.

[0187] 例如,“类别识别模板一”中记录,当一个信息单元所包含的文本字数超过一预设阈值时,该信息单元为重要信息单元,则确定装置222判该信息单元为突显单元。 [0187] For example, the "category recognition template a" recording, when the text words of an information unit contains more than a predetermined threshold value, the information element is important information unit, it is determined that the device 222 judged that the information unit is a highlight cell.

[0188] 3)所述信息单元在所述第一网页信息中的位置; [0188] 3) the location information of the first unit of the web page information;

[0189] 确定装置222结合类别识别模板中记录的信息单元的位置与其所属类别的对应关系,来判断信息单元所属的类别。 [0189] correspondence relationship determining means 222 in conjunction with its position Category type identification information element recorded template, to determine the category information unit belongs.

[0190] 例如,“类别识别模板一”中记录,位于第一网页信息后1/3位置内的内容为广告信息,则确定装置222判断位于第一网页信息后1/3位置内的信息单元为广告单元。 [0190] For example, "a recognition template category" in the record, the first rear third of the web page information contents position information for an ad, it is determined that the device information determination unit 222 in the third position at the first web page information for the ad unit.

[0191] 4)与所述信息单元相关的信息单元的信息; [0191] 4) information relating to the information element of the information unit;

[0192] 例如,“类别识别模板一”中记录,当存在超过4个结构相似且位置相近的信息单元时,该信息单元为用于内容推荐的信息单元,则确定装置222判断该类信息单元为推荐单 [0192] For example, "a type identification template" recording, when there are more than four of similar structure and similar to the position of the information unit, the information unit is a unit for content recommendation information, the determination means 222 determines the kind of information means to recommend a single

J L.ο J L.ο

[0193] 需要说明的是,确定装置222在判断信息单元的类别的过程中,可综合上述因素进行判断,例如,“类别识别模板一”中记录,当存在超过4个结构相似且位置相近的信息单元时,需进一步根据信息单元所处的位置进行判断,若信息单元所处的位置为第一网页信息中靠前1/2至3/4的位置内,则该信息单元为推荐单元;若信息单元所处的位置为第一网页信息中靠后1/5的位置内,则该信息单元为广告单元等。 [0193] Incidentally, the process determining means 222 determines that the category information unit may be judged above factors, e.g., "a type identification template" recording, when there are more than four similar structure and a similar position when the information unit, need further determination unit is located according to the location information, if the position of a first information unit is located within the web page information in the forward position of 1/2 to 3/4, then the information unit is a recommendation unit; If the position of a first information unit is located on the web page information after the position of 1/5, the information unit for the ad unit and the like.

[0194] 需要进一步说明的是,上述举例仅为更好地说明本发明的技术方案,而非对本发明所做的限制,本领域技术人员应该理解,任何通过结合类别识别模板及要素分析来确定信息单元的类别的方法,均应包含在本发明的范围内。 [0194] It is further noted that the above example is only to better illustrate the technical solutions of the present invention, it does not limit the present invention, those skilled in the art will appreciate, any type identification determined by binding template and factor analysis the method of the category information element shall be included within the scope of the present invention.

[0195] 优选地,本实施例还包括更新装置(图未示),更新装置用于根据用户经由所述用户设备发送的反馈信息和/或所述第二网页信息,来确定待更新或待建立的类别识别模板。 [0195] Preferably, the present embodiment further includes updating means (not shown), means for updating, via feedback information to determine the user equipment and / or the second web page according to the user information to be updated or be the establishment of category recognition template.

[0196] 具体地,当用户设备I将基于第二网页信息生成的网页呈现给用户后,用户可再次通过人机交互,经由用户设备I向网络设备2发送反馈信息,该反馈信息包括用户对于网页优化的满意度,更新装置记录用户的反馈信息,并选择用户评价值低于一预定阈值的第二网页信息所采用的类别识别模板,以作为待更新的类别识别模板;或者,若该第二网页信息未采用类别识别模板,则更新装置记录该第二网页信息的链接地址,以确定在模板库中建立与该链接地址相对应的类别识别模板。 [0196] Specifically, when the user device I will be presented to the user based on a second web page information generated by the user can be interactive, user equipment via the I 2 network device transmits feedback information, the feedback information comprises a user again to page is optimized satisfaction, updating means records the user's feedback information, and select the category recognition template user evaluation value is below a second predetermined threshold web page information used to identify the type as a template to be updated; Alternatively, if the second two page template category identification information is not employed, the device records the second web page link address information is updated to determine the establishment of the link address corresponding to the class identifier template in the template gallery.

[0197] 图9为根据本发明另一优选实施例的用于优化网页的网络设备结构图。 [0197] Example 9 is a configuration diagram of a network device optimizing a web page according to another preferred embodiment of the present invention. 本实施例中,类别分析装置22可包含于网络设备2或包含于网络设备2的优化设备中,其中,类别分析装置22还进一步包括子类别分析装置223。 In this embodiment, the category may be included in the analysis device 22 or a network device 2 for optimizing the network device 2, wherein the category analysis apparatus 22 further includes analysis means 223 subcategories.

[0198] 获取装置21及转换装置23已在参照图7所示的实施例中予以详述,并以引用的方式包含于此,不再赘述。 [0198] means 21 and switching means 23 has been acquired in the illustrated embodiment with reference to FIG. 7 to be described in detail in, and incorporated herein by reference, not repeated.

[0199] 子类别分析装置223通过结合用户相关信息对对所述第一网页信息所包含的各个信息单元进行分析,以确定所述各个信息单元所属的类别。 [0199] subcategories analyzer 223 analyze the various information units included in the first web page information by combining the user-related information, to determine the category information of the respective unit belongs. 其中,网络设备2通过识别用户身份,来获取该用户的用户相关信息,网络设备2可根据以下方式识别用户身份:1)用户设备I的唯一识别码,例如,手机号、用户设备的硬件识别码等;2)用户的注册信息;3)记录在用户设备cookie中的信息等。 Wherein, the network device 2, to obtain user related information of the user by identifying the user, the network device 2 according to the following manner to identify the user: 1) a unique identifier of the user device I, e.g., phone number, the hardware to identify the user equipment code and the like; 2) the user registration information; 3) the information recorded in the cookie of the user equipment and the like. 用户相关信息可保存在网络设备2中,或者,用户相关信息保存在用户设备I中,并由网络设备2获取,或者,网络设备2综合保存在用户设备I及网络设备2中的信息,得到用户相关信息。 User-related information can be stored in the network device 2, or user-related information stored in the user device I, acquired by the network device 2, or 2 integrated network device information stored in the user device I and 2 network equipment, get user-related information.

[0200] 其中,所述用户相关信息可由用户主动提供,或网络设备根据记录的用户行为推测得到。 [0200] wherein the user-related information by a user unsolicited or the network user devices obtained according to the estimation of the behavior record. 子类别分析装置223可结合以下至少一项用户相关信息,来分析信息单元的类别: Subcategory analysis device 223 may incorporate at least one user-related information, category information analyzing unit:

[0201] I)用户的个人属性,包括用户的年龄、性别、身份、收入、教育程度等; [0201] I) the user's personal attributes, including the user's age, gender, identity, income, education and so on;

[0202] 2)用户的偏好设置,包括屏蔽网页内容的偏好设置,突显网页内容的偏好设置等; Preferences [0202] 2) a user, comprising a shield web content preferences, highlighting web content preferences, etc.;

[0203] 3)用户的历史行为,包括用户浏览、点击网页的行为记录等; [0203] 3) the historical behavior of the user, including the user's browser, click on the page's behavior records;

[0204] 4)用户的环境信息,包括用户所在的位置信息、用户当前的时间信息及用户设备相关信息等,其中,用户设备相关信息包括但不限于:网络运营商、用户设备类型,IMEI,用户设备操作系统信息、屏幕分辨率、软件信息等。 [0204] 4) environment information of a user, including the location information of the user is located, the user's current time information and the user device-related information, wherein the user information about the device, including but not limited to: the network operator, user equipment type, IMEI, user device operating system information, screen resolution, and software information.

[0205] 例如,当用户相关信息包含该用户为女性,则子类别分析装置223判断包含“月艮装”、“购物”等词汇的信息单元为突显单元。 [0205] For example, when the user information containing the user is female, the subcategory information analyzing means comprises a means 223 determines "Burgundy loaded month", "shopping" and other words of highlight cell.

[0206] 又例如,当用户在偏好设置中设置突显标题,则子类别分析装置223将检测到的标题单元判断为突显单元。 [0206] As another example, when the user sets preferences highlighted title, the sub-category analysis device 223 will detect the header unit determines that the highlight cell.

[0207] 又例如,当用户在一预设的时间长度内所记录的用户行为仅包括该用户通过新网的新闻页面主页点击打开网页的行为,而无该用户进一步在打开的网页上进行点击的行为,则子类别分析装置223可基于所记录的用户行为判断该用户仅浏览网页中的正文,故可将正文以外的其他信息单元确定为可忽略单元。 [0207] In another example, when the user behavior of users within a predetermined length of time recorded only includes the user clicks to open pages through the new network news page homepage behavior, without further user clicks on an open Web page behavior, the sub-category analysis device 223 may be recorded based on the user behavior is determined that the user to view only the text page, it means other than text information may be determined to be negligible unit.

[0208] 再例如,子类别分析装置223根据用户设备I当前的IP地址,判断用户所在位置为上海,则当信息单元的文本内容中包括“上海”时,子类别分析装置223可确定该信息单元为突显单元。 When [0208] In another example, the sub-category analysis device 223 according to a user device I the current IP address, determines user location in Shanghai, when the text information unit includes "Shanghai", sub-category analysis device 223 may determine that the information to highlight the unit cell.

[0209] 需要说明的是,子类别分析装置223也可进一步包含匹配查询装置221及确定装置222,以结合类别识别模板及用户相关信息,来确定信息单元所属的类别。 [0209] Incidentally, the sub-category analysis device 223 may further comprise means match query 221 and determining device 222 to identify the category template binding and user-related information, to determine the category information unit belongs.

[0210] 需要进一步说明的是,上述举例仅为更好地说明本发明的方案,而非对本发明的限制,本领域技术人员应该理解,根据任何其他的用户相关信息以及基于用户相关信息来判断信息单元所属类别的任何其他方式,均应包含在本发明的范围内。 [0210] It is further noted that the above example is only to better illustrate the present invention, not limitation of the invention, those skilled in the art will appreciate, based on the user information and related information is determined according to any other user any other way of category information element shall be included within the scope of the present invention.

[0211] 图10为本发明再一个优选实施例的用于优化网页的网络设备结构图。 [0211] FIG. 10 the present invention further optimize the network device configuration diagram of a preferred embodiment for the web embodiment. 本实施例中,转换装置23可包含于网络设备2或包含于网络设备2的优化设备中,其中,转换装置23还进一步包括子转换装置231。 In this embodiment, switching means 23 may comprise a network device 2 for optimizing or a network device 2, wherein the switching means 23 further includes converting means 231 sub.

[0212] 获取装置21及类别分析装置22已在参照图7、图8或图9所示的实施例中予以详述,并以引用的方式包含于此,不再赘述。 [0212] obtaining means 21 and 22 are in category analysis device 7, to be described in detail the embodiment shown in FIG. 8 or FIG. 9, and is incorporated herein by reference, not repeated.

[0213] 子转换装置231根据所述第一预定规则,并基于所述各个信息单元的类别,来对所述各个信息单元执行相应的操作,以将所述第一网页信息转换为第二网页信息。 [0213] Sub-conversion device 231 according to the first predetermined rule, each category based on said information units, performs a corresponding operation on the respective information units, the first web page to second web page information is converted into information.

[0214] 其中,所述第一预定规则包括参考以下至少一项因素来确定所述相应的操作: [0214] wherein said first predetermined rule comprises at least one reference to the following factors to determine the appropriate action:

[0215] I)预设的所述类别与可执行操作之间的对应关系; [0215] I) a preset correspondence relationship between the categories and executable operations;

[0216] 具体地,在第一预定规则中,规定了每一种信息单元类别所对应的可执行操作,子转换装置231根据信息单元类别与可执行操作之间的对应关系,来对各个信息单元执行相应的操作,当所有操作完成后,则将处理后的第一网页信息作为第二网页信息。 [0216] Specifically, at a first predetermined rule, each of the predetermined information element corresponding to the category of executable operations, the sub conversion apparatus 231 according to a correspondence relationship between the category information and perform the operation unit to each information unit performs the corresponding operation, when all operation is completed, the first web page is being processed, the information as the second web page information.

[0217] 例如,第一预定规则规定了注释单元及广告单元所对应的可执行操作为删除操作,则当子转换装置231检测到注释单元,将该注释单元删除; [0217] For example, a first predetermined unit annotation rules and advertising units corresponding executable operations for the delete operation, when the sub conversion means 231 detects the annotation means, the annotation unit deletes;

[0218] 又例如,第一预定规则规定了当css单元未处于网页信息的起始位置时,将其置于起始位置,则当子转换装置231检测到css单元时,检测css单元所处的位置,当其位置不为起始位置时,将其移至起始位置; [0218] As another example, a first predetermined rules when the unit is not in the starting position css web page information, which is placed at the beginning, when the sub conversion means 231 detects css means, detecting means located css position, when the position is not the starting position, to move the starting position;

[0219] 又例如,第一预定规则规定了以红色字体来对突显单元中的文本内容进行突显,则当子转换装置231检测到突显单元时,将突显单元的文本内容的色彩格式更改为红色; [0219] As another example, a first predetermined rules specify highlighted in red to highlight the textual content unit, the switching means when the sub-unit 231 detects the highlight, the highlight color format change unit is red text ;

[0220] 再例如,第一预定规则规定了标记可忽略单元,则当子转换装置231检测到可忽略单元时,对可忽略单元进行标记,以供用户设备I识别可忽略单元,则用户设备I可根据用户的选择,将所述可忽略单元生成在网页中,呈现给用户;或者,屏蔽该可忽略单元,不将其呈现给用户。 [0220] As another example, a first predetermined rules specify marking means negligible, then when the sub conversion device 231 detects the unit can be ignored, can be ignored for marks unit, for the user equipment unit I recognition negligible, the user equipment I according to a user's selection, the page can be ignored in the generation unit, presented to the user; alternatively, the shield unit can be ignored, it is not presented to the user.

[0221] 2)用户相关信息; [0221] 2) information related to the user;

[0222] 具体地,第一预定规则中包含:根据用户相关信息及信息单元的类别,来对信息单元执行相应操作的规则。 [0222] Specifically, a first predefined rule comprises: the user related information and the category information unit, the rule performs corresponding operations information units.

[0223] 例如,若用户在用户偏好设置中规定以灰色背景的方式,对突显单元进行突显,则子转换装置231将突显单元的背景更改为灰色; [0223] For example, if the user specifies a gray background in the manner of highlighting unit highlights the user preferences, the sub converter unit 231 will highlight the background is changed to gray;

[0224] 又例如,若用户在超过一预定次数中,从未选择呈现可忽略单元,则子转换装置231将可忽略单元的透明度调整为59%,以对可忽略单元进行淡化处理。 [0224] As another example, if the user exceeds a predetermined number of times, the nonselected negligible presentation unit, the sub conversion means negligible transparency unit 231 is adjusted to 59%, for treatment of desalination units is negligible.

[0225] 需要说明的是,子转换装置231还可根据第一预定规则,结合上述两者,来将第一网页信息转换为第二网页信息。 [0225] Incidentally, the sub conversion apparatus 231 according to a first predetermined rule may be, a combination of both, to convert the first information to a second web page web page information. 例如,第一预定规则中规定,可屏蔽单元所对应的可执行操作包括标记删除及淡化,需要结合用户相关信息来选择一项操作,则当检测到可屏蔽单元时,子转换装置231根据用户相关信息,来选择屏蔽、删除或者淡化操作。 For example, a first predetermined rules provide that a shielding unit corresponding executable operations comprising dilute and marked for deletion, the user need to combine related information to select an operation, when the shielding unit can be detected, the sub conversion apparatus 231 according to a user related information, to choose to block, delete or fading operations.

[0226] 需要进一步说明的是,上述举例仅为更好地说明本发明的内容,而非对本发明的限制,本领域技术人员应该理解,根据第一预定规则来对所述各个信息单元执行相应的操作,以将所述第一网页信息转换为第二网页信息的方案,均应包含在本发明的范围内。 [0226] It is further noted that the above example is only to better illustrate the present invention, not limitation of the invention, those skilled in the art will appreciate, to perform the respective information units according to respective first predetermined rule operation, to convert the first information to the second web page web page information of the program, should be included within the scope of the present invention.

[0227] 本发明中的各预定阈值,均可由本领域技术人员根据实际需求来确定。 [0227] Each predetermined threshold value in the present invention, may be determined by one skilled in the art according to actual demand.

[0228] 对于本领域技术人员而言,显然本发明不限于上述示范性实施例的细节,而且在不背离本发明的精神或基本特征的情况下,能够以其他的具体形式实现本发明。 In the case [0228] to those skilled in the art, that the invention is not limited to the details of the above-described exemplary embodiment, but without departing from the spirit or essential characteristics of the present invention, the present invention can be realized in other specific forms. 因此,无论从哪一点来看,均应将实施例看作是示范性的,而且是非限制性的,本发明的范围由所附权利要求而不是上述说明限定,因此旨在将落在权利要求的等同要件的含义和范围内的所有变化涵括在本发明内。 Therefore, no matter from what point of view, the embodiments should be considered exemplary, and not limiting, the scope of the invention being indicated by the appended claims rather than by the foregoing description, the appended claims are therefore intended to All changes which come within the meaning and range of equivalents thereof should be covered within the present invention. 不应将权利要求中的任何附图标记视为限制所涉及的权利要求。 In the claims should not be considered as any reference numerals as claimed in claim limitations involved. 此夕卜,显然“包括” 一词不排除其他单元或步骤,单数不排除复数。 Bu this evening, apparently "comprising" does not exclude other elements or steps, the singular does not exclude a plurality. 系统权利要求中陈述的多个单元或装置也可以由一个单元或装置通过软件或者硬件来实现。 A plurality of units or means recited in the claims the system can also be implemented by a single unit or through software or hardware. 第一,第二等词语用来表示名称,而并不表示任何特定的顺序。 The first, second, etc. are used to indicate the name, but does not indicate any particular sequence.

Claims (20)

1.一种在网络设备中用于优化网页的方法,其中,该方法包括以下步骤: a获取待处理的第一网页信息; b分析所述第一网页信息所包含的各个信息单元,以确定所述各个信息单元所属的类别; c基于第一预定规则,结合所述各个信息单元的类别,来将所述第一网页信息转换为用于提供给用户设备的第二网页信息; 其中,所述步骤b包括以下步骤: -根据第一网页信息所包含的各个信息单元,并结合类别识别模板以及用户相关信息,来确定所述各个信息单元所属的类别; 其中,所述步骤b中根据以下至少一项因素,来确定所述信息单元的类别: -所述信息单元在所述第一网页信息中的位置; -与所述信息单元相关的信息单元的信息。 A method for optimizing a web page in a network device, wherein the method comprises the steps of: a page acquiring first information to be processed; b each information unit analyzes the information included in a first web page to determine the respective category information unit belongs; C based on a first predefined rule, in conjunction with the various categories of information units to the first web page information is converted into a second web page to provide information to a user device; wherein the said step b comprises the steps of: - a first unit in accordance with various information included in the web page information, in conjunction with the category information and the user identification templates, to determine the category of each information unit belongs; wherein said step b according to the following at least one factor determining the cell type information: - the position in the first information unit of the web page information; - information related to the information units.
2.根据权利要求1所述的方法,其中,所述步骤b中还根据以下至少一项因素,来确定所述信息单元的类别: -所述信息单元的标识符; -所述信息单元的文本内容。 The method according to claim 1, wherein said step (b) further according to at least one of the following factors to determine the category of the information unit: - the information element identifier; - the information element text content.
3.根据权利要求1或2所述的方法,其中,所述步骤b还包括以下步骤: -根据所述第一网页信息的链接地址在模板库中进行匹配查询,以获取相应的所述类别识别模板。 3. The method of claim 1 or claim 2, wherein said step b further comprises the step of: - matching the query in a template library according to the link address of the first web page information, corresponding to the category to obtain recognition template.
4.根据权利要求3所述的方法,其中,该方法还包括以下步骤: -根据用户经由所述用户设备发送的反馈信息和/或所述第二网页信息,来确定待更新或待建立的类别识别模板。 4. The method according to claim 3, wherein the method further comprises the step of: - according to the feedback information sent by the user via the user device and / or the second web page information to be updated is determined to be established, or category recognition template.
5.根据权利要求1或2所述的方法,其中,所述步骤c包括以下步骤: -基于所述第一预定规则,结合所述各个信息单元的类别,来对所述各个信息单元执行相应的操作,以将所述第一网页信息转换为第二网页信息。 5. The method of claim 1 or claim 2, wherein said step c comprises the steps of: - a first predetermined rule based on the binding type of the respective information units, each performing the respective information units operation, to convert the first information the second web page web page information.
6.根据权利要求5所述的方法,其中,所述第一预定规则包括参考以下至少一项因素来确定所述相应的操作: -预设的所述类别与可执行操作之间的对应关系; -用户相关信息。 6. The method as claimed in claim 5, wherein said first predetermined rule comprises at least one reference to the following factors to determine the appropriate action: - a preset correspondence relationship between the categories and executable operations ; - user-related information.
7.根据权利要求1所述的方法,其中,所述用户相关信息包括以下至少一项: -用户的个人属性; -用户的偏好设置; -用户的历史行为; -用户的环境信息。 7. The method according to claim 1, wherein the user-related information comprises at least one of: - the user's personal attribute; - user preferences; - historical behavior of the user; - the user environment information.
8.根据权利要求1或2所述的方法,其中,所述步骤a还包括以下步骤: -获取来自用户设备的第一请求,该第一请求用于为用户设备请求处理待处理的第一网页信息; -根据所述第一请求,获取所述待处理的第一网页信息。 8. The method of claim 1 or claim 2, wherein said step a further comprises the step of: - obtaining a first request from a user device, the first request to a first user equipment requests for processing pending web page information; - according to the first request, the acquired first page information to be processed.
9.根据权利要求8所述的方法,其中,所述获取所述待处理的第一网页信息的步骤包括以下步骤: -从所述第一请求中提取所述待处理的第一网页信息的链接地址; -根据所述链接地址,获取所述待处理的第一网页信息。 9. A method according to claim 8, wherein the obtaining of the first web page information to be processed comprises the steps of: - extracting a first page of the transaction information from the first request link address; - according to the link address, acquires the first page of the information to be processed.
10.根据权利要求1或2所述的方法,其中,所述网络设备包括:网络主机、单个网络服务器、多个网络服务器集或基于云计算的计算机集合。 10. The method of claim 1 or claim 2, wherein, the network device comprising: a host network, a single network server, or a set of a plurality of network servers based on a set of computer-cloud.
11.一种用于优化网页的网络设备,其中,该网络设备包括: 获取装置,用于获取待处理的第一网页信息; 类别分析装置,用于分析所述第一网页信息所包含的各个信息单元,以确定所述各个信息单元所属的类别; 转换装置,用于基于第一预定规则,结合所述各个信息单元的类别,来将所述第一网页信息转换为用于提供给用户设备的第二网页信息; 其中,所述类别分析装置包括: 子类别分析装置,用于根据第一网页信息所包含的各个信息单元,并结合类别识别模板以及用户相关信息,来确定所述各个信息单元所属的类别; 其中,所述类别分析装置根据以下至少一项因素,来确定所述信息单元的类别: -所述信息单元在所述第一网页信息中的位置; -与所述信息单元相关的信息单元的信息。 11. An apparatus for optimizing web page, wherein the network device comprises: acquiring means for acquiring a first page of information to be processed; category analysis means for analyzing each of the web page information contained in the first information unit, to determine the category of each information unit belongs; conversion means, based on a first predefined rule, in conjunction with the respective unit category information to the first information into a web page provided to the user equipment a second web page information; wherein, the category analysis apparatus comprising: a subcategory analyzing means for each information unit contained in a first web page information, in conjunction with the recognition template category and user-related information, determining the respective information categories the cell belongs; wherein, said category analysis device according to at least one factor determining the cell type information: - the position in the first information unit of the web page information; - information unit with the Related information information element.
12.根据权利要求11所述的网络设备,其中,所述类别分析装置还根据以下至少一项因素,来确定所述信息单元的类别: -所述信息单元的标识符; -所述信息单元的文本内容。 12. The network device of claim 11, wherein said analyzing means further categories according to at least one of the following factors to determine the category information unit: - the information element identifier; - the information unit text content.
13.根据权利要求11或12所述的网络设备,其中,所述类别分析装置还包括: 匹配查询装置,用于根据所述第一网页信息的链接地址在模板库中进行匹配查询,以获取相应的所述类别识别模板。 13. The network device of claim 11 or claim 12, wherein said apparatus further comprises a category analysis: query matching means for matching queries in a template library according to the link address of the first page information, to obtain the respective template category identification.
14.根据权利要求13所述的网络设备,其中,该网络设备还包括: 更新装置,用于根据用户经由所述用户设备发送的反馈信息和/或所述第二网页信息,来确定待更新或待建立的类别识别模板。 Via feedback information sent by the user equipment and / or the second web page information to be updated is determined according to a user updating means for: 14. The network device according to claim 13, wherein the network device further comprises or to be established categories of recognition template.
15.根据权利要求11或12所述的网络设备,其中,所述转换装置包括: 子转换装置,用于基于所述第一预定规则,结合所述各个信息单元的类别,来对所述各个信息单元执行相应的操作,以将所述第一网页信息转换为第二网页信息。 15. The network device of claim 11 or claim 12, wherein said converting means comprises: a sub conversion means based on the first predetermined rule, in conjunction with the respective category information unit, to the respective performs a corresponding operation information unit, to convert the first information the second web page web page information.
16.根据权利要求15所述的网络设备,其中,所述第一预定规则包括参考以下至少一项因素来确定所述相应的操作: -预设的所述类别与可执行操作之间的对应关系; -用户相关信息。 16. The network device according to claim 15, wherein said first predetermined rule comprises at least one reference to the following factors to determine the appropriate action: - correspondence between the category and perform a predetermined operation relations; - user-related information.
17.根据权利要求11所述的网络设备,其中,所述用户相关信息包括以下至少一项: -用户的个人属性; -用户的偏好设置; -用户的历史行为; -用户的环境信息。 17. The network device of claim 11, wherein the user-related information comprises at least one of: - the user's personal attribute; - user preferences; - historical behavior of the user; - the user environment information.
18.根据权利要求11或12所述的网络设备,其中,所述获取装置还包括以下装置: 第一子获取装置,用于获取来自用户设备的第一请求,该第一请求用于为用户设备请求处理待处理的第一网页信息; 第二子获取装置,用于根据所述第一请求,获取所述待处理的第一网页信息。 18. The network device of claim 11 or claim 12, wherein said acquiring means further comprises means for: acquiring a first sub-means for acquiring a first request from a user device, the first request for a user the information processing device requests the first web page to be processed; a second sub-obtaining means, according to the first request, the acquired first page information to be processed.
19.根据权利要求18所述的网络设备,其中,所述第二子获取装置包括: 提取装置,用于从所述第一请求中提取所述待处理的第一网页信息的链接地址; 第三子获取装置,用于根据所述链接地址,获取所述待处理的第一网页信息。 19. The network device according to claim 18, wherein the second sub-acquiring means comprising: extracting means for extracting a first link address information of the web page to be processed from said first request; first three sons acquisition means, according to the link address for obtaining a first web page of the information to be processed.
20.根据权利要求11或12所述的网络设备,其中,该网络设备包括:网络主机、单个网络服务器、多个网络服务器集或基于云计算的计算机集合。 20. The network device of claim 11 or claim 12, wherein, the network device comprising: a host network, a single network server, or a set of a plurality of network servers based on a set of computer-cloud.
CN201010569782.2A 2010-11-26 2010-11-26 Method and device for optimizing webpage in network equipment CN102035883B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010569782.2A CN102035883B (en) 2010-11-26 2010-11-26 Method and device for optimizing webpage in network equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010569782.2A CN102035883B (en) 2010-11-26 2010-11-26 Method and device for optimizing webpage in network equipment

Publications (2)

Publication Number Publication Date
CN102035883A CN102035883A (en) 2011-04-27
CN102035883B true CN102035883B (en) 2015-07-01

Family

ID=43888200

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010569782.2A CN102035883B (en) 2010-11-26 2010-11-26 Method and device for optimizing webpage in network equipment

Country Status (1)

Country Link
CN (1) CN102035883B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102420813B (en) * 2011-10-27 2015-02-18 北京百度网讯科技有限公司 Method and device for providing target information according to terminal attributes of user equipment
CN103377233A (en) * 2012-04-26 2013-10-30 腾讯科技(深圳)有限公司 Webpage sharing method and corresponding system
CN103425670B (en) * 2012-05-16 2018-11-13 百度在线网络技术(北京)有限公司 A method of providing content recommendation information to the user a method, apparatus and equipment
CN103838728B (en) * 2012-11-21 2018-01-09 腾讯科技(深圳)有限公司 Processing method and web browser information
CN103942231B (en) * 2013-01-18 2019-01-15 联想(北京)有限公司 A kind of display methods and electronic equipment of webpage
CN104239559A (en) * 2014-09-26 2014-12-24 北京金山安全软件有限公司 Webpage opening method and device
CN105677649B (en) * 2014-11-18 2019-04-23 中国移动通信集团公司 A kind of method and device of individualized webpage typesetting
CN104468740B (en) * 2014-11-21 2019-03-08 网宿科技股份有限公司 A kind of webpage transmission intelligent processing system and its method
CN104615686B (en) * 2015-01-22 2018-11-09 百度在线网络技术(北京)有限公司 Method and apparatus for searching
CN105989034A (en) * 2015-02-03 2016-10-05 阿里巴巴集团控股有限公司 Webpage display method and webpage display device
CN104809172B (en) * 2015-04-10 2019-02-12 百度在线网络技术(北京)有限公司 A kind of webpage representation method and device
CN104850595B (en) * 2015-04-27 2018-07-27 小米科技有限责任公司 Optimization method and apparatus for web opening time
CN105138698A (en) * 2015-09-25 2015-12-09 百度在线网络技术(北京)有限公司 Dynamic layout method and device for webpages
CN105893624A (en) * 2016-04-29 2016-08-24 珠海市魅族科技有限公司 Method and system for displaying data
CN106446156A (en) * 2016-09-22 2017-02-22 宇龙计算机通信科技(深圳)有限公司 Webpage data shielding method and system
CN106844731A (en) * 2017-02-10 2017-06-13 宇龙计算机通信科技(深圳)有限公司 Advertisement shielding method and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101246494A (en) * 2008-03-19 2008-08-20 腾讯科技(深圳)有限公司 Internet web page conversion method, system and equipment
CN101615193A (en) * 2009-07-07 2009-12-30 北京大学 Searching system based on encyclopedic data extracting integration
CN101702782A (en) * 2009-11-17 2010-05-05 广州杰赛科技股份有限公司 Digital television webpage monitoring server, system and method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4225703B2 (en) * 2001-04-27 2009-02-18 インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Maschines Corporation Information access method, information access system and program

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101246494A (en) * 2008-03-19 2008-08-20 腾讯科技(深圳)有限公司 Internet web page conversion method, system and equipment
CN101615193A (en) * 2009-07-07 2009-12-30 北京大学 Searching system based on encyclopedic data extracting integration
CN101702782A (en) * 2009-11-17 2010-05-05 广州杰赛科技股份有限公司 Digital television webpage monitoring server, system and method

Also Published As

Publication number Publication date
CN102035883A (en) 2011-04-27

Similar Documents

Publication Publication Date Title
AU2010315738B2 (en) Social browsing
KR100810010B1 (en) A method and system for improving presentation of html pages in web devices
US6920609B1 (en) Systems and methods for identifying and extracting data from HTML pages
CN102591942B (en) Method and device for automatic application recommendation
US20080256443A1 (en) System for aggregating and displaying syndicated news feeds
CN102792244B (en) To increase browsing speed preview
JP5346374B2 (en) Web page privacy risk protection method and system
JP5505671B2 (en) Update notification method and browser
US20120197855A1 (en) Method and Apparatus of Generating Internet Navigation Page
US7885986B2 (en) Enhanced browsing experience in social bookmarking based on self tags
US20140129541A1 (en) Configuring web crawler to extract web page information
CN102144243B (en) Content recommendations based on browsing information
US20100161631A1 (en) Techniques to share information about tags and documents across a computer network
CN102024028B (en) Method and equipment for distinctly displaying main contents of webpage on mobile terminal
US9547648B2 (en) Electronic document information extraction
CN108563750A (en) Surfacing applications based on browsing activity
CN104699782A (en) Dispersed type of web comments
US9971745B2 (en) Method and system for providing suggested tags associated with a target web page for manipulation by a user optimal rendering engine
CN101968802A (en) Method and equipment for recommending content of Internet based on user browse behavior
CN101777080A (en) User click data-based webpage analysis method
CN102298616B (en) Method and device for providing related sub links in search result
WO2013126084A2 (en) Graphical overlay related to data mining and analytics
CN101651707B (en) Method for automatically acquiring user behavior log of network
CN101609457A (en) Method and device for providing recommendatory configuration for start page
US20090100015A1 (en) Web-based workspace for enhancing internet search experience

Legal Events

Date Code Title Description
C06 Publication
C10 Request of examination as to substance
C14 Granted