CN101819584B - Lightweight intelligent webpage content analysis method - Google Patents
Lightweight intelligent webpage content analysis method
- Publication number
- CN101819584B
- Authority
- CN
- China
- Prior art keywords
- webpage
- content
- data
- analysis
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Concept ID | Concept | Type | Score | Sections | Count |
---|---|---|---|---|---|
238000004458 | analytical method | Methods | 0.000 | title, claims, abstract, description | 14 |
238000006243 | chemical reaction | Methods | 0.000 | claims | 1 |
238000013517 | stratification | Methods | 0.000 | claims | 1 |
238000013499 | data model | Methods | 0.000 | abstract, description | 7 |
230000002452 | interceptive effect | Effects | 0.000 | abstract, description | 2 |
238000000034 | method | Methods | 0.000 | abstract | 1 |
238000010586 | diagram | Methods | 0.000 | description | 1 |
230000001788 | irregular | Effects | 0.000 | description | 1 |
238000004321 | preservation | Methods | 0.000 | description | 1 |
239000013589 | supplement | Substances | 0.000 | description | 1 |
230000001960 | triggered effect | Effects | 0.000 | description | 1 |
Landscapes
- Information Transfer Between Computers (AREA)
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010126329.4A CN101819584B (zh) | 2010-03-18 | 2010-03-18 | Lightweight intelligent webpage content analysis method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010126329.4A CN101819584B (zh) | 2010-03-18 | 2010-03-18 | Lightweight intelligent webpage content analysis method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101819584A (zh) | 2010-09-01 |
CN101819584B (zh) | 2011-11-09 |
Family
ID=42654686
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201010126329.4A Active CN101819584B (zh) | Lightweight intelligent webpage content analysis method | 2010-03-18 | 2010-03-18 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101819584B (zh) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
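CN102254027B (zh) * | 2011-07-29 | 2013-05-08 | Sichuan Changhong Electric Co., Ltd. | Method for obtaining webpage content in batches |
CN102298637B (zh) * | 2011-08-31 | 2015-04-15 | Beijing Zhongsou Network Technology Co., Ltd. | Method and system for content publishing |
CN102314502B (zh) * | 2011-09-01 | 2017-03-01 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and device for displaying the main content of a webpage on a mobile terminal |
CN102831212B (zh) * | 2012-08-14 | 2015-08-26 | Youshi Technology Co., Ltd. (UCWeb) | Layout method and device for page display |
AU2015258733B2 (en) * | 2014-05-14 | 2020-03-12 | Pagecloud Inc. | Methods and systems for web content generation |
CN106202348A (zh) * | 2016-07-04 | 2016-12-07 | Sun Yat-sen University | Method for extracting webpage table information |
CN108762732B (zh) * | 2018-05-30 | 2019-06-11 | Nanjing Jiaodian Lingdong Cloud Computing Technology Co., Ltd. | Method for merging HTML inline CSS and inline JavaScript |
CN112528205B (zh) * | 2020-12-22 | 2021-10-29 | Big Data Research Institute, Institute of Computing Technology, Chinese Academy of Sciences | Method, device, and storage medium for extracting the main information of a webpage |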
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
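CN1959679A (zh) * | 2006-09-25 | 2007-05-09 | Beijing Aidixing Technology Co., Ltd. | Method for a system that extracts, aggregates, and automatically updates webpage micro-content |
CN101202748A (zh) * | 2007-11-27 | 2008-06-18 | Youshi Dongjing (Beijing) Technology Service Co., Ltd. | Method for browsing webpages with a micro-browser, and micro-browser |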
- 2010-03-18: CN application CN201010126329.4A (patent CN101819584B, zh), status: Active
Also Published As
Publication number | Publication date |
---|---|
CN101819584A (zh) | 2010-09-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
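CN101819584B (zh) | | Lightweight intelligent webpage content analysis method |
CN103631882B (zh) | | Semantic service generation system and method based on graph mining technology |
CN103365924B (zh) | | Method, device, and terminal for Internet information search |
CN103023714B (zh) | | System and method for analyzing activity and cluster structure based on network topics |
CN104881488A (zh) | | Configurable information extraction method based on relational tables |
CN102567494B (zh) | | Website classification method and device |
CN102163213B (zh) | | Voice browsing method and browser |
CN102063488A (zh) | | Semantics-based code search method |
CN102193798B (zh) | | Internet-based method for automatically acquiring OpenAPI |
CN102521232B (zh) | | Distributed system and method for collecting and processing Internet metadata |
CN106293675A (zh) | | Method and device for loading system static resources |
CN105468744A (zh) | | Big data platform for tax public opinion analysis and full-text retrieval |
CN101872350A (zh) | | Method and device for extracting webpage body text |
KR101801257B1 (ko) | | Text mining application technology for efficient construction document management |
CN103559234A (zh) | | Automated semantic annotation system and method for RESTful Web services |
CN106844782B (zh) | | Network-oriented multi-channel big data acquisition system and method |
CN106294885A (zh) | | Data collection and annotation method for heterogeneous webpages |
CN112287114A (zh) | | Knowledge graph service processing method and device |
CN105956932A (zh) | | Method and system for power distribution and utilization data fusion |
CN101763432A (zh) | | Method for rapidly constructing lightweight dynamic webpage views |
CN102156749B (zh) | | Method and system for automatic search and identification of map websites, and distributed server system thereof |
CN103853770A (zh) | | Method and system for extracting post content from forum webpages |
CN102831175A (zh) | | Method for constructing a cloud-platform-based Web service library for water conservancy services |
CN102486792A (zh) | | Method and system for reorganizing and displaying general forum pages |
CN101576933A (zh) | | Fully automatic web page grouping method based on title separators |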
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
C56 | Change in the name or address of the patentee | |
CP02 | Change in the address of a patent holder | Address after: 201203 Floor 2, Building 2, No. 1623 Cailun Road, Zhangjiang Hi-Tech Park, Shanghai; Patentee after: Shanghai Intple Information Technology Co.,Ltd.; Address before: 201203 Room 303, Building 2, No. 1690 Cailun Road, Pudong New Area, Shanghai; Patentee before: Shanghai Intple Information Technology Co.,Ltd. |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: Lightweight intelligent webpage content analysis method; Effective date of registration: 2012-08-15; Granted publication date: 2011-11-09; Pledgee: Bank of Communications Ltd. Shanghai New District Branch; Pledgor: Shanghai Intple Information Technology Co.,Ltd.; Registration number: 2012990000446 |
PC01 | Cancellation of the registration of the contract for pledge of patent right | Date of cancellation: 2013-11-19; Granted publication date: 2011-11-09; Pledgee: Bank of Communications Ltd. Shanghai New District Branch; Pledgor: Shanghai Intple Information Technology Co.,Ltd.; Registration number: 2012990000446 |
PLDC | Enforcement, change and cancellation of contracts on pledge of patent right or utility model | |
CP02 | Change in the address of a patent holder | Address after: Room 701, Building 2, No. 525 Xizang North Road, Jing'an District, Shanghai 200070; Patentee after: SHANGHAI INTPLE INFORMATION TECHNOLOGY Co.,Ltd.; Address before: 201203 Floor 2, Building 2, No. 1623 Cailun Road, Zhangjiang Hi-Tech Park, Shanghai; Patentee before: SHANGHAI INTPLE INFORMATION TECHNOLOGY Co.,Ltd. |
CP02 | Change in the address of a patent holder | |
TR01 | Transfer of patent right | Effective date of registration: 2024-05-31; Address after: Building 1, 3rd Floor, No. 37 Jiangjun Avenue, Jiangning District, Nanjing City, Jiangsu Province, 211106; Patentee after: JIANGSU YINPAO NETWORK TECHNOLOGY CO.,LTD.; Country or region after: China; Address before: Room 701, Building 2, No. 525 Xizang North Road, Jing'an District, Shanghai 200070; Patentee before: Shanghai Intple Information Technology Co.,Ltd.; Country or region before: China |
TR01 | Transfer of patent right | |