JP2006351002A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2006351002A5 JP2006351002A5 JP2006132999A JP2006132999A JP2006351002A5 JP 2006351002 A5 JP2006351002 A5 JP 2006351002A5 JP 2006132999 A JP2006132999 A JP 2006132999A JP 2006132999 A JP2006132999 A JP 2006132999A JP 2006351002 A5 JP2006351002 A5 JP 2006351002A5
- Authority
- JP
- Japan
- Prior art keywords
- document
- schema
- verification
- structured
- electronic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000012795 verification Methods 0.000 claims 95
- 238000000034 method Methods 0.000 claims 11
- 239000000284 extract Substances 0.000 claims 5
- 239000012634 fragment Substances 0.000 claims 5
- 230000006870 function Effects 0.000 claims 4
- 238000007781 pre-processing Methods 0.000 claims 3
- 238000005070 sampling Methods 0.000 claims 3
- 230000010365 information processing Effects 0.000 claims 1
- 238000010606 normalization Methods 0.000 claims 1
- 239000002245 particle Substances 0.000 claims 1
- 230000011218 segmentation Effects 0.000 claims 1
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2006132999A JP2006351002A (ja) | 2005-05-17 | 2006-05-11 | 文書検証装置、文書検証方法およびプログラム |
| US11/434,957 US8112816B2 (en) | 2005-05-17 | 2006-05-17 | Document verification apparatus and document verification method |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2005144611 | 2005-05-17 | ||
| JP2006132999A JP2006351002A (ja) | 2005-05-17 | 2006-05-11 | 文書検証装置、文書検証方法およびプログラム |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2006351002A JP2006351002A (ja) | 2006-12-28 |
| JP2006351002A5 true JP2006351002A5 (https=) | 2009-06-18 |
Family
ID=37449649
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2006132999A Withdrawn JP2006351002A (ja) | 2005-05-17 | 2006-05-11 | 文書検証装置、文書検証方法およびプログラム |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US8112816B2 (https=) |
| JP (1) | JP2006351002A (https=) |
Families Citing this family (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2006157399A (ja) * | 2004-11-29 | 2006-06-15 | Hitachi Ltd | 電子署名付き電子文書交換支援方法及び情報処理装置 |
| US8082493B2 (en) * | 2006-04-10 | 2011-12-20 | Oracle International Corporation | Streaming XML patch |
| US8429526B2 (en) * | 2006-04-10 | 2013-04-23 | Oracle International Corporation | Efficient evaluation for diff of XML documents |
| JP4659721B2 (ja) * | 2006-11-09 | 2011-03-30 | キヤノン株式会社 | コンテンツ編集装置及びコンテンツ検証装置 |
| US20080281863A1 (en) * | 2007-05-10 | 2008-11-13 | Hewlett-Packard Development Company, L.P. | Repository system and method |
| US7865822B2 (en) * | 2007-06-18 | 2011-01-04 | Intel Corporation | Method and apparatus for parallel validation of documents |
| US8554800B2 (en) * | 2008-07-30 | 2013-10-08 | Portool Ltd. | System, methods and applications for structured document indexing |
| JP5982308B2 (ja) * | 2013-03-14 | 2016-08-31 | 株式会社エヌ・ティ・ティ・データ | 判定装置、判定方法、判定プログラム |
| US9519805B2 (en) * | 2013-08-01 | 2016-12-13 | Cellco Partnership | Digest obfuscation for data cryptography |
| US20150121351A1 (en) * | 2013-10-31 | 2015-04-30 | Alan Cabrera | Generating configuration data based on application definitions |
| US10831991B1 (en) | 2015-06-02 | 2020-11-10 | United Service Automobile Association (USAA) | Systems and methods for testing content developed for access via a network |
| US11121905B2 (en) | 2019-08-15 | 2021-09-14 | Forcepoint Llc | Managing data schema differences by path deterministic finite automata |
| WO2023049288A2 (en) * | 2021-09-24 | 2023-03-30 | DocMagic, Inc. | Enabling electronic loan documents |
| JP7454154B1 (ja) | 2023-11-30 | 2024-03-22 | ビヨンドブロックチェーン株式会社 | サービス同一性検査装置および方法 |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AUPQ849500A0 (en) * | 2000-06-30 | 2000-07-27 | Canon Kabushiki Kaisha | Hash compact xml parser |
| US8555261B1 (en) * | 2001-06-28 | 2013-10-08 | Microsoft Corporation | Object-oriented pull model XML parser |
| WO2003009277A2 (en) * | 2001-07-20 | 2003-01-30 | Gracenote, Inc. | Automatic identification of sound recordings |
| JP3972323B2 (ja) | 2001-09-04 | 2007-09-05 | インターナショナル・ビジネス・マシーンズ・コーポレーション | スキーマ生成装置、データ処理装置及びその方法並びにプログラム |
| JP2003084987A (ja) | 2001-09-11 | 2003-03-20 | Internatl Business Mach Corp <Ibm> | Xml文書の妥当性を検証するためのオートマトンの生成方法、xml文書の妥当性検証方法、xml文書の妥当性を検証するためのオートマトンの生成システム、xml文書の妥当性検証システムおよびプログラム |
| JP2003150586A (ja) | 2001-11-12 | 2003-05-23 | Ntt Docomo Inc | 文書変換システム、文書変換方法及び文書変換プログラムを記録したコンピュータ読み取り可能な記録媒体 |
| KR100472458B1 (ko) | 2002-06-26 | 2005-03-10 | 삼성전자주식회사 | 외부 xml유효성 검증 장치를 이용하는 xml파싱 장치및 방법 |
| US7356616B2 (en) * | 2002-11-06 | 2008-04-08 | Microsoft Corporation | Maintaining structured time data for electronic messages |
| US20050149729A1 (en) * | 2003-12-24 | 2005-07-07 | Zimmer Vincent J. | Method to support XML-based security and key management services in a pre-boot execution environment |
-
2006
- 2006-05-11 JP JP2006132999A patent/JP2006351002A/ja not_active Withdrawn
- 2006-05-17 US US11/434,957 patent/US8112816B2/en not_active Expired - Fee Related
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN100576201C (zh) | 用于从自然语言文本开发本体的方法和电子数据处理系统 | |
| CN101290624B (zh) | 一种新闻网页元数据自动抽取方法 | |
| Peters et al. | Content extraction using diverse feature sets | |
| CN103389895B (zh) | 一种前端页面的生成方法及系统 | |
| JP6203374B2 (ja) | ウェブページ・スタイルアドレスの統合 | |
| US9361317B2 (en) | Method for entity enrichment of digital content to enable advanced search functionality in content management systems | |
| CN102662966B (zh) | 一种面向主题的获取动态页面内容的方法及系统 | |
| JP2006351002A5 (https=) | ||
| CN105022803B (zh) | 一种提取网页正文内容的方法及系统 | |
| WO2023155303A1 (zh) | 网页数据的提取方法和装置、计算机设备、存储介质 | |
| CN104965901A (zh) | 一种目标页面内容抓取方法和装置 | |
| CA2517189A1 (en) | Web content adaption process and system | |
| CN102682098A (zh) | 检测网页内容变更的方法及装置 | |
| CN101872350A (zh) | 网页正文抽取方法和装置 | |
| US20140016814A1 (en) | Hierarchical and index based watermarks represented as trees | |
| JP5527845B2 (ja) | 文書情報の文章的特徴及び外形的特徴に基づく文書分類プログラム、サーバ及び方法 | |
| CN103838796A (zh) | 一种网页结构化信息抽取方法 | |
| US9449114B2 (en) | Removing non-substantive content from a web page by removing its text-sparse nodes and removing high-frequency sentences of its text-dense nodes using sentence hash value frequency across a web page collection | |
| CN112818279A (zh) | 网页相似度的确定方法及确定装置、计算机可读存储介质 | |
| CN101763432A (zh) | 一种轻量级网页动态视图快速构建方法 | |
| CN102004805B (zh) | 基于最大相似性匹配的网页去噪系统及其去噪方法 | |
| CN114398138A (zh) | 界面生成方法、装置、计算机设备和存储介质 | |
| CN105183730B (zh) | 网页信息的处理方法和装置 | |
| CN108121743A (zh) | 一种通用网页模版的生成和使用方法、系统 | |
| Lin et al. | Combining a segmentation-like approach and a density-based approach in content extraction |