CN100447805C - 文档处理装置和文档处理方法 - Google Patents
文档处理装置和文档处理方法 Download PDFInfo
- Publication number
- CN100447805C CN100447805C CNB2005100559257A CN200510055925A CN100447805C CN 100447805 C CN100447805 C CN 100447805C CN B2005100559257 A CNB2005100559257 A CN B2005100559257A CN 200510055925 A CN200510055925 A CN 200510055925A CN 100447805 C CN100447805 C CN 100447805C
- Authority
- CN
- China
- Prior art keywords
- document
- data
- string
- title
- character string
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Character Input (AREA)
Abstract
Description
Claims (8)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004271734 | 2004-09-17 | ||
JP2004271734A JP2006085582A (ja) | 2004-09-17 | 2004-09-17 | 文書処理装置およびプログラム |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1750018A CN1750018A (zh) | 2006-03-22 |
CN100447805C true CN100447805C (zh) | 2008-12-31 |
Family
ID=36074077
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2005100559257A Expired - Fee Related CN100447805C (zh) | 2004-09-17 | 2005-03-18 | 文档处理装置和文档处理方法 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20060062492A1 (zh) |
JP (1) | JP2006085582A (zh) |
CN (1) | CN100447805C (zh) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101226596B (zh) | 2007-01-15 | 2012-02-01 | 夏普株式会社 | 文档图像处理装置以及文档图像处理方法 |
CN101226595B (zh) | 2007-01-15 | 2012-05-23 | 夏普株式会社 | 文档图像处理装置以及文档图像处理方法 |
CN101354703B (zh) * | 2007-07-23 | 2010-11-17 | 夏普株式会社 | 文档图像处理装置和文档图像处理方法 |
JP2009169536A (ja) * | 2008-01-11 | 2009-07-30 | Ricoh Co Ltd | 情報処理装置、画像形成装置、ドキュメント生成方法、ドキュメント生成プログラム |
US8504567B2 (en) * | 2010-08-23 | 2013-08-06 | Yahoo! Inc. | Automatically constructing titles |
US9082037B2 (en) * | 2013-05-22 | 2015-07-14 | Xerox Corporation | Method and system for automatically determining the issuing state of a license plate |
US10176500B1 (en) * | 2013-05-29 | 2019-01-08 | A9.Com, Inc. | Content classification based on data recognition |
CN104463155B (zh) * | 2013-09-18 | 2018-05-11 | 株式会社东芝 | 文件管理装置以及文件管理方法 |
JP6050843B2 (ja) | 2015-01-30 | 2016-12-21 | 株式会社Pfu | 情報処理装置、方法およびプログラム |
US10572528B2 (en) | 2016-08-11 | 2020-02-25 | International Business Machines Corporation | System and method for automatic detection and clustering of articles using multimedia information |
US20200026767A1 (en) * | 2018-07-17 | 2020-01-23 | Fuji Xerox Co., Ltd. | System and method for generating titles for summarizing conversational documents |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH10214194A (ja) * | 1997-01-29 | 1998-08-11 | Nec Corp | クラス定義取り込み方式 |
JPH11282844A (ja) * | 1998-03-26 | 1999-10-15 | Toshiba Corp | 文書作成方法および情報処理装置および記録媒体 |
JP2000123022A (ja) * | 1998-10-13 | 2000-04-28 | Ricoh Co Ltd | 文縮約方法、文書縮約装置及び文書抄録装置 |
JP2000137728A (ja) * | 1998-11-02 | 2000-05-16 | Fujitsu Ltd | 文書解析装置及びプログラム記録媒体 |
JP2004151882A (ja) * | 2002-10-29 | 2004-05-27 | Fuji Xerox Co Ltd | 情報出力制御方法、情報出力処理システム、プログラム |
JP2004199529A (ja) * | 2002-12-20 | 2004-07-15 | Fujitsu Ltd | 帳票認識装置および帳票認識方法 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5635272A (en) * | 1995-07-03 | 1997-06-03 | The United States Of America As Represented By The Secretary Of The Army | Composite structure for transmitting high shear loads |
JP3425834B2 (ja) * | 1995-09-06 | 2003-07-14 | 富士通株式会社 | 文書画像からのタイトル抽出装置および方法 |
US5776582A (en) * | 1996-08-05 | 1998-07-07 | Polyplus, Inc. | Load-bearing structures with interlockable edges |
US6327387B1 (en) * | 1996-12-27 | 2001-12-04 | Fujitsu Limited | Apparatus and method for extracting management information from image |
US5892843A (en) * | 1997-01-21 | 1999-04-06 | Matsushita Electric Industrial Co., Ltd. | Title, caption and photo extraction from scanned document images |
US7099507B2 (en) * | 1998-11-05 | 2006-08-29 | Ricoh Company, Ltd | Method and system for extracting title from document image |
WO2000052645A1 (fr) * | 1999-03-01 | 2000-09-08 | Matsushita Electric Industrial Co., Ltd. | Dispositif de traitement d'image document, procede d'extraction de titre de document et procede d'information d'etiquetage de document |
JP3913985B2 (ja) * | 1999-04-14 | 2007-05-09 | 富士通株式会社 | 文書画像中の基本成分に基づく文字列抽出装置および方法 |
-
2004
- 2004-09-17 JP JP2004271734A patent/JP2006085582A/ja active Pending
-
2005
- 2005-03-16 US US11/080,924 patent/US20060062492A1/en not_active Abandoned
- 2005-03-18 CN CNB2005100559257A patent/CN100447805C/zh not_active Expired - Fee Related
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH10214194A (ja) * | 1997-01-29 | 1998-08-11 | Nec Corp | クラス定義取り込み方式 |
JPH11282844A (ja) * | 1998-03-26 | 1999-10-15 | Toshiba Corp | 文書作成方法および情報処理装置および記録媒体 |
JP2000123022A (ja) * | 1998-10-13 | 2000-04-28 | Ricoh Co Ltd | 文縮約方法、文書縮約装置及び文書抄録装置 |
JP2000137728A (ja) * | 1998-11-02 | 2000-05-16 | Fujitsu Ltd | 文書解析装置及びプログラム記録媒体 |
JP2004151882A (ja) * | 2002-10-29 | 2004-05-27 | Fuji Xerox Co Ltd | 情報出力制御方法、情報出力処理システム、プログラム |
JP2004199529A (ja) * | 2002-12-20 | 2004-07-15 | Fujitsu Ltd | 帳票認識装置および帳票認識方法 |
Also Published As
Publication number | Publication date |
---|---|
CN1750018A (zh) | 2006-03-22 |
US20060062492A1 (en) | 2006-03-23 |
JP2006085582A (ja) | 2006-03-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100447805C (zh) | 文档处理装置和文档处理方法 | |
CN100361493C (zh) | 文档处理装置和文档处理方法 | |
US8139870B2 (en) | Image processing apparatus, recording medium, computer data signal, and image processing method | |
US8645184B2 (en) | Future technology projection supporting apparatus, method, program and method for providing a future technology projection supporting service | |
CN102959578B (zh) | 取证系统、取证方法及取证程序 | |
CN101276372A (zh) | 信息搜索装置及方法 | |
CN101645086B (zh) | 检索方法 | |
US20080162602A1 (en) | Document archiving system | |
US10078672B2 (en) | Search device, search method, and computer program product | |
CN101432733A (zh) | 利用来自搜索的所检索数据来增加电子文档的内容 | |
US20100005058A1 (en) | Computer product, information retrieving apparatus, and information retrieving method | |
CN102624770B (zh) | 信息摘录方法及基于云计算的摘录信息网络存储管理系统 | |
CN102737030A (zh) | 专利文档的数据输出方法、终端及系统 | |
US20070185832A1 (en) | Managing tasks for multiple file types | |
JP2010262638A (ja) | 代表者の信頼度を用いた検索結果順位化装置および方法 | |
US20040010556A1 (en) | Electronic document information expansion apparatus, electronic document information expansion method , electronic document information expansion program, and recording medium which records electronic document information expansion program | |
JP4135659B2 (ja) | フォーマット変換装置およびファイル検索装置 | |
US11468126B2 (en) | Method for collecting component model in component e-commerce platform | |
CN112000257A (zh) | 一种文档重点内容的导出方法及装置 | |
CN114495138A (zh) | 一种智能文档识别与特征提取方法、装置平台和存储介质 | |
JP2002024761A (ja) | 画像処理装置及び画像処理方法並びに記憶媒体 | |
JP2000020549A (ja) | 文書データベースシステムへの入力支援装置 | |
JP5618968B2 (ja) | 類似ページ検出装置、類似ページ検出方法、類似ページ検出プログラム | |
CN112905733A (zh) | 一种基于ocr识别技术的图书保存方法、系统及装置 | |
CN111160870A (zh) | 一种专利文件生成方法、装置、系统和存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CI01 | Publication of corrected invention patent application |
Correction item: Inventor (sixth inventor) Correct: Yi Tengdu False: Yi Tengdu Number: 11 Volume: 22 |
|
CI02 | Correction of invention patent application |
Correction item: Inventor (sixth inventor) Correct: Yi Tengdu False: Yi Tengdu Number: 11 Volume: 22 |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20081231 Termination date: 20170318 |
|
CF01 | Termination of patent right due to non-payment of annual fee |