JP5417471B2 - Structured document management apparatus and structured document search method - Google Patents

Structured document management apparatus and structured document search method

Info

Publication number
JP5417471B2
Authority
JP
Japan
Prior art keywords
headline
document
structured document
vocabulary
relevance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2012057240A
Other languages
English (en)
Japanese (ja)
Other versions
JP2013191046A (ja)
JP2013191046A5 (ja)
Inventor
智晴 國分
俊彦 真鍋
亘 仲野
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Toshiba Digital Solutions Corp
Original Assignee
Toshiba Corp
Toshiba Solutions Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp, Toshiba Solutions Corp
Priority to JP2012057240A: JP5417471B2 (ja)
Priority to PCT/JP2012/068505: WO2013136545A1 (ja)
Priority to CN2012800029691A: CN103415850A (zh)
Priority to US13/845,878: US20130268554A1 (en)
Publication of JP2013191046A (ja)
Publication of JP2013191046A5 (ja)
Application granted
Publication of JP5417471B2 (ja)
Legal status: Expired - Fee Related (current)
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2455 Query execution
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80 Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/83 Querying
    • G06F16/835 Query processing
    • G06F16/8373 Query execution

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
JP2012057240A 2012-03-14 2012-03-14 Structured document management apparatus and structured document search method Expired - Fee Related JP5417471B2 (ja)

Priority Applications (4)

Application Number Priority Date Filing Date Title
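JP2012057240A JP5417471B2 (ja) 2012-03-14 2012-03-14 Structured document management apparatus and structured document search method
PCT/JP2012/068505 WO2013136545A1 (ja) 2012-03-14 2012-07-20 Structured document management apparatus and structured document search method
CN2012800029691A CN103415850A (zh) 2012-03-14 2012-07-20 Structured document management apparatus and structured document search method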
US13/845,878 US20130268554A1 (en) 2012-03-14 2013-03-18 Structured document management apparatus and structured document search method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2012057240A JP5417471B2 (ja) 2012-03-14 2012-03-14 Structured document management apparatus and structured document search method

Publications (3)

Publication Number Publication Date
JP2013191046A (ja) 2013-09-26
JP2013191046A5 (ja) 2013-11-21
JP5417471B2 (ja) 2014-02-12

Family

ID=49160504

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2012057240A Expired - Fee Related JP5417471B2 (ja) Structured document management apparatus and structured document search method 2012-03-14 2012-03-14

Country Status (4)

Country Link
US (1) US20130268554A1 (en)
JP (1) JP5417471B2 (ja)
CN (1) CN103415850A (zh)
WO (1) WO2013136545A1 (ja)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10157175B2 (en) * 2013-03-15 2018-12-18 International Business Machines Corporation Business intelligence data models with concept identification using language-specific clues
US10698924B2 (en) 2014-05-22 2020-06-30 International Business Machines Corporation Generating partitioned hierarchical groups based on data sets for business intelligence data models
US10002179B2 (en) 2015-01-30 2018-06-19 International Business Machines Corporation Detection and creation of appropriate row concept during automated model generation
US9984116B2 (en) 2015-08-28 2018-05-29 International Business Machines Corporation Automated management of natural language queries in enterprise business intelligence analytics
CN105912585A (zh) * 2016-04-01 2016-08-31 乐视控股(北京)有限公司 Email search method and apparatus
CN106407330A (zh) * 2016-09-04 2017-02-15 乐视控股(北京)有限公司 Email display method and apparatus
US10657158B2 (en) * 2016-11-23 2020-05-19 Google Llc Template-based structured document classification and extraction
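CN107391535B (zh) * 2017-04-20 2021-01-12 创新先进技术有限公司 Method and apparatus for searching documents in a document application
JP6710007B1 (ja) * 2019-04-26 2020-06-17 Arithmer株式会社 Dialogue management server, dialogue management method, and program
CN110175322A (zh) * 2019-05-22 2019-08-27 北京神州泰岳软件股份有限公司 Document structuring method and apparatus
CN110688842B (zh) * 2019-10-14 2023-06-09 鼎富智能科技有限公司 Method, apparatus, and server for analyzing document heading levels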
US11663215B2 (en) 2020-08-12 2023-05-30 International Business Machines Corporation Selectively targeting content section for cognitive analytics and search
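CN113204579B (zh) * 2021-04-29 2024-06-07 北京金山数字娱乐科技有限公司 Content association method, system, apparatus, electronic device, and storage medium
CN113408660B (zh) * 2021-07-15 2024-05-24 北京百度网讯科技有限公司 Book clustering method, apparatus, device, and storage medium
CN116894176A (zh) * 2023-07-27 2023-10-17 国网江苏省电力有限公司经济技术研究院 Indicator extraction optimization method and system for power transmission and transformation engineering design documents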

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6385602B1 (en) * 1998-11-03 2002-05-07 E-Centives, Inc. Presentation of search results using dynamic categorization
US7587381B1 (en) * 2002-01-25 2009-09-08 Sphere Source, Inc. Method for extracting a compact representation of the topical content of an electronic text
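JP2003242175A (ja) * 2002-02-15 2003-08-29 Ricoh Co Ltd Document search system, document search method, program based on the method, and storage medium storing the program
JP3999093B2 (ja) * 2002-09-30 2007-10-31 株式会社東芝 Structured document search method and structured document search system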
US20060150076A1 (en) * 2004-12-30 2006-07-06 Microsoft Corporation Methods and apparatus for the evaluation of aspects of a web page
JP2006195667A (ja) * 2005-01-12 2006-07-27 Toshiba Corp Structured document search apparatus, structured document search method, and structured document search program
US7546294B2 (en) * 2005-03-31 2009-06-09 Microsoft Corporation Automated relevance tuning
US20070150473A1 (en) * 2005-12-22 2007-06-28 Microsoft Corporation Search By Document Type And Relevance
JP2007206822A (ja) * 2006-01-31 2007-08-16 Fuji Xerox Co Ltd Document management system, document disposal management system, document management method, and document disposal management method
US7779370B2 (en) * 2006-06-30 2010-08-17 Google Inc. User interface for mobile devices
JP2008146209A (ja) * 2006-12-07 2008-06-26 Just Syst Corp Document search apparatus, document search method, and document search program
US9218414B2 (en) * 2007-02-06 2015-12-22 Dmitri Soubbotin System, method, and user interface for a search engine based on multi-document summarization
US20090055386A1 (en) * 2007-08-24 2009-02-26 Boss Gregory J System and Method for Enhanced In-Document Searching for Text Applications in a Data Processing System
US8538989B1 (en) * 2008-02-08 2013-09-17 Google Inc. Assigning weights to parts of a document
JP5355949B2 (ja) * 2008-07-16 2013-11-27 株式会社東芝 Next search keyword presentation apparatus, next search keyword presentation method, and next search keyword presentation program
GB2472250A (en) * 2009-07-31 2011-02-02 Stephen Timothy Morris Method for determining document relevance
US8209361B2 (en) * 2010-01-19 2012-06-26 Oracle International Corporation Techniques for efficient and scalable processing of complex sets of XML schemas
US8140512B2 (en) * 2010-04-12 2012-03-20 Ancestry.Com Operations Inc. Consolidated information retrieval results
US8504567B2 (en) * 2010-08-23 2013-08-06 Yahoo! Inc. Automatically constructing titles

Also Published As

Publication number Publication date
US20130268554A1 (en) 2013-10-10
JP2013191046A (ja) 2013-09-26
CN103415850A (zh) 2013-11-27
WO2013136545A1 (ja) 2013-09-19

Similar Documents

Publication Publication Date Title
JP5417471B2 (ja) Structured document management apparatus and structured document search method
US10810237B2 (en) Search query generation using query segments and semantic suggestions
US9910932B2 (en) System and method for completing a user query and for providing a query response
US20120290561A1 (en) Information processing apparatus, information processing method, program, and information processing system
EP3345118B1 (en) Identifying query patterns and associated aggregate statistics among search queries
US20080294619A1 (en) System and method for automatic generation of search suggestions based on recent operator behavior
US20120330968A1 (en) System and method for matching comment data to text data
US8527507B2 (en) Custom ranking model schema
US11347815B2 (en) Method and system for generating an offline search engine result page
CN105431844A (zh) Third-party search applications for search systems
CN104838414A (zh) Custom dictionaries for e-books
US9129024B2 (en) Graphical user interface in keyword search
US20120109932A1 (en) Related links
CN104462030B (zh) Character conversion apparatus and character conversion method
US10078686B2 (en) Combination filter for search query suggestions
US20150339387A1 (en) Method of and system for furnishing a user of a client device with a network resource
US20170193119A1 (en) Add-On Module Search System
US9648130B1 (en) Finding users in a social network based on document content
US9773035B1 (en) System and method for an annotation search index
WO2013015811A1 (en) Search query generation using query segments and semantic suggestions
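CN116049238B (zh) Node information query method, apparatus, device, medium, and program product
JP5285491B2 (ja) Information retrieval system, method, and program, index creation system, method, and program,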
US10496711B2 (en) Method of and system for processing a prefix associated with a search query
JP2019113937A (ja) Search support system, search support method, and search support program
JP5104329B2 (ja) Document search system

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20131008

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20131022

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20131118

S531 Written request for registration of change of domicile

Free format text: JAPANESE INTERMEDIATE CODE: R313531

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350

LAPS Cancellation because of no payment of annual fees