NZ599047A - Document analysis and association system and method

Document analysis and association system and method

Info

Publication number
NZ599047A
Authority
NZ
New Zealand
Prior art keywords
text
global
document
local
documents
Application number
NZ599047A
Other languages
English (en)
Inventor
Hamish Ogilvy
Owen James Prime
Phillip Anthony Burns
Original Assignee
Sajari Pty Ltd
Priority date
2009-09-26
Filing date
2010-09-24
Publication date
2013-02-22
Application filed by Sajari Pty Ltd
Publication of NZ599047A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G06F16/313 Selection or weighting of terms for indexing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Quality & Reliability (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Operations Research (AREA)
  • Economics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
NZ599047A 2009-09-26 2010-09-24 Document analysis and association system and method NZ599047A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US24612109P 2009-09-26 2009-09-26
PCT/AU2010/001259 WO2011035389A1 (en) 2009-09-26 2010-09-24 Document analysis and association system and method

Publications (1)

Publication Number Publication Date
NZ599047A (en) 2013-02-22

Family

ID=43795233

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
NZ599047A NZ599047A (en) 2009-09-26 2010-09-24 Document analysis and association system and method

Country Status (8)

Country Link
US (1) US8666994B2
EP (1) EP2480987A4
CN (1) CN102597991A
AU (1) AU2010300096B2
BR (1) BR112012006743A2
CA (1) CA2775368A1
NZ (1) NZ599047A
WO (1) WO2011035389A1

Families Citing this family (80)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9183535B2 (en) * 2008-07-30 2015-11-10 Aro, Inc. Social network model for semantic processing
US9069862B1 (en) 2010-10-14 2015-06-30 Aro, Inc. Object-based relationship search using a plurality of sub-queries
US8706717B2 (en) 2009-11-13 2014-04-22 Oracle International Corporation Method and system for enterprise search navigation
US20120076414A1 (en) * 2010-09-27 2012-03-29 Microsoft Corporation External Image Based Summarization Techniques
US8429099B1 (en) * 2010-10-14 2013-04-23 Aro, Inc. Dynamic gazetteers for entity recognition and fact association
EP2635965A4 (en) * 2010-11-05 2016-08-10 Rakuten Inc SYSTEMS AND METHODS RELATING TO KEYWORD EXTRACTION
US9251508B2 (en) 2010-12-09 2016-02-02 At&T Intellectual Property I, L.P. Intelligent message processing
US20120150862A1 (en) * 2010-12-13 2012-06-14 Xerox Corporation System and method for augmenting an index entry with related words in a document and searching an index for related keywords
US20120271844A1 (en) * 2011-04-20 2012-10-25 Microsoft Corporation Providng relevant information for a term in a user message
CN102810096B (zh) * 2011-06-02 2016-03-16 阿里巴巴集团控股有限公司 A retrieval method and apparatus based on a single-character index system
US8676795B1 (en) * 2011-08-04 2014-03-18 Amazon Technologies, Inc. Dynamic visual representation of phrases
US9442928B2 (en) * 2011-09-07 2016-09-13 Venio Inc. System, method and computer program product for automatic topic identification using a hypertext corpus
US9442930B2 (en) * 2011-09-07 2016-09-13 Venio Inc. System, method and computer program product for automatic topic identification using a hypertext corpus
US9223769B2 (en) 2011-09-21 2015-12-29 Roman Tsibulevskiy Data processing systems, devices, and methods for content analysis
US8782058B2 (en) * 2011-10-12 2014-07-15 Desire2Learn Incorporated Search index dictionary
US20130191365A1 (en) * 2012-01-19 2013-07-25 Mauritius H.P.M. van Putten Method to search objectively for maximal information
US9547679B2 (en) * 2012-03-29 2017-01-17 Spotify Ab Demographic and media preference prediction using media content data analysis
US9406072B2 (en) 2012-03-29 2016-08-02 Spotify Ab Demographic and media preference prediction using media content data analysis
CN103684816B (zh) 2012-09-04 2017-12-22 华为技术有限公司 Resource information display method and apparatus
ITTO20120867A1 (it) * 2012-10-05 2014-04-06 Rai Radiotelevisione Italiana Method and system for recommending multimedia content on a multimedia platform
WO2014100202A1 (en) 2012-12-18 2014-06-26 Lexisnexis, A Division Of Reed Elsevier Inc. Systems and methods for patent-related document analysis and searching
US11232137B2 (en) 2012-12-18 2022-01-25 RELX Inc. Methods for evaluating term support in patent-related documents
US8949228B2 (en) * 2013-01-15 2015-02-03 Google Inc. Identification of new sources for topics
US9317609B2 (en) * 2013-03-14 2016-04-19 FortyTwo, Inc. Semantic vector in a method and apparatus for keeping and finding information
US9465789B1 (en) 2013-03-27 2016-10-11 Google Inc. Apparatus and method for detecting spam
WO2014166540A1 (en) * 2013-04-11 2014-10-16 Longsand Limited Sentiment feedback
US9898523B2 (en) 2013-04-22 2018-02-20 Abb Research Ltd. Tabular data parsing in document(s)
US9251146B2 (en) * 2013-05-10 2016-02-02 International Business Machines Corporation Altering relevancy of a document and/or a search query
US20150242927A1 (en) * 2013-10-03 2015-08-27 Jason Will Method and system of an online travel website
WO2015117074A1 (en) * 2014-01-31 2015-08-06 Global Security Information Analysts, LLC Document relationship analysis system
JP5602980B1 (ja) * 2014-02-28 2014-10-08 楽天株式会社 Information processing system, information processing method, and information processing program
US10963924B1 (en) 2014-03-10 2021-03-30 A9.Com, Inc. Media processing techniques for enhancing content
US9679050B2 (en) * 2014-04-30 2017-06-13 Adobe Systems Incorporated Method and apparatus for generating thumbnails
WO2016028770A1 (en) 2014-08-18 2016-02-25 HavenLock Inc. Improved locking apparatus, locking member, and method of use
TWI526856B (zh) * 2014-10-22 2016-03-21 財團法人資訊工業策進會 Service requirement analysis system, method, and computer-readable recording medium
US10372718B2 (en) 2014-11-03 2019-08-06 SavantX, Inc. Systems and methods for enterprise data search and analysis
US10915543B2 (en) 2014-11-03 2021-02-09 SavantX, Inc. Systems and methods for enterprise data search and analysis
KR101668725B1 (ko) * 2015-03-18 2016-10-24 성균관대학교산학협력단 Method and apparatus for generating latent keywords
CN104809106A (zh) * 2015-05-15 2015-07-29 合肥汇众知识产权管理有限公司 A mining system and mining method for patent schemes
US10565198B2 (en) 2015-06-23 2020-02-18 Microsoft Technology Licensing, Llc Bit vector search index using shards
US11281639B2 (en) 2015-06-23 2022-03-22 Microsoft Technology Licensing, Llc Match fix-up to remove matching documents
US10733164B2 (en) * 2015-06-23 2020-08-04 Microsoft Technology Licensing, Llc Updating a bit vector search index
US10229143B2 (en) 2015-06-23 2019-03-12 Microsoft Technology Licensing, Llc Storage and retrieval of data from a bit vector search index
US11392568B2 (en) 2015-06-23 2022-07-19 Microsoft Technology Licensing, Llc Reducing matching documents for a search query
US10467215B2 (en) 2015-06-23 2019-11-05 Microsoft Technology Licensing, Llc Matching documents using a bit vector search index
US10242071B2 (en) 2015-06-23 2019-03-26 Microsoft Technology Licensing, Llc Preliminary ranker for scoring matching documents
US10402400B2 (en) 2015-06-25 2019-09-03 International Business Machines Corporation Distributed processing of a search query with distributed posting lists
US11392582B2 (en) * 2015-10-15 2022-07-19 Sumo Logic, Inc. Automatic partitioning
US20170116194A1 (en) 2015-10-23 2017-04-27 International Business Machines Corporation Ingestion planning for complex tables
US9798823B2 (en) 2015-11-17 2017-10-24 Spotify Ab System, methods and computer products for determining affinity to a content creator
US20170147652A1 (en) * 2015-11-19 2017-05-25 Institute For Information Industry Search servers, end devices, and search methods for use in a distributed network
US10628466B2 (en) * 2016-01-06 2020-04-21 Quest Software Inc. Smart exchange database index
US20170192854A1 (en) * 2016-01-06 2017-07-06 Dell Software, Inc. Email recovery via emulation and indexing
US10839149B2 (en) 2016-02-01 2020-11-17 Microsoft Technology Licensing, Llc. Generating templates from user's past documents
US9922022B2 (en) * 2016-02-01 2018-03-20 Microsoft Technology Licensing, Llc. Automatic template generation based on previous documents
US10354066B2 (en) 2016-02-26 2019-07-16 Cylance Inc. Retention and accessibility of data characterizing events on an endpoint computer
US11347777B2 (en) * 2016-05-12 2022-05-31 International Business Machines Corporation Identifying key words within a plurality of documents
US10866992B2 (en) * 2016-05-14 2020-12-15 Gratiana Denisa Pol System and methods for identifying, aggregating, and visualizing tested variables and causal relationships from scientific research
US10621237B1 (en) * 2016-08-01 2020-04-14 Amazon Technologies, Inc. Contextual overlay for documents
CN107798637A (zh) * 2016-08-30 2018-03-13 北京国双科技有限公司 Method and apparatus for obtaining documents with different judgments in similar cases
US10691507B2 (en) * 2016-12-09 2020-06-23 Fujitsu Limited API learning
US10699012B2 (en) 2017-01-11 2020-06-30 Cylance Inc. Endpoint detection and response utilizing machine learning
US10528668B2 (en) * 2017-02-28 2020-01-07 SavantX, Inc. System and method for analysis and navigation of data
US11328128B2 (en) 2017-02-28 2022-05-10 SavantX, Inc. System and method for analysis and navigation of data
EP3616115B1 (en) * 2017-04-26 2023-12-06 Cylance Inc. Endpoint detection and response system event characterization data transfer
US11651333B2 (en) * 2017-05-05 2023-05-16 Microsoft Technology Licensing, Llc Specialized user interfaces and processes for increasing user interactions with job postings in a social network/top jobs
RU2652461C1 (ru) 2017-05-30 2018-04-26 Общество с ограниченной ответственностью "Аби Девелопмент" Differential classification using multiple neural networks
CN107315830A (zh) * 2017-07-10 2017-11-03 深圳市视维科技股份有限公司 A method and system for intelligently analyzing documents
US11669574B2 (en) * 2017-08-01 2023-06-06 Informatica Llc Method, apparatus, and computer-readable medium for determining a data domain associated with data
US10885121B2 (en) * 2017-12-13 2021-01-05 International Business Machines Corporation Fast filtering for similarity searches on indexed data
CN110209663B (zh) * 2018-02-14 2023-06-20 阿里巴巴集团控股有限公司 Method, apparatus, and storage medium for determining search scope
US10997225B2 (en) * 2018-03-20 2021-05-04 The Boeing Company Predictive query processing for complex system lifecycle management
EP3874383A1 (en) 2018-11-01 2021-09-08 rewardStyle, Inc. System and method for improved searching across multiple databases
US11144579B2 (en) * 2019-02-11 2021-10-12 International Business Machines Corporation Use of machine learning to characterize reference relationship applied over a citation graph
US11537581B2 (en) * 2019-03-22 2022-12-27 Hewlett Packard Enterprise Development Lp Co-parent keys for document information trees
US11314534B2 (en) * 2020-01-30 2022-04-26 Accenture Global Solutions Limited System and method for interactively guiding users through a procedure
US11405338B2 (en) * 2020-12-10 2022-08-02 Capital One Services, Llc Virtual-assistant-based resolution of user inquiries via failure-triggered document presentation
IT202100001133A1 (it) * 2021-01-22 2022-07-22 Aptus Ai S R L Autonomous method and system for managing and updating regulatory digital text documents
US20220398660A1 (en) * 2021-06-10 2022-12-15 SRAX, Inc. System and method for computational shelf forecasting
US11874880B2 (en) 2022-02-09 2024-01-16 My Job Matcher, Inc. Apparatuses and methods for classifying a user to a posting

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4839853A (en) 1988-09-15 1989-06-13 Bell Communications Research, Inc. Computer information retrieval using latent semantic structure
US5826261A (en) * 1996-05-10 1998-10-20 Spencer; Graham System and method for querying multiple, distributed databases by selective sharing of local relative significance information for terms related to the query
US6012053A (en) * 1997-06-23 2000-01-04 Lycos, Inc. Computer system with user-controlled relevance ranking of search results
US6163782A (en) * 1997-11-19 2000-12-19 At&T Corp. Efficient and effective distributed information management
US6490575B1 (en) * 1999-12-06 2002-12-03 International Business Machines Corporation Distributed network search engine
US7113943B2 (en) * 2000-12-06 2006-09-26 Content Analyst Company, Llc Method for document comparison and selection
US6978274B1 (en) * 2001-08-31 2005-12-20 Attenex Corporation System and method for dynamically evaluating latent concepts in unstructured documents
US6880002B2 (en) 2001-09-05 2005-04-12 Surgient, Inc. Virtualized logical server cloud providing non-deterministic allocation of logical attributes of logical servers to physical resources
US7137062B2 (en) 2001-12-28 2006-11-14 International Business Machines Corporation System and method for hierarchical segmentation with latent semantic indexing in scale space
GB0200980D0 (en) * 2002-01-15 2002-03-06 Ibm Method and apparatus for classification
US6847966B1 (en) * 2002-04-24 2005-01-25 Engenium Corporation Method and system for optimally searching a document database using a representative semantic space
US7324988B2 (en) * 2003-07-07 2008-01-29 International Business Machines Corporation Method of generating a distributed text index for parallel query processing
US7440964B2 (en) * 2003-08-29 2008-10-21 Vortaloptics, Inc. Method, device and software for querying and presenting search results
US7437353B2 (en) * 2003-12-31 2008-10-14 Google Inc. Systems and methods for unification of search results
US7599914B2 (en) 2004-07-26 2009-10-06 Google Inc. Phrase-based searching in an information retrieval system
US20060047441A1 (en) * 2004-08-31 2006-03-02 Ramin Homayouni Semantic gene organizer
US7433869B2 (en) 2005-07-01 2008-10-07 Ebrary, Inc. Method and apparatus for document clustering and document sketching
US20070150492A1 (en) * 2005-12-27 2007-06-28 Hitachi, Ltd. Method and system for allocating file in clustered file system
US8554758B1 (en) * 2005-12-29 2013-10-08 Amazon Technologies, Inc. Method and apparatus for monitoring and maintaining health in a searchable data service
US7860853B2 (en) 2007-02-14 2010-12-28 Provilla, Inc. Document matching engine using asymmetric signature generation
CN100517330C (zh) * 2007-06-06 2009-07-22 华东师范大学 A semantics-based local document retrieval method
US8027977B2 (en) 2007-06-20 2011-09-27 Microsoft Corporation Recommending content using discriminatively trained document similarity
US20100169339A1 (en) * 2008-12-30 2010-07-01 Yahoo! Inc., A Delaware Corporation System, method, or apparatus for updating stored search result values
US8266135B2 (en) * 2009-01-05 2012-09-11 International Business Machines Corporation Indexing for regular expressions in text-centric applications

Also Published As

Publication number Publication date
BR112012006743A2 (pt) 2019-09-24
EP2480987A4 (en) 2013-09-25
US8666994B2 (en) 2014-03-04
EP2480987A1 (en) 2012-08-01
US20120278341A1 (en) 2012-11-01
CA2775368A1 (en) 2011-03-31
AU2010300096B2 (en) 2012-10-04
AU2010300096A1 (en) 2012-04-19
WO2011035389A1 (en) 2011-03-31
CN102597991A (zh) 2012-07-18

Similar Documents

Publication Publication Date Title
NZ599047A (en) Document analysis and association system and method
AR107503A2 (es) Method for predicting the presence of at least one sought trait in a plant, method for selecting at least one plant, and method for extracting a related data set
WO2013163644A3 (en) Updating a search index used to facilitate application searches
EP2537106A4 (en) SYSTEM AND METHOD FOR ATTENTION GROUPING AND ANALYTICAL PROCEDURES AND VIEWS RELATING THERETO
WO2006072027A3 (en) System and method for retrieving information from citation-rich documents
CA2677307A1 (en) Searching structured geographical data
WO2010141799A3 (en) Feature engineering and user behavior analysis
WO2013188504A3 (en) Multilingual mixed search method and system
IN2014DN00244A (xx)
EP2427834A4 (en) METHOD AND SYSTEM FOR SEARCH ENGINE INDICATION AND SEARCH ENGINE WITH THE RELATED INDEX
MX2012010272A (es) Bottom-up optimized search system and method.
IN2013MU02064A (xx)
MX2012011904A (es) System and method for identifying subjects from free-format data sources.
WO2011011063A3 (en) Method and system for document indexing and data querying
WO2011088521A3 (en) Improved searching using semantic keys
WO2013002940A3 (en) Method and apparatus for creating a search index for a composite document and searching same
WO2010137814A3 (en) Method of providing by-viewpoint patent map and system thereof
WO2008152614A3 (en) Local news search engine
GB2489863A (en) Indexing documents
WO2013181151A3 (en) System and method for automated analysis comparing a wireless device location with another geographic location
WO2006107347A3 (en) System and method for grouping a collection of documents using document series
Fujimura et al. Reference guide for management of adult idiopathic thrombocytopenic purpura (ITP) 2012 version
WO2014081824A3 (en) Search engine results
GB2522369A (en) System, method and interface for providing a search result using segment constraints
Gleasure What Is a ‘Wicked Problem’for IS Research?

Legal Events

Date Code Title Description
ASS Change of ownership
Owner name: SAJARI PTY LTD, AU
Effective date: 20130221

PSEA Patent sealed
LAPS Patent lapsed