CN103384883A - Semantic enrichment by exploiting top-k processing - Google Patents
Semantic enrichment by exploiting top-k processing
- Publication number
- CN103384883A
- Authority
- CN
- China
- Prior art keywords
- concept
- key word
- content
- wikipedia
- key
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2465—Query processing support for facilitating data mining operations in structured databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/44—Browsing; Visualisation therefor
- G06F16/444—Spatial browsing, e.g. 2D maps, 3D or virtual spaces
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/48—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/48—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/487—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using geographical or spatial information, e.g. location
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/253—Grammatical analysis; Style critique
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
Abstract
Description
Claims (14)
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US35125210P | 2010-06-03 | 2010-06-03 | |
US61/351,252 | 2010-06-03 | ||
US39778010P | 2010-06-17 | 2010-06-17 | |
US61/397,780 | 2010-06-17 | ||
US45677410P | 2010-11-13 | 2010-11-13 | |
US61/456,774 | 2010-11-13 | ||
PCT/US2011/038991 WO2011153392A2 (en) | 2010-06-03 | 2011-06-03 | Semantic enrichment by exploiting top-k processing |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103384883A (zh) | 2013-11-06 |
CN103384883B (zh) | 2016-11-09 |
Family
ID=45067306
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201180038012.8A Expired - Fee Related CN103384883B (zh) | 2010-06-03 | 2011-06-03 | Semantic enrichment by exploiting top-k processing |
Country Status (6)
Country | Link |
---|---|
US (1) | US20130268261A1 (zh) |
EP (1) | EP2691845A4 (zh) |
JP (1) | JP5894149B2 (zh) |
KR (1) | KR101811468B1 (zh) |
CN (1) | CN103384883B (zh) |
WO (1) | WO2011153392A2 (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8903712B1 (en) * | 2011-09-27 | 2014-12-02 | Nuance Communications, Inc. | Call steering data tagging interface with automatic semantic clustering |
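CN102609449B (zh) * | 2012-01-06 | 2014-05-07 | 华中科技大学 | A method for constructing a concept-type knowledge map based on Wikipedia |
JP5936698B2 (ja) * | 2012-08-27 | 2016-06-22 | 株式会社日立製作所 | Word semantic relation extraction device |
CN103631823B (zh) | 2012-08-28 | 2017-01-18 | 腾讯科技(深圳)有限公司 | A media content recommendation method and device |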
US20140152891A1 (en) * | 2012-12-05 | 2014-06-05 | Silicon Image, Inc. | Method and Apparatus for Reducing Digital Video Image Data |
KR101616031B1 (ko) * | 2014-07-17 | 2016-04-28 | 동아대학교 산학협력단 | System and method for query translation in a cross-language retrieval engine using Wikipedia language resources and a parallel corpus |
WO2016170561A1 (en) * | 2015-04-24 | 2016-10-27 | Nec Corporation | An information processing system and an information processing method for semantic enrichment of text |
US10423891B2 (en) * | 2015-10-19 | 2019-09-24 | International Business Machines Corporation | System, method, and recording medium for vector representation of words in a language |
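CN105279264B (zh) * | 2015-10-26 | 2018-07-03 | 深圳市智搜信息技术有限公司 | A method for computing the semantic relatedness of documents |
KR102036314B1 (ko) * | 2017-12-29 | 2019-10-25 | (주)터보소프트 | Distributed-processing-based spatial web object search system and distributed-processing-based spatial web object search method using the same |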
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030217335A1 (en) * | 2002-05-17 | 2003-11-20 | Verity, Inc. | System and method for automatically discovering a hierarchy of concepts from a corpus of documents |
US20080109212A1 (en) * | 2006-11-07 | 2008-05-08 | Cycorp, Inc. | Semantics-based method and apparatus for document analysis |
CN101251841A (zh) * | 2007-05-17 | 2008-08-27 | 华东师范大学 | Method for building and retrieving a semantics-based feature matrix of Web documents |
CN101408894A (zh) * | 2007-10-12 | 2009-04-15 | 莱克西私人有限公司 | Improving search relevance using semantic keywords |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6978274B1 (en) * | 2001-08-31 | 2005-12-20 | Attenex Corporation | System and method for dynamically evaluating latent concepts in unstructured documents |
US6847966B1 (en) * | 2002-04-24 | 2005-01-25 | Engenium Corporation | Method and system for optimally searching a document database using a representative semantic space |
US7610313B2 (en) * | 2003-07-25 | 2009-10-27 | Attenex Corporation | System and method for performing efficient document scoring and clustering |
US8612208B2 (en) * | 2004-04-07 | 2013-12-17 | Oracle Otc Subsidiary Llc | Ontology for use with a system, method, and computer readable medium for retrieving information and response to a query |
US8140559B2 (en) * | 2005-06-27 | 2012-03-20 | Make Sence, Inc. | Knowledge correlation search engine |
US8898134B2 (en) * | 2005-06-27 | 2014-11-25 | Make Sence, Inc. | Method for ranking resources using node pool |
US20070106499A1 (en) * | 2005-08-09 | 2007-05-10 | Kathleen Dahlgren | Natural language search system |
US20080086490A1 (en) * | 2006-10-04 | 2008-04-10 | Sap Ag | Discovery of services matching a service request |
WO2009155281A1 (en) * | 2008-06-17 | 2009-12-23 | The Trustees Of Columbia University In The City Of New York | System and method for dynamically and interactively searching media data |
WO2010048172A1 (en) * | 2008-10-20 | 2010-04-29 | Cascaad Srl | Social graph based recommender |
US8751218B2 (en) * | 2010-02-09 | 2014-06-10 | Siemens Aktiengesellschaft | Indexing content at semantic level |
US8924391B2 (en) * | 2010-09-28 | 2014-12-30 | Microsoft Corporation | Text classification using concept kernel |
-
2011
- 2011-06-03 KR KR1020127034385A patent/KR101811468B1/ko active IP Right Grant
- 2011-06-03 EP EP11790440.9A patent/EP2691845A4/en not_active Withdrawn
- 2011-06-03 CN CN201180038012.8A patent/CN103384883B/zh not_active Expired - Fee Related
- 2011-06-03 US US13/701,347 patent/US20130268261A1/en not_active Abandoned
- 2011-06-03 WO PCT/US2011/038991 patent/WO2011153392A2/en active Application Filing
- 2011-06-03 JP JP2013513358A patent/JP5894149B2/ja not_active Expired - Fee Related
Non-Patent Citations (4)
Title |
---|
BENJAMIN ARAI ET AL: "Anytime Measures for Top-k Algorithms", 《THE 33RD INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES》 * |
EVGENIY GABRILOVICH ET AL: "Wikipedia-based semantic interpretation for natural language processing", 《JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH》 * |
IHAB F. ILYAS ET AL: "A survey of top-k query processing techniques in relational database systems", 《ACM COMPUTING SURVEYS》 * |
RAJASEKAR KRISHNAMURTHY ET AL: "Using structured queries for keyword information retrieval", 《IBM TECHNICAL REPORT》 * |
Also Published As
Publication number | Publication date |
---|---|
JP5894149B2 (ja) | 2016-03-23 |
WO2011153392A3 (en) | 2013-12-27 |
EP2691845A4 (en) | 2018-01-10 |
EP2691845A2 (en) | 2014-02-05 |
KR101811468B1 (ko) | 2017-12-21 |
CN103384883B (zh) | 2016-11-09 |
JP2014500528A (ja) | 2014-01-09 |
KR20130120381A (ko) | 2013-11-04 |
US20130268261A1 (en) | 2013-10-10 |
WO2011153392A2 (en) | 2011-12-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103384883A (zh) | Semantic enrichment by exploiting top-k processing | |
US8145648B2 (en) | Semantic metadata creation for videos | |
CN101267518B (zh) | Method and apparatus for extracting relevant information from content metadata | |
CN102265276B (zh) | Context-based recommender system | |
US20170366828A1 (en) | Processing and delivery of segmented video | |
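JP4328757B2 (ja) | Program selection device and method for controlling a program selection device | |
JP2021535458A (ja) | Methods and systems for creating structured data using machine-learning extracts and semantic graphs to facilitate search, recommendation, and discovery | |
CN101889281B (zh) | Content retrieval device and content retrieval method | |
CN110430476B (zh) | Live-streaming room search method, system, computer device, and storage medium | |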
US8478759B2 (en) | Information presentation apparatus and mobile terminal | |
US20120317136A1 (en) | Systems and methods for domain-specific tokenization | |
US20130291019A1 (en) | Self-learning methods, entity relations, remote control, and other features for real-time processing, storage, indexing, and delivery of segmented video | |
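CN102999498A (zh) | Method and device for retrieving multimedia programs | |
CN103052954A (zh) | Profile-based content retrieval for recommender systems | |
CN112507163B (zh) | Duration prediction model training method, recommendation method, apparatus, device, and medium | |
CN111368141B (zh) | Video tag expansion method, apparatus, computer device, and storage medium | |
KR20130083829A (ko) | Automatic image discovery and recommendation for displayed television content | |
WO2012079254A1 (zh) | Program recommendation device and program recommendation method | |
CN102084645A (zh) | Related scene assignment device and related scene assignment method | |
JP2002108892A (ja) | Data management system, data management method, and recording medium | |
CN109600646B (zh) | Voice positioning method and device, smart television, and storage medium | |
KR20110050823A (ko) | Apparatus and method for building a search database for generating a knowledge node connection structure | |
CN103559269B (zh) | A knowledge recommendation method for mobile news subscription | |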
Hölbling et al. | Content-based tag generation to enable a tag-based collaborative tv-recommendation system. | |
Jadhav et al. | Twitris: socially influenced browsing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20190212. Address after: Paris, France. Patentee after: InterDigital Madison Patent Holdings. Address before: Issy-les-Moulineaux, France. Patentee before: THOMSON LICENSING |
Effective date of registration: 20190212. Address after: Issy-les-Moulineaux, France. Patentee after: THOMSON LICENSING. Address before: Issy-les-Moulineaux, France. Patentee before: THOMSON LICENSING |
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20161109 Termination date: 20190603 |