CN102782682A - 语义对象表征和搜索 - Google Patents
语义对象表征和搜索 Download PDFInfo
- Publication number
- CN102782682A CN102782682A CN2011800117556A CN201180011755A CN102782682A CN 102782682 A CN102782682 A CN 102782682A CN 2011800117556 A CN2011800117556 A CN 2011800117556A CN 201180011755 A CN201180011755 A CN 201180011755A CN 102782682 A CN102782682 A CN 102782682A
- Authority
- CN
- China
- Prior art keywords
- expression
- semantic object
- scale
- hash code
- view
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012512 characterization method Methods 0.000 title abstract description 9
- 230000014509 gene expression Effects 0.000 claims description 187
- 238000006243 chemical reaction Methods 0.000 claims description 97
- 239000013598 vector Substances 0.000 claims description 85
- 238000000034 method Methods 0.000 claims description 69
- 230000008569 process Effects 0.000 claims description 60
- 230000009471 action Effects 0.000 claims description 31
- 230000008878 coupling Effects 0.000 claims description 16
- 238000010168 coupling process Methods 0.000 claims description 16
- 238000005859 coupling reaction Methods 0.000 claims description 16
- 238000000605 extraction Methods 0.000 claims description 14
- 238000005516 engineering process Methods 0.000 claims description 7
- 230000009466 transformation Effects 0.000 claims description 5
- 238000010219 correlation analysis Methods 0.000 claims description 4
- 230000006870 function Effects 0.000 description 35
- 238000013515 script Methods 0.000 description 16
- 238000010586 diagram Methods 0.000 description 11
- 239000000284 extract Substances 0.000 description 5
- 238000005457 optimization Methods 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 230000003252 repetitive effect Effects 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000002349 favourable effect Effects 0.000 description 3
- 238000012549 training Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000000513 principal component analysis Methods 0.000 description 2
- 102100029469 WD repeat and HMG-box DNA-binding protein 1 Human genes 0.000 description 1
- 101710097421 WD repeat and HMG-box DNA-binding protein 1 Proteins 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (15)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/715,174 US8543598B2 (en) | 2010-03-01 | 2010-03-01 | Semantic object characterization and search |
US12/715,174 | 2010-03-01 | ||
PCT/US2011/026358 WO2011109251A2 (en) | 2010-03-01 | 2011-02-25 | Semantic object characterization and search |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102782682A true CN102782682A (zh) | 2012-11-14 |
CN102782682B CN102782682B (zh) | 2015-07-29 |
Family
ID=44505854
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201180011755.6A Active CN102782682B (zh) | 2010-03-01 | 2011-02-25 | 语义对象表征和搜索 |
Country Status (7)
Country | Link |
---|---|
US (1) | US8543598B2 (zh) |
EP (1) | EP2542988B1 (zh) |
JP (1) | JP5661813B2 (zh) |
CN (1) | CN102782682B (zh) |
CA (1) | CA2788670C (zh) |
HK (1) | HK1178277A1 (zh) |
WO (1) | WO2011109251A2 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103092828A (zh) * | 2013-02-06 | 2013-05-08 | 杭州电子科技大学 | 基于语义分析和语义关系网络的文本相似度度量方法 |
CN105229639B (zh) * | 2013-03-13 | 2016-09-21 | 脸谱公司 | 短词散列 |
CN113544659A (zh) * | 2019-03-06 | 2021-10-22 | 三星电子株式会社 | 基于散列的有效用户建模 |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8463053B1 (en) | 2008-08-08 | 2013-06-11 | The Research Foundation Of State University Of New York | Enhanced max margin learning on multimodal data mining in a multimedia database |
US8463797B2 (en) * | 2010-07-20 | 2013-06-11 | Barracuda Networks Inc. | Method for measuring similarity of diverse binary objects comprising bit patterns |
DE102011101154A1 (de) * | 2011-05-11 | 2012-11-15 | Abb Technology Ag | Verfahren und Einrichtung zur einheitlichen Benennung von gleichen Parametern unterschiedlicher Feldgeräte eines Automatisierungssystems |
US9489636B2 (en) * | 2012-04-18 | 2016-11-08 | Tagasauris, Inc. | Task-agnostic integration of human and machine intelligence |
WO2014050952A1 (ja) * | 2012-09-27 | 2014-04-03 | 日本電気株式会社 | バイナリデータ変換方法と装置及びプログラム |
JP6487449B2 (ja) | 2013-09-13 | 2019-03-20 | フィッシュバーグ、キース | アメニティ、特別サービスおよびフード/飲み物の検索並びに購入予約システム |
US20150278774A1 (en) * | 2014-03-31 | 2015-10-01 | Bank Of America Corporation | Techniques for hash indexing |
US9805099B2 (en) | 2014-10-30 | 2017-10-31 | The Johns Hopkins University | Apparatus and method for efficient identification of code similarity |
JP6712796B2 (ja) * | 2015-11-10 | 2020-06-24 | 国立大学法人 東京大学 | 画像を媒介した異言語文書間の学習法及び装置、言語横断文書検索方法及び装置 |
US11169964B2 (en) * | 2015-12-11 | 2021-11-09 | Hewlett Packard Enterprise Development Lp | Hash suppression |
CN105843960B (zh) * | 2016-04-18 | 2019-12-06 | 上海泥娃通信科技有限公司 | 基于语义树的索引方法和系统 |
US10642881B2 (en) * | 2016-06-30 | 2020-05-05 | Intel Corporation | System architecture for universal emotive autography |
CN106502996A (zh) * | 2016-12-13 | 2017-03-15 | 深圳爱拼信息科技有限公司 | 一种基于语义匹配的裁判文书检索方法和服务器 |
US10817774B2 (en) * | 2016-12-30 | 2020-10-27 | Facebook, Inc. | Systems and methods for providing content |
IL258689A (en) | 2018-04-12 | 2018-05-31 | Browarnik Abel | A system and method for computerized semantic indexing and searching |
US11175930B1 (en) * | 2018-04-27 | 2021-11-16 | Intuit, Inc. | Deducing a requirement to present optional data entry fields based on examining past user input records |
US11403327B2 (en) * | 2019-02-20 | 2022-08-02 | International Business Machines Corporation | Mixed initiative feature engineering |
US11238103B2 (en) | 2019-09-13 | 2022-02-01 | Ebay Inc. | Binary coding for improved semantic search |
CN110674719B (zh) * | 2019-09-18 | 2022-07-26 | 北京市商汤科技开发有限公司 | 目标对象匹配方法及装置、电子设备和存储介质 |
US11163805B2 (en) | 2019-11-25 | 2021-11-02 | The Nielsen Company (Us), Llc | Methods, systems, articles of manufacture, and apparatus to map client specifications with standardized characteristics |
US12130864B2 (en) * | 2020-08-07 | 2024-10-29 | International Business Machines Corporation | Discrete representation learning |
US20230367974A1 (en) * | 2022-05-16 | 2023-11-16 | Microsoft Technology Licensing, Llc | Cross-orthography fuzzy string comparisons |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050216253A1 (en) * | 2004-03-25 | 2005-09-29 | Microsoft Corporation | System and method for reverse transliteration using statistical alignment |
US20080147215A1 (en) * | 2006-12-13 | 2008-06-19 | Samsung Electronics Co., Ltd. | Music recommendation method with respect to message service |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997008604A2 (en) * | 1995-08-16 | 1997-03-06 | Syracuse University | Multilingual document retrieval system and method using semantic vector matching |
US6002997A (en) * | 1996-06-21 | 1999-12-14 | Tou; Julius T. | Method for translating cultural subtleties in machine translation |
CA2310321C (en) * | 1997-11-17 | 2004-11-16 | Telcordia Technologies, Inc. | Method and system for determining approximate hamming distance and approximate nearest neighbors in an electronic storage device |
GB2337611A (en) * | 1998-05-20 | 1999-11-24 | Sharp Kk | Multilingual document retrieval system |
US6311183B1 (en) * | 1998-08-07 | 2001-10-30 | The United States Of America As Represented By The Director Of National Security Agency | Method for finding large numbers of keywords in continuous text streams |
US6154747A (en) * | 1998-08-26 | 2000-11-28 | Hunt; Rolf G. | Hash table implementation of an object repository |
US6466901B1 (en) * | 1998-11-30 | 2002-10-15 | Apple Computer, Inc. | Multi-language document search and retrieval system |
JP3685660B2 (ja) * | 1999-09-13 | 2005-08-24 | 沖電気工業株式会社 | 対訳情報収集装置 |
US6701014B1 (en) * | 2000-06-14 | 2004-03-02 | International Business Machines Corporation | Method and apparatus for matching slides in video |
US6999916B2 (en) * | 2001-04-20 | 2006-02-14 | Wordsniffer, Inc. | Method and apparatus for integrated, user-directed web site text translation |
WO2006087854A1 (ja) * | 2004-11-25 | 2006-08-24 | Sharp Kabushiki Kaisha | 情報分類装置、情報分類方法、情報分類プログラム、情報分類システム |
US9418139B2 (en) * | 2005-01-04 | 2016-08-16 | Thomson Reuters Global Resources | Systems, methods, software, and interfaces for multilingual information retrieval |
US8041557B2 (en) * | 2005-02-24 | 2011-10-18 | Fuji Xerox Co., Ltd. | Word translation device, translation method, and computer readable medium |
CN100474301C (zh) * | 2005-09-08 | 2009-04-01 | 富士通株式会社 | 基于数据挖掘获取词或词组单元译文信息的系统和方法 |
US20070185868A1 (en) * | 2006-02-08 | 2007-08-09 | Roth Mary A | Method and apparatus for semantic search of schema repositories |
US8010534B2 (en) * | 2006-08-31 | 2011-08-30 | Orcatec Llc | Identifying related objects using quantum clustering |
KR20090024460A (ko) | 2007-09-04 | 2009-03-09 | 엘지전자 주식회사 | 다국어 호환이 가능한 정보 검색 장치 및 방법 |
EP2570945A1 (en) * | 2007-09-21 | 2013-03-20 | Google Inc. | Cross-language search |
US7917488B2 (en) * | 2008-03-03 | 2011-03-29 | Microsoft Corporation | Cross-lingual search re-ranking |
BRPI0910412B1 (pt) | 2008-03-26 | 2019-08-06 | Zonit Structured Solutions, Llc | Equipamento e método de distribuição de energia |
KR101116581B1 (ko) | 2008-06-30 | 2012-03-15 | 주식회사 한글과컴퓨터 | 다국어 독음 검색 장치 |
-
2010
- 2010-03-01 US US12/715,174 patent/US8543598B2/en active Active
-
2011
- 2011-02-25 CN CN201180011755.6A patent/CN102782682B/zh active Active
- 2011-02-25 JP JP2012556120A patent/JP5661813B2/ja active Active
- 2011-02-25 EP EP11751118.8A patent/EP2542988B1/en active Active
- 2011-02-25 CA CA2788670A patent/CA2788670C/en active Active
- 2011-02-25 WO PCT/US2011/026358 patent/WO2011109251A2/en active Application Filing
-
2013
- 2013-04-26 HK HK13105103.8A patent/HK1178277A1/zh not_active IP Right Cessation
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050216253A1 (en) * | 2004-03-25 | 2005-09-29 | Microsoft Corporation | System and method for reverse transliteration using statistical alignment |
US20080147215A1 (en) * | 2006-12-13 | 2008-06-19 | Samsung Electronics Co., Ltd. | Music recommendation method with respect to message service |
Non-Patent Citations (2)
Title |
---|
HAIZHOU LI ETC.: "Semantic Transliteration of Personal Names", 《PROCEEDINGS OF THE 45TH ANNUAL MEETING OF THE ASSOCIATION OF COMPUTATIONAL LINGUISTICE》 * |
JONATHAN D. COHEN: "HARDWARE-ASSISTED ALGORITHM FOR FULL-TEXT LARGE-DICTIONARY STRING MATCHING USING N-GRAM HASHING", 《INFORMATION PROCESSING & MANAGEMENT》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103092828A (zh) * | 2013-02-06 | 2013-05-08 | 杭州电子科技大学 | 基于语义分析和语义关系网络的文本相似度度量方法 |
CN103092828B (zh) * | 2013-02-06 | 2015-08-12 | 杭州电子科技大学 | 基于语义分析和语义关系网络的文本相似度度量方法 |
CN105229639B (zh) * | 2013-03-13 | 2016-09-21 | 脸谱公司 | 短词散列 |
US10318652B2 (en) | 2013-03-13 | 2019-06-11 | Facebook, Inc. | Short-term hashes |
CN113544659A (zh) * | 2019-03-06 | 2021-10-22 | 三星电子株式会社 | 基于散列的有效用户建模 |
Also Published As
Publication number | Publication date |
---|---|
US8543598B2 (en) | 2013-09-24 |
EP2542988A2 (en) | 2013-01-09 |
US20110213784A1 (en) | 2011-09-01 |
JP5661813B2 (ja) | 2015-01-28 |
WO2011109251A2 (en) | 2011-09-09 |
EP2542988A4 (en) | 2017-01-04 |
WO2011109251A3 (en) | 2011-12-29 |
CN102782682B (zh) | 2015-07-29 |
CA2788670C (en) | 2017-02-14 |
JP2013521574A (ja) | 2013-06-10 |
CA2788670A1 (en) | 2011-09-09 |
HK1178277A1 (zh) | 2013-09-06 |
EP2542988B1 (en) | 2018-06-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102782682A (zh) | 语义对象表征和搜索 | |
US10289952B2 (en) | Semantic frame identification with distributed word representations | |
CN101454750B (zh) | 命名实体的消歧 | |
US8332205B2 (en) | Mining transliterations for out-of-vocabulary query terms | |
US20160321321A1 (en) | Deep structured semantic model produced using click-through data | |
Sarawagi et al. | Open-domain quantity queries on web tables: annotation, response, and consensus models | |
US20060282455A1 (en) | System and method for ranking web content | |
CN102663129A (zh) | 医疗领域深度问答方法及医学检索系统 | |
CN105988990A (zh) | 用于汉语中的零指代消解的装置和方法以及模型训练方法 | |
CN102349072A (zh) | 识别查询方面 | |
CN101449271A (zh) | 通过搜索进行注释 | |
CN104484380A (zh) | 个性化搜索方法及装置 | |
Wang et al. | DM_NLP at semeval-2018 task 12: A pipeline system for toponym resolution | |
CN111611452A (zh) | 搜索文本的歧义识别方法、系统、设备及存储介质 | |
CN112015907A (zh) | 一种学科知识图谱快速构建方法、装置及存储介质 | |
Burns et al. | Profiling of intertextuality in Latin literature using word embeddings | |
Chua et al. | Eff2Match results for OAEI 2010 | |
Lin et al. | Automatic tagging web services using machine learning techniques | |
Kumar et al. | Constructing knowledge graph from unstructured text | |
JP5812534B2 (ja) | 質問応答装置、方法、及びプログラム | |
Gupta et al. | Text analysis and information retrieval of text data | |
Jiomekong et al. | Towards an Approach Based on Knowledge Graph Refinement for Tabular Data to Knowledge Graph Matching. | |
CN116821292A (zh) | 一种知识库问答中基于抽象语义表示的实体和关系链接方法 | |
Lee et al. | Trustsql: A reliability benchmark for text-to-sql models with diverse unanswerable questions | |
CN111460808A (zh) | 同义文本识别及内容推荐方法、装置及电子设备 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1178277 Country of ref document: HK |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: MICROSOFT TECHNOLOGY LICENSING LLC Free format text: FORMER OWNER: MICROSOFT CORP. Effective date: 20150728 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20150728 Address after: Washington State Patentee after: Micro soft technique license Co., Ltd Address before: Washington State Patentee before: Microsoft Corp. |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1178277 Country of ref document: HK |