JP2012527701A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2012527701A5 JP2012527701A5 JP2012511920A JP2012511920A JP2012527701A5 JP 2012527701 A5 JP2012527701 A5 JP 2012527701A5 JP 2012511920 A JP2012511920 A JP 2012511920A JP 2012511920 A JP2012511920 A JP 2012511920A JP 2012527701 A5 JP2012527701 A5 JP 2012527701A5
- Authority
- JP
- Japan
- Prior art keywords
- result
- translation model
- query
- queries
- items
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 claims 17
- 238000004590 computer program Methods 0.000 claims 2
- 238000000605 extraction Methods 0.000 claims 1
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12/470,492 | 2009-05-22 | ||
| US12/470,492 US20100299132A1 (en) | 2009-05-22 | 2009-05-22 | Mining phrase pairs from an unstructured resource |
| PCT/US2010/035033 WO2010135204A2 (en) | 2009-05-22 | 2010-05-14 | Mining phrase pairs from an unstructured resource |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2012527701A JP2012527701A (ja) | 2012-11-08 |
| JP2012527701A5 true JP2012527701A5 (enExample) | 2013-06-27 |
| JP5479581B2 JP5479581B2 (ja) | 2014-04-23 |
Family
ID=43125158
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2012511920A Expired - Fee Related JP5479581B2 (ja) | 2009-05-22 | 2010-05-14 | 構造化されていないリソースからの句対のマイニング |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US20100299132A1 (enExample) |
| EP (1) | EP2433230A4 (enExample) |
| JP (1) | JP5479581B2 (enExample) |
| KR (1) | KR101683324B1 (enExample) |
| CN (1) | CN102439596B (enExample) |
| BR (1) | BRPI1011214A2 (enExample) |
| CA (1) | CA2758632C (enExample) |
| WO (1) | WO2010135204A2 (enExample) |
Families Citing this family (47)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20110015921A1 (en) * | 2009-07-17 | 2011-01-20 | Minerva Advisory Services, Llc | System and method for using lingual hierarchy, connotation and weight of authority |
| US8861844B2 (en) | 2010-03-29 | 2014-10-14 | Ebay Inc. | Pre-computing digests for image similarity searching of image-based listings in a network-based publication system |
| US9792638B2 (en) | 2010-03-29 | 2017-10-17 | Ebay Inc. | Using silhouette images to reduce product selection error in an e-commerce environment |
| US8412594B2 (en) | 2010-08-28 | 2013-04-02 | Ebay Inc. | Multilevel silhouettes in an online shopping environment |
| US9064004B2 (en) * | 2011-03-04 | 2015-06-23 | Microsoft Technology Licensing, Llc | Extensible surface for consuming information extraction services |
| CN102789461A (zh) * | 2011-05-19 | 2012-11-21 | 富士通株式会社 | 多语词典构建装置和多语词典构建方法 |
| US8909516B2 (en) * | 2011-10-27 | 2014-12-09 | Microsoft Corporation | Functionality for normalizing linguistic items |
| US8914371B2 (en) | 2011-12-13 | 2014-12-16 | International Business Machines Corporation | Event mining in social networks |
| KR101359718B1 (ko) * | 2012-05-17 | 2014-02-13 | 포항공과대학교 산학협력단 | 대화 관리 시스템 및 방법 |
| CN102779186B (zh) * | 2012-06-29 | 2014-12-24 | 浙江大学 | 一种非结构化数据管理的全过程建模方法 |
| US9183197B2 (en) | 2012-12-14 | 2015-11-10 | Microsoft Technology Licensing, Llc | Language processing resources for automated mobile language translation |
| CN105144200A (zh) * | 2013-04-27 | 2015-12-09 | 数据飞讯公司 | 用于处理非结构化数字的基于内容的检索引擎 |
| US20140350931A1 (en) * | 2013-05-24 | 2014-11-27 | Microsoft Corporation | Language model trained using predicted queries from statistical machine translation |
| WO2015094288A1 (en) * | 2013-12-19 | 2015-06-25 | Intel Corporation | Method and apparatus for communicating between companion devices |
| US9881006B2 (en) * | 2014-02-28 | 2018-01-30 | Paypal, Inc. | Methods for automatic generation of parallel corpora |
| US9740687B2 (en) | 2014-06-11 | 2017-08-22 | Facebook, Inc. | Classifying languages for objects and entities |
| US20160012124A1 (en) * | 2014-07-10 | 2016-01-14 | Jean-David Ruvini | Methods for automatic query translation |
| CN104462229A (zh) * | 2014-11-13 | 2015-03-25 | 苏州大学 | 一种事件分类方法及装置 |
| US9864744B2 (en) * | 2014-12-03 | 2018-01-09 | Facebook, Inc. | Mining multi-lingual data |
| US9830404B2 (en) | 2014-12-30 | 2017-11-28 | Facebook, Inc. | Analyzing language dependency structures |
| US10067936B2 (en) | 2014-12-30 | 2018-09-04 | Facebook, Inc. | Machine translation output reranking |
| US9830386B2 (en) | 2014-12-30 | 2017-11-28 | Facebook, Inc. | Determining trending topics in social media |
| US9477652B2 (en) | 2015-02-13 | 2016-10-25 | Facebook, Inc. | Machine learning dialect identification |
| US20160350289A1 (en) * | 2015-06-01 | 2016-12-01 | Linkedln Corporation | Mining parallel data from user profiles |
| US20170024701A1 (en) * | 2015-07-23 | 2017-01-26 | Linkedin Corporation | Providing recommendations based on job change indications |
| US9734142B2 (en) | 2015-09-22 | 2017-08-15 | Facebook, Inc. | Universal translation |
| US10586168B2 (en) | 2015-10-08 | 2020-03-10 | Facebook, Inc. | Deep translations |
| US9990361B2 (en) * | 2015-10-08 | 2018-06-05 | Facebook, Inc. | Language independent representations |
| US9747281B2 (en) | 2015-12-07 | 2017-08-29 | Linkedin Corporation | Generating multi-language social network user profiles by translation |
| US10133738B2 (en) | 2015-12-14 | 2018-11-20 | Facebook, Inc. | Translation confidence scores |
| US9734143B2 (en) | 2015-12-17 | 2017-08-15 | Facebook, Inc. | Multi-media context language processing |
| US9747283B2 (en) | 2015-12-28 | 2017-08-29 | Facebook, Inc. | Predicting future translations |
| US9805029B2 (en) | 2015-12-28 | 2017-10-31 | Facebook, Inc. | Predicting future translations |
| US10002125B2 (en) | 2015-12-28 | 2018-06-19 | Facebook, Inc. | Language model personalization |
| US10902215B1 (en) | 2016-06-30 | 2021-01-26 | Facebook, Inc. | Social hash for language models |
| US10902221B1 (en) | 2016-06-30 | 2021-01-26 | Facebook, Inc. | Social hash for language models |
| CN106960041A (zh) * | 2017-03-28 | 2017-07-18 | 山西同方知网数字出版技术有限公司 | 一种基于非平衡数据的知识结构化方法 |
| US10380249B2 (en) | 2017-10-02 | 2019-08-13 | Facebook, Inc. | Predicting future trending topics |
| KR102100951B1 (ko) * | 2017-11-16 | 2020-04-14 | 주식회사 마인즈랩 | 기계 독해를 위한 질의응답 데이터 생성 시스템 |
| CN110110078B (zh) * | 2018-01-11 | 2024-04-30 | 北京搜狗科技发展有限公司 | 数据处理方法和装置、用于数据处理的装置 |
| CN110472251B (zh) * | 2018-05-10 | 2023-05-30 | 腾讯科技(深圳)有限公司 | 翻译模型训练的方法、语句翻译的方法、设备及存储介质 |
| CN109033303B (zh) * | 2018-07-17 | 2021-07-02 | 东南大学 | 一种基于约简锚点的大规模知识图谱融合方法 |
| EP3895069A4 (en) * | 2018-12-12 | 2022-07-27 | Microsoft Technology Licensing, LLC | Automatically generating training data sets for object recognition |
| US11664010B2 (en) | 2020-11-03 | 2023-05-30 | Florida Power & Light Company | Natural language domain corpus data set creation based on enhanced root utterances |
| CN113010643B (zh) * | 2021-03-22 | 2023-07-21 | 平安科技(深圳)有限公司 | 佛学领域词汇的处理方法、装置、设备及存储介质 |
| US11656881B2 (en) | 2021-10-21 | 2023-05-23 | Abbyy Development Inc. | Detecting repetitive patterns of user interface actions |
| US20250077800A1 (en) * | 2023-09-06 | 2025-03-06 | 7299362 Canada Inc. (O/A Alexa Translations) | System and method for rule-based language translation |
Family Cites Families (58)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6006221A (en) * | 1995-08-16 | 1999-12-21 | Syracuse University | Multilingual document retrieval system and method using semantic vector matching |
| JP3614618B2 (ja) * | 1996-07-05 | 2005-01-26 | 株式会社日立製作所 | 文献検索支援方法及び装置およびこれを用いた文献検索サービス |
| US6076051A (en) * | 1997-03-07 | 2000-06-13 | Microsoft Corporation | Information retrieval utilizing semantic representation of text |
| US6266642B1 (en) * | 1999-01-29 | 2001-07-24 | Sony Corporation | Method and portable apparatus for performing spoken language translation |
| US6243669B1 (en) * | 1999-01-29 | 2001-06-05 | Sony Corporation | Method and apparatus for providing syntactic analysis and data structure for translation knowledge in example-based language translation |
| US6442524B1 (en) * | 1999-01-29 | 2002-08-27 | Sony Corporation | Analyzing inflectional morphology in a spoken language translation system |
| US6924828B1 (en) * | 1999-04-27 | 2005-08-02 | Surfnotes | Method and apparatus for improved information representation |
| JP2001043236A (ja) * | 1999-07-30 | 2001-02-16 | Matsushita Electric Ind Co Ltd | 類似語抽出方法、文書検索方法及びこれらに用いる装置 |
| US6757646B2 (en) * | 2000-03-22 | 2004-06-29 | Insightful Corporation | Extended functionality for an inverse inference engine based web search |
| US20070027672A1 (en) * | 2000-07-31 | 2007-02-01 | Michel Decary | Computer method and apparatus for extracting data from web pages |
| US7478047B2 (en) * | 2000-11-03 | 2009-01-13 | Zoesis, Inc. | Interactive character system |
| JP2002245070A (ja) * | 2001-02-20 | 2002-08-30 | Hitachi Ltd | データ表示方法及び装置並びにその処理プログラムを記憶した媒体 |
| US7711547B2 (en) * | 2001-03-16 | 2010-05-04 | Meaningful Machines, L.L.C. | Word association method and apparatus |
| US7191115B2 (en) * | 2001-06-20 | 2007-03-13 | Microsoft Corporation | Statistical method and apparatus for learning translation relationships among words |
| WO2003005235A1 (en) * | 2001-07-04 | 2003-01-16 | Cogisum Intermedia Ag | Category based, extensible and interactive system for document retrieval |
| WO2004001623A2 (en) * | 2002-03-26 | 2003-12-31 | University Of Southern California | Constructing a translation lexicon from comparable, non-parallel corpora |
| WO2003105023A2 (en) * | 2002-03-26 | 2003-12-18 | University Of Southern California | Statistical translation using a large monolingual corpus |
| US7031911B2 (en) * | 2002-06-28 | 2006-04-18 | Microsoft Corporation | System and method for automatic detection of collocation mistakes in documents |
| US7194455B2 (en) * | 2002-09-19 | 2007-03-20 | Microsoft Corporation | Method and system for retrieving confirming sentences |
| JP2004252495A (ja) * | 2002-09-19 | 2004-09-09 | Advanced Telecommunication Research Institute International | 統計的機械翻訳装置をトレーニングするためのトレーニングデータを生成する方法および装置、換言装置、ならびに換言装置をトレーニングする方法及びそのためのデータ処理システムおよびコンピュータプログラム |
| US7249012B2 (en) * | 2002-11-20 | 2007-07-24 | Microsoft Corporation | Statistical method and apparatus for learning translation relationships among phrases |
| WO2004049196A2 (en) * | 2002-11-22 | 2004-06-10 | Transclick, Inc. | System and method for speech translation using remote devices |
| JP2004206517A (ja) * | 2002-12-26 | 2004-07-22 | Nifty Corp | ホットキーワード提示方法及びホットサイト提示方法 |
| CN1290036C (zh) * | 2002-12-30 | 2006-12-13 | 国际商业机器公司 | 根据机器可读词典建立概念知识的计算机系统及方法 |
| US7346487B2 (en) * | 2003-07-23 | 2008-03-18 | Microsoft Corporation | Method and apparatus for identifying translations |
| US7412385B2 (en) * | 2003-11-12 | 2008-08-12 | Microsoft Corporation | System for identifying paraphrases using machine translation |
| US7584092B2 (en) * | 2004-11-15 | 2009-09-01 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
| US7698125B2 (en) * | 2004-03-15 | 2010-04-13 | Language Weaver, Inc. | Training tree transducers for probabilistic operations |
| US8296127B2 (en) * | 2004-03-23 | 2012-10-23 | University Of Southern California | Discovery of parallel text portions in comparable collections of corpora and training using comparable texts |
| US20050216253A1 (en) * | 2004-03-25 | 2005-09-29 | Microsoft Corporation | System and method for reverse transliteration using statistical alignment |
| US7593843B2 (en) * | 2004-03-30 | 2009-09-22 | Microsoft Corporation | Statistical language model for logical form using transfer mappings |
| US7620539B2 (en) * | 2004-07-12 | 2009-11-17 | Xerox Corporation | Methods and apparatuses for identifying bilingual lexicons in comparable corpora using geometric processing |
| US7577562B2 (en) * | 2004-11-04 | 2009-08-18 | Microsoft Corporation | Extracting treelet translation pairs |
| US7546235B2 (en) * | 2004-11-15 | 2009-06-09 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
| US7552046B2 (en) * | 2004-11-15 | 2009-06-23 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
| US20060224579A1 (en) * | 2005-03-31 | 2006-10-05 | Microsoft Corporation | Data mining techniques for improving search engine relevance |
| US7813918B2 (en) * | 2005-08-03 | 2010-10-12 | Language Weaver, Inc. | Identifying documents which form translated pairs, within a document collection |
| US20070043553A1 (en) * | 2005-08-16 | 2007-02-22 | Microsoft Corporation | Machine translation models incorporating filtered training data |
| US8312021B2 (en) * | 2005-09-16 | 2012-11-13 | Palo Alto Research Center Incorporated | Generalized latent semantic analysis |
| US7937265B1 (en) * | 2005-09-27 | 2011-05-03 | Google Inc. | Paraphrase acquisition |
| US7908132B2 (en) * | 2005-09-29 | 2011-03-15 | Microsoft Corporation | Writing assistance using machine translation techniques |
| US8943080B2 (en) * | 2006-04-07 | 2015-01-27 | University Of Southern California | Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections |
| US7949514B2 (en) * | 2007-04-20 | 2011-05-24 | Xerox Corporation | Method for building parallel corpora |
| US9020804B2 (en) * | 2006-05-10 | 2015-04-28 | Xerox Corporation | Method for aligning sentences at the word level enforcing selective contiguity constraints |
| US10460327B2 (en) * | 2006-07-28 | 2019-10-29 | Palo Alto Research Center Incorporated | Systems and methods for persistent context-aware guides |
| US20080040339A1 (en) * | 2006-08-07 | 2008-02-14 | Microsoft Corporation | Learning question paraphrases from log data |
| GB2444084A (en) * | 2006-11-23 | 2008-05-28 | Sharp Kk | Selecting examples in an example based machine translation system |
| WO2008078670A1 (ja) * | 2006-12-22 | 2008-07-03 | Nec Corporation | 文言い換え方法、プログラムおよびシステム |
| US8244521B2 (en) * | 2007-01-11 | 2012-08-14 | Microsoft Corporation | Paraphrasing the web by search-based data collection |
| US8332207B2 (en) * | 2007-03-26 | 2012-12-11 | Google Inc. | Large language models in machine translation |
| US9002869B2 (en) * | 2007-06-22 | 2015-04-07 | Google Inc. | Machine translation for query expansion |
| US7983903B2 (en) * | 2007-09-07 | 2011-07-19 | Microsoft Corporation | Mining bilingual dictionaries from monolingual web pages |
| US20090119090A1 (en) * | 2007-11-01 | 2009-05-07 | Microsoft Corporation | Principled Approach to Paraphrasing |
| US8209164B2 (en) * | 2007-11-21 | 2012-06-26 | University Of Washington | Use of lexical translations for facilitating searches |
| US20090182547A1 (en) * | 2008-01-16 | 2009-07-16 | Microsoft Corporation | Adaptive Web Mining of Bilingual Lexicon for Query Translation |
| US8326630B2 (en) * | 2008-08-18 | 2012-12-04 | Microsoft Corporation | Context based online advertising |
| US8306806B2 (en) * | 2008-12-02 | 2012-11-06 | Microsoft Corporation | Adaptive web mining of bilingual lexicon |
| US8352321B2 (en) * | 2008-12-12 | 2013-01-08 | Microsoft Corporation | In-text embedded advertising |
-
2009
- 2009-05-22 US US12/470,492 patent/US20100299132A1/en not_active Abandoned
-
2010
- 2010-05-14 KR KR1020117027693A patent/KR101683324B1/ko not_active Expired - Fee Related
- 2010-05-14 JP JP2012511920A patent/JP5479581B2/ja not_active Expired - Fee Related
- 2010-05-14 EP EP10778179.1A patent/EP2433230A4/en not_active Withdrawn
- 2010-05-14 CA CA2758632A patent/CA2758632C/en not_active Expired - Fee Related
- 2010-05-14 CN CN201080023190.9A patent/CN102439596B/zh not_active Expired - Fee Related
- 2010-05-14 BR BRPI1011214A patent/BRPI1011214A2/pt not_active Application Discontinuation
- 2010-05-14 WO PCT/US2010/035033 patent/WO2010135204A2/en not_active Ceased
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2012527701A5 (enExample) | ||
| JP7302022B2 (ja) | テキスト分類方法、装置、コンピュータ読み取り可能な記憶媒体及びテキスト分類プログラム。 | |
| CN110888991B (zh) | 一种弱标注环境下的分段式语义标注方法 | |
| CN110083696B (zh) | 基于元结构技术的全局引文推荐方法、推荐系统 | |
| US20160180247A1 (en) | Latency-Efficient Multi-Stage Tagging Mechanism | |
| Usman et al. | Urdu text classification using majority voting | |
| US20130036076A1 (en) | Method for keyword extraction | |
| CN104050256A (zh) | 基于主动学习的问答方法及采用该方法的问答系统 | |
| WO2021150313A1 (en) | Contrastive learning for question answering (qa) | |
| CN104484380A (zh) | 个性化搜索方法及装置 | |
| CN113722512A (zh) | 基于语言模型的文本检索方法、装置、设备及存储介质 | |
| CN105095271B (zh) | 微博检索方法和微博检索装置 | |
| CN103853792B (zh) | 一种图片语义自动标注方法与系统 | |
| KR101333485B1 (ko) | 온라인 사전을 이용한 개체명 사전 구축 방법 및 이를 실행하는 장치 | |
| Han et al. | Vertical domain text classification: towards understanding IT tickets using deep Neural Networks | |
| CN108268883B (zh) | 基于开放数据的移动端信息模板自构建系统 | |
| Polignano et al. | Identification Of Bot Accounts In Twitter Using 2D CNNs On User-generated Contents. | |
| CN105677684A (zh) | 一种基于外部数据源对用户生成内容进行语义标注的方法 | |
| CN111191413A (zh) | 一种基于图排序模型的事件核心内容自动标记方法、装置及系统 | |
| CN104077419B (zh) | 结合语义与视觉信息的长查询图像检索重排序方法 | |
| Wu et al. | Mining query subtopics from questions in community question answering | |
| JP5812534B2 (ja) | 質問応答装置、方法、及びプログラム | |
| CN109471980A (zh) | 一种基于文本匹配的搜索推荐相关度计算方法 | |
| CN103177126A (zh) | 用于搜索引擎的色情用户查询识别方法及设备 | |
| Vukotić et al. | IRISA at DeFT 2015: supervised and unsupervised methods in sentiment analysis |