CN100361125C - 基于加权编辑距离的自动例句检索的系统和方法 - Google Patents

基于加权编辑距离的自动例句检索的系统和方法 Download PDF

Info

Publication number
CN100361125C
CN100361125C CNB031457274A CN03145727A CN100361125C CN 100361125 C CN100361125 C CN 100361125C CN B031457274 A CNB031457274 A CN B031457274A CN 03145727 A CN03145727 A CN 03145727A CN 100361125 C CN100361125 C CN 100361125C
Authority
CN
China
Prior art keywords
sentence
candidate
example sentence
term
input query
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB031457274A
Other languages
English (en)
Chinese (zh)
Other versions
CN1471030A (zh
Inventor
周明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of CN1471030A publication Critical patent/CN1471030A/zh
Application granted granted Critical
Publication of CN100361125C publication Critical patent/CN100361125C/zh
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • G06F40/45Example-based machine translation; Alignment

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
CNB031457274A 2002-06-28 2003-06-30 基于加权编辑距离的自动例句检索的系统和方法 Expired - Fee Related CN100361125C (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/186,174 2002-06-28
US10/186,174 US20040002849A1 (en) 2002-06-28 2002-06-28 System and method for automatic retrieval of example sentences based upon weighted editing distance

Publications (2)

Publication Number Publication Date
CN1471030A CN1471030A (zh) 2004-01-28
CN100361125C true CN100361125C (zh) 2008-01-09

Family

ID=29779831

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB031457274A Expired - Fee Related CN100361125C (zh) 2002-06-28 2003-06-30 基于加权编辑距离的自动例句检索的系统和方法

Country Status (3)

Country Link
US (1) US20040002849A1 (https=)
JP (1) JP4173774B2 (https=)
CN (1) CN100361125C (https=)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7251648B2 (en) * 2002-06-28 2007-07-31 Microsoft Corporation Automatically ranking answers to database queries
US7577654B2 (en) * 2003-07-25 2009-08-18 Palo Alto Research Center Incorporated Systems and methods for new event detection
US8650187B2 (en) * 2003-07-25 2014-02-11 Palo Alto Research Center Incorporated Systems and methods for linked event detection
GB2415518A (en) * 2004-06-24 2005-12-28 Sharp Kk Method and apparatus for translation based on a repository of existing translations
US8595223B2 (en) * 2004-10-15 2013-11-26 Microsoft Corporation Method and apparatus for intranet searching
CN101346667A (zh) * 2005-12-20 2009-01-14 皇家飞利浦电子股份有限公司 混合传感器系统和方法
US7562811B2 (en) 2007-01-18 2009-07-21 Varcode Ltd. System and method for improved quality management in a product logistic chain
WO2007129316A2 (en) 2006-05-07 2007-11-15 Varcode Ltd. A system and method for improved quality management in a product logistic chain
US8528808B2 (en) 2007-05-06 2013-09-10 Varcode Ltd. System and method for quality management utilizing barcode indicators
US7818278B2 (en) * 2007-06-14 2010-10-19 Microsoft Corporation Large scale item representation matching
US8914278B2 (en) 2007-08-01 2014-12-16 Ginger Software, Inc. Automatic context sensitive language correction and enhancement using an internet corpus
US8500014B2 (en) 2007-11-14 2013-08-06 Varcode Ltd. System and method for quality management utilizing barcode indicators
US11704526B2 (en) 2008-06-10 2023-07-18 Varcode Ltd. Barcoded indicators for quality management
US20100153366A1 (en) * 2008-12-15 2010-06-17 Motorola, Inc. Assigning an indexing weight to a search term
US8949265B2 (en) * 2009-03-05 2015-02-03 Ebay Inc. System and method to provide query linguistic service
US20100281435A1 (en) * 2009-04-30 2010-11-04 At&T Intellectual Property I, L.P. System and method for multimodal interaction using robust gesture processing
CN101957828B (zh) * 2009-07-20 2013-03-06 阿里巴巴集团控股有限公司 一种对搜索结果进行排序的方法和装置
US8479094B2 (en) * 2009-09-08 2013-07-02 Kenneth Peyton Fouts Interactive writing aid to assist a user in finding information and incorporating information correctly into a written work
CA2787390A1 (en) 2010-02-01 2011-08-04 Ginger Software, Inc. Automatic context sensitive language correction using an internet corpus particularly for small keyboard devices
WO2011100573A1 (en) * 2010-02-12 2011-08-18 Google Inc. Compound splitting
US8448089B2 (en) 2010-10-26 2013-05-21 Microsoft Corporation Context-aware user input prediction
US20120143593A1 (en) * 2010-12-07 2012-06-07 Microsoft Corporation Fuzzy matching and scoring based on direct alignment
US8620902B2 (en) 2011-06-01 2013-12-31 Lexisnexis, A Division Of Reed Elsevier Inc. Computer program products and methods for query collection optimization
JP5803481B2 (ja) * 2011-09-20 2015-11-04 富士ゼロックス株式会社 情報処理装置及び情報処理プログラム
US9977829B2 (en) * 2012-10-12 2018-05-22 Hewlett-Packard Development Company, L.P. Combinatorial summarizer
US8807422B2 (en) 2012-10-22 2014-08-19 Varcode Ltd. Tamper-proof quality management barcode indicators
CN102890723B (zh) * 2012-10-25 2016-08-31 深圳市宜搜科技发展有限公司 一种例句检索的方法及系统
JP5846340B2 (ja) * 2013-09-20 2016-01-20 三菱電機株式会社 文字列検索装置
CN106033416B (zh) * 2015-03-09 2019-12-24 阿里巴巴集团控股有限公司 一种字符串处理方法及装置
EP3298367B1 (en) 2015-05-18 2020-04-29 Varcode Ltd. Thermochromic ink indicia for activatable quality labels
EP3320315B1 (en) 2015-07-07 2020-03-04 Varcode Ltd. Electronic quality indicator
EP3203384A1 (en) * 2016-02-02 2017-08-09 Theo Hoffenberg Method, device, and computer program for providing a definition or a translation of a word belonging to a sentence as a function of neighbouring words and of databases
JP7228083B2 (ja) * 2019-01-31 2023-02-24 日本電信電話株式会社 データ検索装置、方法およびプログラム
JP6751188B1 (ja) * 2019-08-05 2020-09-02 Dmg森精機株式会社 情報処理装置、情報処理方法および情報処理プログラム
CN110795942B (zh) * 2019-09-18 2022-10-14 平安科技(深圳)有限公司 基于语义识别的关键词确定方法、装置和存储介质
CN112307190B (zh) * 2020-10-31 2023-07-25 平安科技(深圳)有限公司 医学文献排序方法、装置、电子设备及存储介质
CN113515933A (zh) * 2021-09-13 2021-10-19 中国电力科学研究院有限公司 电力一二次设备融合处理方法、系统、设备及存储介质
JP2023107339A (ja) 2022-01-24 2023-08-03 富士通株式会社 データ検索方法及びプログラム

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1131302A (zh) * 1994-10-28 1996-09-18 惠普公司 进行串匹配的方法
US6006221A (en) * 1995-08-16 1999-12-21 Syracuse University Multilingual document retrieval system and method using semantic vector matching
CN1302412A (zh) * 1997-07-22 2001-07-04 微软公司 应用搜索结果的自然语言处理以改进整体精度的信息检索系统的设备和方法

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5675819A (en) * 1994-06-16 1997-10-07 Xerox Corporation Document information retrieval using global word co-occurrence patterns
US6424983B1 (en) * 1998-05-26 2002-07-23 Global Information Research And Technologies, Llc Spelling and grammar checking system
US6922669B2 (en) * 1998-12-29 2005-07-26 Koninklijke Philips Electronics N.V. Knowledge-based strategies applied to N-best lists in automatic speech recognition systems

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1131302A (zh) * 1994-10-28 1996-09-18 惠普公司 进行串匹配的方法
US6006221A (en) * 1995-08-16 1999-12-21 Syracuse University Multilingual document retrieval system and method using semantic vector matching
CN1302412A (zh) * 1997-07-22 2001-07-04 微软公司 应用搜索结果的自然语言处理以改进整体精度的信息检索系统的设备和方法

Also Published As

Publication number Publication date
CN1471030A (zh) 2004-01-28
JP2004062893A (ja) 2004-02-26
US20040002849A1 (en) 2004-01-01
JP4173774B2 (ja) 2008-10-29

Similar Documents

Publication Publication Date Title
CN100361125C (zh) 基于加权编辑距离的自动例句检索的系统和方法
US11216504B2 (en) Document recommendation method and device based on semantic tag
CN100507903C (zh) 检索确认句的方法和系统
KR102765364B1 (ko) 검색 능력 개선 및 생성형 ai 정확도를 높이는 rag기반 법률 정보 질의응답 시스템 및 방법
KR101932618B1 (ko) 검색 쿼리에 응답하여 유사성 스코어에 기초하여 이미지와 콘텐츠에 대해 평가 및 랭킹을 진행하기 위한 방법 및 시스템
CN108334490B (zh) 关键词提取方法以及关键词提取装置
WO2024228712A1 (en) Process for delivering responses to queries expressed in natural language based on a dynamic document corpus
CN116911312A (zh) 一种任务型对话系统及其实现方法
CN112988980B (zh) 目标产品查询方法、装置、计算机设备和存储介质
AU2018250372B2 (en) Method to construct content based on a content repository
CN111191105B (zh) 政务信息的搜索方法、装置、系统、设备及存储介质
CN113505196A (zh) 基于词性的文本检索方法、装置、电子设备及存储介质
JP6936014B2 (ja) 教師データ収集装置、教師データ収集方法、及びプログラム
JP3309077B2 (ja) 構文情報を用いた検索方法およびシステム
WO2021189920A1 (zh) 医疗文献簇的主题确定方法、装置、电子设备及存储介质
CN110442681A (zh) 一种机器阅读理解的方法、电子设备及可读存储介质
CN113761104A (zh) 知识图谱中实体关系的检测方法、装置和电子设备
CN118797005A (zh) 智能问答方法、装置、电子设备、存储介质及产品
CN117609468A (zh) 生成检索语句的方法及装置
CN120256587B (zh) 问题查询处理方法及电子设备
CN119938814A (zh) 一种基于llm和rag的智能找矿问答系统
JP2023132977A (ja) 検索プログラム、装置、及び方法
CN118152508A (zh) 一种核电厂长文本检索系统和方法
JP2002092017A (ja) 概念辞書拡張方法、装置、および概念辞書拡張プログラムを記録した記録媒体
Naamha et al. Web page ranking based on text content and link information using data mining techniques

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: MICROSOFT TECHNOLOGY LICENSING LLC

Free format text: FORMER OWNER: MICROSOFT CORP.

Effective date: 20150429

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20150429

Address after: Washington State

Patentee after: Micro soft technique license Co., Ltd

Address before: Washington State

Patentee before: Microsoft Corp.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20080109

Termination date: 20180630