CN100361125C - 基于加权编辑距离的自动例句检索的系统和方法 - Google Patents

基于加权编辑距离的自动例句检索的系统和方法 Download PDF

Info

Publication number
CN100361125C
CN100361125C CNB031457274A CN03145727A CN100361125C CN 100361125 C CN100361125 C CN 100361125C CN B031457274 A CNB031457274 A CN B031457274A CN 03145727 A CN03145727 A CN 03145727A CN 100361125 C CN100361125 C CN 100361125C
Authority
CN
China
Prior art keywords
sentence
candidate
example sentence
term
input query
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB031457274A
Other languages
English (en)
Chinese (zh)
Other versions
CN1471030A (zh
Inventor
周明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of CN1471030A publication Critical patent/CN1471030A/zh
Application granted granted Critical
Publication of CN100361125C publication Critical patent/CN100361125C/zh
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • G06F40/45Example-based machine translation; Alignment

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
CNB031457274A 2002-06-28 2003-06-30 基于加权编辑距离的自动例句检索的系统和方法 Expired - Fee Related CN100361125C (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/186,174 US20040002849A1 (en) 2002-06-28 2002-06-28 System and method for automatic retrieval of example sentences based upon weighted editing distance
US10/186,174 2002-06-28

Publications (2)

Publication Number Publication Date
CN1471030A CN1471030A (zh) 2004-01-28
CN100361125C true CN100361125C (zh) 2008-01-09

Family

ID=29779831

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB031457274A Expired - Fee Related CN100361125C (zh) 2002-06-28 2003-06-30 基于加权编辑距离的自动例句检索的系统和方法

Country Status (3)

Country Link
US (1) US20040002849A1 (enExample)
JP (1) JP4173774B2 (enExample)
CN (1) CN100361125C (enExample)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7251648B2 (en) * 2002-06-28 2007-07-31 Microsoft Corporation Automatically ranking answers to database queries
US7577654B2 (en) * 2003-07-25 2009-08-18 Palo Alto Research Center Incorporated Systems and methods for new event detection
US8650187B2 (en) * 2003-07-25 2014-02-11 Palo Alto Research Center Incorporated Systems and methods for linked event detection
GB2415518A (en) * 2004-06-24 2005-12-28 Sharp Kk Method and apparatus for translation based on a repository of existing translations
US8595223B2 (en) * 2004-10-15 2013-11-26 Microsoft Corporation Method and apparatus for intranet searching
CN101346667A (zh) * 2005-12-20 2009-01-14 皇家飞利浦电子股份有限公司 混合传感器系统和方法
WO2007129316A2 (en) 2006-05-07 2007-11-15 Varcode Ltd. A system and method for improved quality management in a product logistic chain
US7562811B2 (en) 2007-01-18 2009-07-21 Varcode Ltd. System and method for improved quality management in a product logistic chain
JP2010526386A (ja) 2007-05-06 2010-07-29 バーコード リミティド バーコード標識を利用する品質管理のシステムと方法
US7818278B2 (en) * 2007-06-14 2010-10-19 Microsoft Corporation Large scale item representation matching
CA2694327A1 (en) 2007-08-01 2009-02-05 Ginger Software, Inc. Automatic context sensitive language correction and enhancement using an internet corpus
EP2218042B1 (en) 2007-11-14 2020-01-01 Varcode Ltd. A system and method for quality management utilizing barcode indicators
US11704526B2 (en) 2008-06-10 2023-07-18 Varcode Ltd. Barcoded indicators for quality management
US20100153366A1 (en) * 2008-12-15 2010-06-17 Motorola, Inc. Assigning an indexing weight to a search term
US8949265B2 (en) 2009-03-05 2015-02-03 Ebay Inc. System and method to provide query linguistic service
US20100281435A1 (en) * 2009-04-30 2010-11-04 At&T Intellectual Property I, L.P. System and method for multimodal interaction using robust gesture processing
CN101957828B (zh) * 2009-07-20 2013-03-06 阿里巴巴集团控股有限公司 一种对搜索结果进行排序的方法和装置
US8479094B2 (en) * 2009-09-08 2013-07-02 Kenneth Peyton Fouts Interactive writing aid to assist a user in finding information and incorporating information correctly into a written work
EP2531930A1 (en) 2010-02-01 2012-12-12 Ginger Software, Inc. Automatic context sensitive language correction using an internet corpus particularly for small keyboard devices
KR101744861B1 (ko) * 2010-02-12 2017-06-08 구글 인코포레이티드 합성어 분할
US8448089B2 (en) 2010-10-26 2013-05-21 Microsoft Corporation Context-aware user input prediction
US20120143593A1 (en) * 2010-12-07 2012-06-07 Microsoft Corporation Fuzzy matching and scoring based on direct alignment
US8620902B2 (en) 2011-06-01 2013-12-31 Lexisnexis, A Division Of Reed Elsevier Inc. Computer program products and methods for query collection optimization
JP5803481B2 (ja) * 2011-09-20 2015-11-04 富士ゼロックス株式会社 情報処理装置及び情報処理プログラム
EP2870543A4 (en) * 2012-10-12 2016-04-06 Hewlett Packard Development Co COMBINATORY SUMMARY
US8807422B2 (en) 2012-10-22 2014-08-19 Varcode Ltd. Tamper-proof quality management barcode indicators
CN102890723B (zh) * 2012-10-25 2016-08-31 深圳市宜搜科技发展有限公司 一种例句检索的方法及系统
JP5846340B2 (ja) * 2013-09-20 2016-01-20 三菱電機株式会社 文字列検索装置
CN106033416B (zh) * 2015-03-09 2019-12-24 阿里巴巴集团控股有限公司 一种字符串处理方法及装置
CA2985160C (en) 2015-05-18 2023-09-05 Varcode Ltd. Thermochromic ink indicia for activatable quality labels
CN107709946B (zh) 2015-07-07 2022-05-10 发可有限公司 电子质量标志
EP3203384A1 (en) * 2016-02-02 2017-08-09 Theo Hoffenberg Method, device, and computer program for providing a definition or a translation of a word belonging to a sentence as a function of neighbouring words and of databases
JP7228083B2 (ja) * 2019-01-31 2023-02-24 日本電信電話株式会社 データ検索装置、方法およびプログラム
JP6751188B1 (ja) * 2019-08-05 2020-09-02 Dmg森精機株式会社 情報処理装置、情報処理方法および情報処理プログラム
CN110795942B (zh) * 2019-09-18 2022-10-14 平安科技(深圳)有限公司 基于语义识别的关键词确定方法、装置和存储介质
CN112307190B (zh) * 2020-10-31 2023-07-25 平安科技(深圳)有限公司 医学文献排序方法、装置、电子设备及存储介质
CN113515933A (zh) * 2021-09-13 2021-10-19 中国电力科学研究院有限公司 电力一二次设备融合处理方法、系统、设备及存储介质
JP2023107339A (ja) 2022-01-24 2023-08-03 富士通株式会社 データ検索方法及びプログラム

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1131302A (zh) * 1994-10-28 1996-09-18 惠普公司 进行串匹配的方法
US6006221A (en) * 1995-08-16 1999-12-21 Syracuse University Multilingual document retrieval system and method using semantic vector matching
CN1302412A (zh) * 1997-07-22 2001-07-04 微软公司 应用搜索结果的自然语言处理以改进整体精度的信息检索系统的设备和方法

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5675819A (en) * 1994-06-16 1997-10-07 Xerox Corporation Document information retrieval using global word co-occurrence patterns
US6424983B1 (en) * 1998-05-26 2002-07-23 Global Information Research And Technologies, Llc Spelling and grammar checking system
US6922669B2 (en) * 1998-12-29 2005-07-26 Koninklijke Philips Electronics N.V. Knowledge-based strategies applied to N-best lists in automatic speech recognition systems

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1131302A (zh) * 1994-10-28 1996-09-18 惠普公司 进行串匹配的方法
US6006221A (en) * 1995-08-16 1999-12-21 Syracuse University Multilingual document retrieval system and method using semantic vector matching
CN1302412A (zh) * 1997-07-22 2001-07-04 微软公司 应用搜索结果的自然语言处理以改进整体精度的信息检索系统的设备和方法

Also Published As

Publication number Publication date
CN1471030A (zh) 2004-01-28
US20040002849A1 (en) 2004-01-01
JP2004062893A (ja) 2004-02-26
JP4173774B2 (ja) 2008-10-29

Similar Documents

Publication Publication Date Title
CN100361125C (zh) 基于加权编辑距离的自动例句检索的系统和方法
US11216504B2 (en) Document recommendation method and device based on semantic tag
JP4974445B2 (ja) 確認文を提供する方法およびシステム
CN108334490B (zh) 关键词提取方法以及关键词提取装置
WO2024228712A1 (en) Process for delivering responses to queries expressed in natural language based on a dynamic document corpus
CN116911312A (zh) 一种任务型对话系统及其实现方法
CN112988980B (zh) 目标产品查询方法、装置、计算机设备和存储介质
KR102765364B1 (ko) 검색 능력 개선 및 생성형 ai 정확도를 높이는 rag기반 법률 정보 질의응답 시스템 및 방법
CA2536262A1 (en) System and method for processing text utilizing a suite of disambiguation techniques
CN101887436A (zh) 一种检索方法、装置和系统
AU2018250372B2 (en) Method to construct content based on a content repository
JP6936014B2 (ja) 教師データ収集装置、教師データ収集方法、及びプログラム
JP3309077B2 (ja) 構文情報を用いた検索方法およびシステム
CN113505196A (zh) 基于词性的文本检索方法、装置、电子设备及存储介质
WO2021189920A1 (zh) 医疗文献簇的主题确定方法、装置、电子设备及存储介质
CN113761104A (zh) 知识图谱中实体关系的检测方法、装置和电子设备
CN118797005A (zh) 智能问答方法、装置、电子设备、存储介质及产品
CN110442681A (zh) 一种机器阅读理解的方法、电子设备及可读存储介质
CN120256587B (zh) 问题查询处理方法及电子设备
Berger et al. The Weaver System for Document Retrieval.
JP2023132977A (ja) 検索プログラム、装置、及び方法
JP2002092017A (ja) 概念辞書拡張方法、装置、および概念辞書拡張プログラムを記録した記録媒体
Sharma et al. An upgraded model of query expansion using inverse-term frequency with pertinent response for internet of things
CN119808778B (zh) 一种基于大语言模型的智能文本切分方法和系统
JP7646091B2 (ja) 情報処理装置、検索方法、及び検索プログラム

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: MICROSOFT TECHNOLOGY LICENSING LLC

Free format text: FORMER OWNER: MICROSOFT CORP.

Effective date: 20150429

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20150429

Address after: Washington State

Patentee after: Micro soft technique license Co., Ltd

Address before: Washington State

Patentee before: Microsoft Corp.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20080109

Termination date: 20180630