CN1637744A - Machine-learned approach to determining document relevance for search over large electronic collections of documents - Google Patents

Machine-learned approach to determining document relevance for search over large electronic collections of documents

Info

Publication number
CN1637744A
Authority
CN
China
Prior art keywords
document
training
data
information
subclauses
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2005100040669A
Other languages
English (en)
Chinese (zh)
Inventor
H. Chen
R. Chandrasekar
S. H. Corston
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2004-01-09
Filing date
2005-01-07
Publication date
2005-07-13
Application filed by Microsoft Corp
Publication of CN1637744A

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3346 Query execution using probabilistic model
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CNA2005100040669A 2004-01-09 2005-01-07 Machine-learned approach to determining document relevance for search over large electronic collections of documents Pending CN1637744A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/754,159 2004-01-09
US10/754,159 US7287012B2 (en) 2004-01-09 2004-01-09 Machine-learned approach to determining document relevance for search over large electronic collections of documents

Publications (1)

Publication Number Publication Date
CN1637744A (zh) 2005-07-13

Family

ID=34739321

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2005100040669A Pending CN1637744A (zh) Machine-learned approach to determining document relevance for search over large electronic collections of documents

Country Status (5)

Country Link
US (1) US7287012B2
EP (1) EP1574972A3
JP (2) JP2005222532A
KR (1) KR101027864B1
CN (1) CN1637744A

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
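CN102314453A (zh) * 2010-06-30 2012-01-11 Baidu Online Network Technology (Beijing) Co., Ltd. Method and system for screening high-quality versions
CN102436505A (zh) * 2010-12-16 2012-05-02 Microsoft Corporation Deriving document similarity indices
CN102436510A (zh) * 2011-12-30 2012-05-02 Zhejiang Lede Network Technology Co., Ltd. Method and system for improving online real-time search quality through offline queries
CN101283356B (zh) * 2005-10-14 2012-10-10 Microsoft Corporation Search results injected into client applications
CN103198217A (zh) * 2013-03-26 2013-07-10 X. Q. Li Fault detection method and system
CN105144164A (zh) * 2013-03-13 2015-12-09 Google Inc. Scoring concept terms using a deep network
CN105210064A (zh) * 2013-03-13 2015-12-30 Google Inc. Classifying resources using a deep network
CN105260482A (zh) * 2015-11-16 2016-01-20 Jinling Institute of Technology Crowdsourcing-based device and method for discovering new Internet words
CN110023962A (zh) * 2016-12-22 2019-07-16 Intel Corporation Efficient transfer of human experiences to robots and other autonomous machines
CN110532376A (zh) * 2018-04-13 2019-12-03 International Business Machines Corporation Classifying text to determine a goal type used to select machine learning algorithm outcomes
CN111539756A (zh) * 2019-02-07 2020-08-14 AO Kaspersky Lab System and method for identifying and targeting users based on search requirements
CN113127642A (zh) * 2021-04-29 2021-07-16 Guangmeng Data Technology (Shanghai) Co., Ltd. Controllable automatic document classification method, apparatus, device, and storage medium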

Families Citing this family (77)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8271316B2 (en) * 1999-12-17 2012-09-18 Buzzmetrics Ltd Consumer to business data capturing system
US7197470B1 (en) 2000-10-11 2007-03-27 Buzzmetrics, Ltd. System and method for collection analysis of electronic discussion methods
US7725414B2 (en) 2004-03-16 2010-05-25 Buzzmetrics, Ltd An Israel Corporation Method for developing a classifier for classifying communications
US8527442B2 (en) * 2004-05-14 2013-09-03 Lawrence Fu Method for predicting citation counts
US8275772B2 (en) * 2004-05-14 2012-09-25 Yin Aphinyanaphongs Content and quality assessment method and apparatus for quality searching
US7296021B2 (en) * 2004-05-21 2007-11-13 International Business Machines Corporation Method, system, and article to specify compound query, displaying visual indication includes a series of graphical bars specify weight relevance, ordered segments of unique colors where each segment length indicative of the extent of match of each object with one of search parameters
US7617176B2 (en) * 2004-07-13 2009-11-10 Microsoft Corporation Query-based snippet clustering for search result grouping
US20180146879A9 (en) * 2004-08-30 2018-05-31 Kalford C. Fadem Biopotential Waveform Data Fusion Analysis and Classification Method
US20060053156A1 (en) * 2004-09-03 2006-03-09 Howard Kaushansky Systems and methods for developing intelligence from information existing on a network
US7801887B2 (en) * 2004-10-27 2010-09-21 Harris Corporation Method for re-ranking documents retrieved from a document database
US7797165B1 (en) * 2005-02-17 2010-09-14 E-Scan Data Systems, Inc. Lossless account compression for health care patient benefits eligibility research system and methods
US7778850B2 (en) 2005-02-17 2010-08-17 E-Scan Data Systems, Inc. Health care patient benefits eligibility research system and methods
US9158855B2 (en) 2005-06-16 2015-10-13 Buzzmetrics, Ltd Extracting structured data from weblogs
US20060288001A1 (en) * 2005-06-20 2006-12-21 Costa Rafael Rego P R System and method for dynamically identifying the best search engines and searchable databases for a query, and model of presentation of results - the search assistant
US8195654B1 (en) * 2005-07-13 2012-06-05 Google Inc. Prediction of human ratings or rankings of information retrieval quality
US20070100779A1 (en) * 2005-08-05 2007-05-03 Ori Levy Method and system for extracting web data
GB0521552D0 (en) * 2005-10-22 2005-11-30 Ibm Method and system for constructing a classifier
US8726144B2 (en) * 2005-12-23 2014-05-13 Xerox Corporation Interactive learning-based document annotation
US7644373B2 (en) * 2006-01-23 2010-01-05 Microsoft Corporation User interface for viewing clusters of images
US7836050B2 (en) * 2006-01-25 2010-11-16 Microsoft Corporation Ranking content based on relevance and quality
US7814040B1 (en) 2006-01-31 2010-10-12 The Research Foundation Of State University Of New York System and method for image annotation and multi-modal image retrieval using probabilistic semantic models
US9529903B2 (en) * 2006-04-26 2016-12-27 The Bureau Of National Affairs, Inc. System and method for topical document searching
US7860818B2 (en) * 2006-06-29 2010-12-28 Siemens Corporation System and method for case-based multilabel classification and ranking
US7660783B2 (en) * 2006-09-27 2010-02-09 Buzzmetrics, Inc. System and method of ad-hoc analysis of data
US7707208B2 (en) * 2006-10-10 2010-04-27 Microsoft Corporation Identifying sight for a location
US7672912B2 (en) * 2006-10-26 2010-03-02 Microsoft Corporation Classifying knowledge aging in emails using Naïve Bayes Classifier
US20080201634A1 (en) * 2007-02-20 2008-08-21 Gibb Erik W System and method for customizing a user interface
US20080215607A1 (en) * 2007-03-02 2008-09-04 Umbria, Inc. Tribe or group-based analysis of social media including generating intelligence from a tribe's weblogs or blogs
US20090037401A1 (en) * 2007-07-31 2009-02-05 Microsoft Corporation Information Retrieval and Ranking
US8122015B2 (en) * 2007-09-21 2012-02-21 Microsoft Corporation Multi-ranker for search
US20090150387A1 (en) * 2007-11-08 2009-06-11 Marchewitz Jodi L Guided research tool
US8347326B2 (en) 2007-12-18 2013-01-01 The Nielsen Company (US) Identifying key media events and modeling causal relationships between key events and reported feelings
US20090240549A1 (en) * 2008-03-21 2009-09-24 Microsoft Corporation Recommendation system for a task brokerage system
US20090240539A1 (en) * 2008-03-21 2009-09-24 Microsoft Corporation Machine learning system for a task brokerage system
US8171007B2 (en) * 2008-04-18 2012-05-01 Microsoft Corporation Creating business value by embedding domain tuned search on web-sites
KR100921892B1 (ko) * 2008-04-25 2009-10-13 NHN Corp. Method and system for generating a rank learning model using weight normalization
US8290946B2 (en) * 2008-06-24 2012-10-16 Microsoft Corporation Consistent phrase relevance measures
US8793249B2 (en) * 2008-09-24 2014-07-29 Yahoo! Inc. Optimization filters for user generated content searches
US20100082639A1 (en) * 2008-09-30 2010-04-01 Microsoft Corporation Processing maximum likelihood for listwise rankings
US8849790B2 (en) * 2008-12-24 2014-09-30 Yahoo! Inc. Rapid iterative development of classifiers
US7958137B2 (en) * 2009-02-03 2011-06-07 Honeywell International Inc. Method to assist user in creation of highly inter-related models in complex databases
US9330165B2 (en) * 2009-02-13 2016-05-03 Microsoft Technology Licensing, Llc Context-aware query suggestion by mining log data
US20100257167A1 (en) * 2009-04-01 2010-10-07 Microsoft Corporation Learning to rank using query-dependent loss functions
US8527523B1 (en) * 2009-04-22 2013-09-03 Equivio Ltd. System for enhancing expert-based computerized analysis of a set of digital documents and methods useful in conjunction therewith
KR101067376B1 (ko) * 2009-04-29 2011-09-23 Seoul National University R&DB Foundation Method for associative information processing using multisets, and memory device therefor
US8935258B2 (en) * 2009-06-15 2015-01-13 Microsoft Corporation Identification of sample data items for re-judging
US10353967B2 (en) 2009-06-22 2019-07-16 Microsoft Technology Licensing, Llc Assigning relevance weights based on temporal dynamics
US11403568B2 (en) * 2010-01-06 2022-08-02 Integral Ad Science, Inc. Methods, systems, and media for providing direct and hybrid data acquisition approaches
EP2369504A1 (en) 2010-03-26 2011-09-28 British Telecommunications public limited company System
US10235679B2 (en) 2010-04-22 2019-03-19 Microsoft Technology Licensing, Llc Learning a ranker to rank entities with automatically derived domain-specific preferences
CN101937445B (zh) * 2010-05-24 2011-12-07 Institute of Scientific and Technical Information of China Automatic file classification system
US8874727B2 (en) 2010-05-31 2014-10-28 The Nielsen Company (Us), Llc Methods, apparatus, and articles of manufacture to rank users in an online social network
CN102419755B (zh) 2010-09-28 2013-04-24 Alibaba Group Holding Limited Method and device for ranking search results
WO2012075221A1 (en) * 2010-12-01 2012-06-07 Data Engines Corporation Method for inferring attributes of a data set and recognizers used thereon
WO2013123182A1 (en) * 2012-02-17 2013-08-22 The Trustees Of Columbia University In The City Of New York Computer-implemented systems and methods of performing contract review
US9292517B2 (en) 2013-01-03 2016-03-22 Board Of Regents, The University Of Texas System Efficiently identifying images, videos, songs or documents most relevant to the user based on attribute feedback
US9176993B2 (en) 2013-01-03 2015-11-03 Board Of Regents, The University Of Texas System Efficiently identifying images, videos, songs or documents most relevant to the user using binary search trees on attributes for guiding relevance feedback
US9275291B2 (en) 2013-06-17 2016-03-01 Texifter, LLC System and method of classifier ranking for incorporation into enhanced machine learning
RU2583739C2 (ru) 2013-10-16 2016-05-10 Yandex LLC Server for determining search output in response to a search query, and electronic device
JP5576544B1 (ja) * 2013-10-17 2014-08-20 Preferred Infrastructure, Inc. Information processing device
US10417568B2 (en) * 2014-05-22 2019-09-17 International Business Machines Corporation Discovering cognition bias toward data presentation styles through file system analysis
CN104077412B (zh) * 2014-07-14 2018-04-13 Fuzhou University Method for predicting microblog user interests based on multiple Markov chains
US10733520B2 (en) 2015-05-13 2020-08-04 Microsoft Technology Licensing, Llc Making a prediction regarding development of a software product
RU2632133C2 (ru) 2015-09-29 2017-10-02 Yandex LLC Method (variants) and system (variants) for creating a prediction model and determining the accuracy of the prediction model
US9876699B2 (en) * 2015-10-21 2018-01-23 Wipro Limited System and method for generating a report in real-time from a resource management system
KR102570278B1 (ko) 2017-07-31 2023-08-24 Samsung Electronics Co., Ltd. Apparatus and method for generating training data used to train a student model from a teacher model
RU2693324C2 (ru) 2017-11-24 2019-07-02 Yandex LLC Method and server for converting a categorical factor value into its numeric representation
RU2692048C2 (ru) 2017-11-24 2019-06-19 Yandex LLC Method and server for converting a categorical factor value into its numeric representation and for creating a splitting value of the categorical factor
KR102069621B1 (ko) * 2018-05-28 2020-01-23 Incheon National University Research & Business Foundation Apparatus and method for document classification using document structure and deep learning
US12387146B2 (en) 2018-11-15 2025-08-12 Semiconductor Energy Laboratory Co., Ltd. Content classification method
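JP7730641B2 (ja) 2018-12-13 2025-08-28 Semiconductor Energy Laboratory Co., Ltd. Content classification method
KR20210126033A (ko) 2019-02-15 2021-10-19 Semiconductor Energy Laboratory Co., Ltd. Parameter search method
KR102046748B1 (ko) 2019-04-25 2019-11-19 Soongsil University Industry-Academic Cooperation Foundation Tree-boosting-based method for assessing application risk, and recording medium and device for performing the same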
US11455346B2 (en) 2020-03-31 2022-09-27 International Business Machines Corporation Advanced search and document retrieval for development and verification system prototypes
EP3951614A1 (en) * 2020-08-07 2022-02-09 Basf Se Practical supervised classification of data sets
KR102442300B1 (ko) * 2020-10-12 2022-09-13 Urban Data Lab Co., Ltd. System for predicting online shopping mall sales strategy using a hidden Markov model
US11449516B2 (en) 2020-11-04 2022-09-20 International Business Machines Corporation Ranking of documents belonging to different domains based on comparison of descriptors thereof

Family Cites Families (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5341142A (en) * 1987-07-24 1994-08-23 Northrop Grumman Corporation Target acquisition and tracking system
US5903454A (en) * 1991-12-23 1999-05-11 Hoffberg; Linda Irene Human-factored interface corporating adaptive pattern recognition based controller apparatus
US5875108A (en) * 1991-12-23 1999-02-23 Hoffberg; Steven M. Ergonomic man-machine interface incorporating adaptive pattern recognition based control system
US5640409A (en) * 1993-07-02 1997-06-17 Sony Corporation Semiconductor laser
US5671333A (en) * 1994-04-07 1997-09-23 Lucent Technologies Inc. Training apparatus and method
US5854855A (en) * 1994-09-09 1998-12-29 Motorola, Inc. Method and system using meta-classes and polynomial discriminant functions for handwriting recognition
US5802205A (en) * 1994-09-09 1998-09-01 Motorola, Inc. Method and system for lexical processing
US5768417A (en) * 1994-09-09 1998-06-16 Motorola, Inc. Method and system for velocity-based handwriting recognition
US5978497A (en) * 1994-09-20 1999-11-02 Neopath, Inc. Apparatus for the identification of free-lying cells
US5701400A (en) * 1995-03-08 1997-12-23 Amado; Carlos Armando Method and apparatus for applying if-then-else rules to data sets in a relational data base and generating from the results of application of said rules a database of diagnostics linked to said data sets to aid executive analysis of financial data
US5729452A (en) * 1995-03-31 1998-03-17 Envirotest Acquisition Co. Method and system for diagnosing and reporting failure of a vehicle emission test
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
JPH09231238A (ja) * 1996-02-20 1997-09-05 Omron Corp Text search result display method and device
US5862259A (en) * 1996-03-27 1999-01-19 Caere Corporation Pattern recognition employing arbitrary segmentation and compound probabilistic evaluation
US5920852A (en) * 1996-04-30 1999-07-06 Grannet Corporation Large memory storage and retrieval (LAMSTAR) network
JP2940501B2 (ja) * 1996-12-25 1999-08-25 NEC Corporation Document classification device and method
US6373483B1 (en) * 1997-01-13 2002-04-16 Silicon Graphics, Inc. Method, system and computer program product for visually approximating scattered data using color to represent values of a categorical variable
US6278464B1 (en) * 1997-03-07 2001-08-21 Silicon Graphics, Inc. Method, system, and computer program product for visualizing a decision-tree classifier
US6137499A (en) * 1997-03-07 2000-10-24 Silicon Graphics, Inc. Method, system, and computer program product for visualizing data using partial hierarchies
US5884294A (en) * 1997-04-18 1999-03-16 Northrop Grumman Corporation System and method for functional recognition of emitters
US5930803A (en) * 1997-04-30 1999-07-27 Silicon Graphics, Inc. Method, system, and computer program product for visualizing an evidence classifier
US5902477A (en) * 1997-04-30 1999-05-11 John Vena Combined sewer overflow and storm water diverter screen
WO1998050892A1 (en) * 1997-05-07 1998-11-12 Cummins-Allison Corp. Intelligent currency handling system
US6137911A (en) * 1997-06-16 2000-10-24 The Dialog Corporation Plc Test classification system and method
US6278961B1 (en) * 1997-07-02 2001-08-21 Nonlinear Solutions, Inc. Signal and pattern detection or classification by estimation of continuous dynamical models
US5933822A (en) * 1997-07-22 1999-08-03 Microsoft Corporation Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision
JP3178406B2 (ja) * 1998-02-27 2001-06-18 NEC Corporation Hierarchical text classification device and machine-readable recording medium storing a program
US6161130A (en) * 1998-06-23 2000-12-12 Microsoft Corporation Technique which utilizes a probabilistic classifier to detect "junk" e-mail by automatically updating a training and re-training the classifier based on the updated training set
US6301579B1 (en) * 1998-10-20 2001-10-09 Silicon Graphics, Inc. Method, system, and computer program product for visualizing a data structure
EP1006458A1 (en) * 1998-12-01 2000-06-07 BRITISH TELECOMMUNICATIONS public limited company Methods and apparatus for information retrieval
US6460049B1 (en) * 1998-12-22 2002-10-01 Silicon Graphics, Inc. Method system and computer program product for visualizing an evidence classifier
US6697799B1 (en) * 1999-09-10 2004-02-24 Requisite Technology, Inc. Automated classification of items using cascade searches
US6546388B1 (en) * 2000-01-14 2003-04-08 International Business Machines Corporation Metadata search results ranking system
CA2307404A1 (en) * 2000-05-02 2001-11-02 Provenance Systems Inc. Computer readable electronic records automated classification system
US6578032B1 (en) * 2000-06-28 2003-06-10 Microsoft Corporation Method and system for performing phrase/word clustering and cluster merging
US6892193B2 (en) * 2001-05-10 2005-05-10 International Business Machines Corporation Method and apparatus for inducing classifiers for multimedia based on unified representation of features reflecting disparate modalities
US6993535B2 (en) * 2001-06-18 2006-01-31 International Business Machines Corporation Business method and apparatus for employing induced multimedia classifiers based on unified representation of features reflecting disparate modalities
US7136845B2 (en) * 2001-07-12 2006-11-14 Microsoft Corporation System and method for query refinement to enable improved searching based on identifying and utilizing popular concepts related to users' queries
US6978264B2 (en) * 2002-01-03 2005-12-20 Microsoft Corporation System and method for performing a search and a browse on a query
US7043468B2 (en) * 2002-01-31 2006-05-09 Hewlett-Packard Development Company, L.P. Method and system for measuring the quality of a hierarchy
JP3873135B2 (ja) * 2002-03-08 2007-01-24 International Business Machines Corporation Data processing method, and information processing system and program using the same
US20030225763A1 (en) * 2002-04-15 2003-12-04 Microsoft Corporation Self-improving system and method for classifying pages on the world wide web
US7158957B2 (en) 2002-11-21 2007-01-02 Honeywell International Inc. Supervised self organizing maps with fuzzy error correction
US7020593B2 (en) * 2002-12-04 2006-03-28 International Business Machines Corporation Method for ensemble predictive modeling by multiplicative adjustment of class probability: APM (adjusted probability model)
JP3939264B2 (ja) * 2003-03-24 2007-07-04 Oki Electric Industry Co., Ltd. Morphological analysis device

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
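CN101283356B (zh) * 2005-10-14 2012-10-10 Microsoft Corporation Search results injected into client applications
CN102314453A (zh) * 2010-06-30 2012-01-11 Baidu Online Network Technology (Beijing) Co., Ltd. Method and system for screening high-quality versions
CN102314453B (zh) * 2010-06-30 2015-11-25 Baidu Online Network Technology (Beijing) Co., Ltd. Method and system for screening high-quality versions
CN102436505B (zh) * 2010-12-16 2014-08-20 Microsoft Corporation Deriving document similarity indices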
US8793242B2 (en) 2010-12-16 2014-07-29 Microsoft Corporation Deriving document similarity indices
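CN102436505A (zh) * 2010-12-16 2012-05-02 Microsoft Corporation Deriving document similarity indices
CN102436510A (zh) * 2011-12-30 2012-05-02 Zhejiang Lede Network Technology Co., Ltd. Method and system for improving online real-time search quality through offline queries
CN105210064B (zh) * 2013-03-13 2020-08-04 Google LLC Classifying resources using a deep network
CN105144164A (zh) * 2013-03-13 2015-12-09 Google Inc. Scoring concept terms using a deep network
CN105210064A (zh) * 2013-03-13 2015-12-30 Google Inc. Classifying resources using a deep network
CN105144164B (zh) * 2013-03-13 2020-10-27 Google LLC Scoring concept terms using a deep network
CN103198217A (zh) * 2013-03-26 2013-07-10 X. Q. Li Fault detection method and system
CN103198217B (zh) * 2013-03-26 2016-06-22 X. Q. Li Fault detection method and system
CN105260482A (zh) * 2015-11-16 2016-01-20 Jinling Institute of Technology Crowdsourcing-based device and method for discovering new Internet words
CN110023962A (zh) * 2016-12-22 2019-07-16 Intel Corporation Efficient transfer of human experiences to robots and other autonomous machines
CN110023962B (zh) * 2016-12-22 2024-03-12 Intel Corporation Efficient transfer of human experiences to robots and other autonomous machines
CN110532376A (zh) * 2018-04-13 2019-12-03 International Business Machines Corporation Classifying text to determine a goal type used to select machine learning algorithm outcomes
CN110532376B (zh) * 2018-04-13 2024-03-19 Merative US L.P. Classifying text to determine a goal type used to select machine learning algorithm outcomes
CN111539756A (zh) * 2019-02-07 2020-08-14 AO Kaspersky Lab System and method for identifying and targeting users based on search requirements
CN111539756B (zh) * 2019-02-07 2023-08-22 AO Kaspersky Lab System and method for identifying and targeting users based on search requirements
CN113127642A (zh) * 2021-04-29 2021-07-16 Guangmeng Data Technology (Shanghai) Co., Ltd. Controllable automatic document classification method, apparatus, device, and storage medium
CN113127642B (zh) * 2021-04-29 2022-12-23 Guangmeng Data Technology (Shanghai) Co., Ltd. Controllable automatic document classification method, apparatus, device, and storage medium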

Also Published As

Publication number Publication date
US20050154686A1 (en) 2005-07-14
EP1574972A3 (en) 2006-05-24
JP2005222532A (ja) 2005-08-18
JP2009104630A (ja) 2009-05-14
KR101027864B1 (ko) 2011-04-07
US7287012B2 (en) 2007-10-23
EP1574972A2 (en) 2005-09-14
KR20050073429A (ko) 2005-07-13

Similar Documents

Publication Publication Date Title
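CN1637744A (zh) Machine-learned approach to determining document relevance for search over large electronic collections of documents
CN112214670B (zh) Online course recommendation method and apparatus, electronic device, and storage medium
CN106095928B (zh) Event type recognition method and device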
AU2005209586B2 (en) Systems, methods, and interfaces for providing personalized search and information access
US9678992B2 (en) Text to image translation
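CN117909466A (zh) Domain question answering system, construction method, electronic device, and storage medium
CN111104526A (zh) Financial label extraction method and system based on keyword semantics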
US20040199546A1 (en) Construction of trainable semantic vectors and clustering, classification, and searching using trainable semantic vectors
US20200175052A1 (en) Classification of electronic documents
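CN118277521A (zh) LLM-based intelligent question answering method, system, device, and medium for the electric power domain
CN1841380A (zh) Data mining techniques for improving search engine relevance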
US20180341686A1 (en) System and method for data search based on top-to-bottom similarity analysis
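CN117891929B (zh) Knowledge graph intelligent question answering information recognition method using an improved deep learning algorithm
CN110046264A (zh) Automatic classification method for mobile phone documents
CN116049376B (zh) Method, device, and system for retrieving and replying to Xinchuang (IT application innovation) knowledge
CN113553419A (zh) Civil aviation knowledge graph question answering system
CN118966192A (zh) Policy text parsing method
CN118035286A (zh) Information query system based on a large medical model
CN118095272A (zh) Text recognition method and apparatus, electronic device, and storage medium
CN119646191B (zh) Automatic labeling method, apparatus, and device based on a large model and a clustering algorithm
CN112148983B (zh) Content update recommendation method for the tax industry
CN117763109B (zh) Data verification method for full-text archive retrieval
KR20220075490A (ko) Learning content recommendation method
CN119474307A (zh) Document question answering method and system based on LLM, RAG, and knowledge graph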
Tacioli et al. An architecture for animal sound identification based on multiple feature extraction and classification algorithms

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication