CN118043801A - Processing method, processing program, and information processing device - Google Patents

Processing method, processing program, and information processing device

Info

Publication number
CN118043801A
Authority
CN
China
Prior art keywords
cluster
vector
vectors
sentence
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202180102907.7A
Other languages
English (en)
Chinese (zh)
Inventor
片冈正弘
永浦良平
瓦伊·丹·妙
尾上聪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-10-04
Filing date
2021-10-04
Publication date
2024-05-14
Application filed by Fujitsu Ltd
Publication of CN118043801A

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases
    • G06F16/285 Clustering or classification
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G06F16/2228 Indexing structures
    • G06F16/2237 Vectors, bitmaps or matrices
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3347 Query execution using vector based model
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification

Landscapes

  • Engineering & Computer Science
  • Theoretical Computer Science
  • Databases & Information Systems
  • Data Mining & Analysis
  • Physics & Mathematics
  • General Engineering & Computer Science
  • General Physics & Mathematics
  • Software Systems
  • Computational Linguistics
  • Information Retrieval, Db Structures And Fs Structures Therefor
CN202180102907.7A 2021-10-04 2021-10-04 Processing method, processing program, and information processing device Pending CN118043801A (zh)

Applications Claiming Priority (1)

Application Number | Publication | Priority Date | Filing Date | Title
PCT/JP2021/036696 | WO2023058099A1 (ja) | 2021-10-04 | 2021-10-04 | Processing method, processing program, and information processing device

Publications (1)

Publication Number | Publication Date
CN118043801A (zh) | 2024-05-14

Family

ID=85804003

Family Applications (1)

Application Number | Status | Publication | Priority Date | Filing Date | Title
CN202180102907.7A | Pending | CN118043801A (zh) | 2021-10-04 | 2021-10-04 | Processing method, processing program, and information processing device

Country Status (6)

Country | Link
US (1) | US12517927B2
EP (1) | EP4414861A4
JP (1) | JP7643580B2
CN (1) | CN118043801A
AU (1) | AU2021467326B2
WO (1) | WO2023058099A1

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20260105037A1 (en) * | 2024-10-13 | 2026-04-16 | Oracle International Corporation | Partitioning of inverted file (ivf) vector indexes in a database system

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JP4423841B2 (ja) | 2002-08-14 | 2010-03-03 | 日本電気株式会社 | Keyword determination device, determination method, document retrieval device, retrieval method, document classification device, classification method, and program
JP2005258910A (ja) | 2004-03-12 | 2005-09-22 | Yamatake Corp | Hierarchical keyword extraction device, method, and program
US9171071B2 (en) * | 2010-03-26 | 2015-10-27 | Nec Corporation | Meaning extraction system, meaning extraction method, and recording medium
US8719257B2 (en) | 2011-02-16 | 2014-05-06 | Symantec Corporation | Methods and systems for automatically generating semantic/concept searches
JP6635587B2 (ja) | 2015-12-14 | 2020-01-29 | 日本放送協会 | Advertisement text selection device and program
US10606873B2 (en) * | 2016-08-16 | 2020-03-31 | Ebay Inc. | Search index trimming
US11036764B1 (en) | 2017-01-12 | 2021-06-15 | Parallels International Gmbh | Document classification filter for search queries
JP2021022133A (ja) | 2019-07-26 | 2021-02-18 | 京セラドキュメントソリューションズ株式会社 | Folder sorting system and folder sorting program
CN112256880B (zh) * | 2020-11-11 | 2024-12-10 | 腾讯科技(深圳)有限公司 | Text recognition method and device, storage medium, and electronic device

Also Published As

Publication number | Publication date
EP4414861A4 (en) | 2025-03-05
AU2021467326A1 (en) | 2024-04-04
JPWO2023058099A1 | 2023-04-13
JP7643580B2 (ja) | 2025-03-11
WO2023058099A1 (ja) | 2023-04-13
US12517927B2 (en) | 2026-01-06
AU2021467326B2 (en) | 2025-06-05
EP4414861A1 (en) | 2024-08-14
US20240241891A1 (en) | 2024-07-18

Similar Documents

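Publication | Title
List et al. | Sequence comparison in computational historical linguistics
CN111444320A | Text retrieval method, apparatus, computer device, and storage medium
CN115563287B | Data processing system for acquiring associated objects
US11507746B2 | Method and apparatus for generating context information
CN110162771B | Method, apparatus, and electronic device for recognizing event trigger words
US20130198192A1 | Author disambiguation
CN111753550A | Semantic parsing method for natural language
CN111324771A | Method, apparatus, electronic device, and storage medium for determining video tags
CN109448793B | Method and system for claim-scope annotation, retrieval, and information annotation of gene sequences
CN113961666A | Keyword recognition method, apparatus, device, medium, and computer program product
CN115470358B | Cross-lingual entity linking method, system, device, and terminal
CN113468311B | Knowledge-graph-based method, apparatus, and storage medium for answering complex questions
CN116049354B | Natural-language-based multi-table retrieval method and apparatus
CN112632264A | Intelligent question answering method, apparatus, electronic device, and storage medium
CN119129600B | Place-name entity recognition method, medium, and device based on spatial relationship awareness
CN113868406B | Search method, system, and computer-readable storage medium
Siddalingappa et al. | Bi-directional long short term memory using recurrent neural network for biological entity recognition
Craig et al. | Scaling address parsing sequence models through active learning
JP6260678B2 | Information processing device, information processing method, and information processing program
CN115687314B | Thangka culture knowledge graph display system and construction method thereof
CN118043801A | Processing method, processing program, and information processing device
CN112667809A | Text processing method and apparatus, electronic device, and storage medium
CN113076758A | Multi-domain request-type intent recognition method for task-oriented dialogue
US20220171937A1 | Document sentence concept labeling system, training method and labeling method thereof
US20210073258A1 | Information processing apparatus and non-transitory computer readable medium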

Legal Events

Code | Title
PB01 | Publication
SE01 | Entry into force of request for substantive examination