JP7643580B2 - Processing method, processing program, and information processing apparatus - Google Patents

Processing method, processing program, and information processing apparatus

Info

Publication number
JP7643580B2
Authority
JP
Japan
Prior art keywords
sentence
cluster
vector
vectors
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2023552424A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2023058099A1
JPWO2023058099A5
Inventor
正弘 片岡
良平 永浦
ウェイタント ミョ
聡 尾上
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-10-04
Filing date
2021-10-04
Publication date
2025-03-11
Application filed by Fujitsu Ltd
Publication of JPWO2023058099A1
Publication of JPWO2023058099A5
Application granted
Publication of JP7643580B2
Current legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases
    • G06F16/285 Clustering or classification
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G06F16/2228 Indexing structures
    • G06F16/2237 Vectors, bitmaps or matrices
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3347 Query execution using vector based model
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
JP2023552424A 2021-10-04 2021-10-04 Processing method, processing program, and information processing apparatus Active JP7643580B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2021/036696 WO2023058099A1 (ja) 2021-10-04 2021-10-04 Processing method, processing program, and information processing apparatus

Publications (3)

Publication Number Publication Date
JPWO2023058099A1 2023-04-13
JPWO2023058099A5 2024-03-26
JP7643580B2 2025-03-11

Family

ID=85804003

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023552424A Active JP7643580B2 (ja) 2021-10-04 2021-10-04 Processing method, processing program, and information processing apparatus

Country Status (6)

Country Link
US (1) US12517927B2
EP (1) EP4414861A4
JP (1) JP7643580B2
CN (1) CN118043801A
AU (1) AU2021467326B2
WO (1) WO2023058099A1

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20260105037A1 (en) * 2024-10-13 2026-04-16 Oracle International Corporation Partitioning of inverted file (ivf) vector indexes in a database system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
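JP2004078446A (ja) 2002-08-14 2004-03-11 Nec Corp Keyword extraction device, extraction method, document search device, search method, document classification device, classification method, and program
US20120209847A1 (en) 2011-02-16 2012-08-16 Clearwell Systems, Inc. Methods and systems for automatically generating semantic/concept searches
CN112256880A (zh) 2020-11-11 2021-01-22 腾讯科技(深圳)有限公司 Text recognition method and apparatus, storage medium, and electronic device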
US11036764B1 (en) 2017-01-12 2021-06-15 Parallels International Gmbh Document classification filter for search queries

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005258910A (ja) 2004-03-12 2005-09-22 Yamatake Corp Hierarchical keyword extraction device, method, and program
US9171071B2 (en) * 2010-03-26 2015-10-27 Nec Corporation Meaning extraction system, meaning extraction method, and recording medium
JP6635587B2 (ja) 2015-12-14 2020-01-29 日本放送協会 Advertisement text selection device and program
US10606873B2 (en) * 2016-08-16 2020-03-31 Ebay Inc. Search index trimming
JP2021022133A (ja) 2019-07-26 2021-02-18 京セラドキュメントソリューションズ株式会社 Folder sorting system and folder sorting program

Also Published As

Publication number Publication date
EP4414861A4 (en) 2025-03-05
AU2021467326A1 (en) 2024-04-04
JPWO2023058099A1 2023-04-13
WO2023058099A1 (ja) 2023-04-13
CN118043801A (zh) 2024-05-14
US12517927B2 (en) 2026-01-06
AU2021467326B2 (en) 2025-06-05
EP4414861A1 (en) 2024-08-14
US20240241891A1 (en) 2024-07-18

Similar Documents

Publication Publication Date Title
CN113761218B (zh) Entity linking method, apparatus, device, and storage medium
Nakata et al. A comprehensive big-data-based monitoring system for yield enhancement in semiconductor manufacturing
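CN111444320A (zh) Text retrieval method, apparatus, computer device, and storage medium
CN115563287B (zh) Data processing system for obtaining associated objects
WO2020182019A1 (zh) Image retrieval method, apparatus, device, and computer-readable storage medium
CN113051356A (zh) Open relation extraction method, apparatus, electronic device, and storage medium
CN110851596A (zh) Text classification method, apparatus, and computer-readable storage medium
KR20200032258A (ko) Method for finding k extreme values within a constant processing time
CN111966811B (zh) Intent recognition and slot filling method, apparatus, readable storage medium, and terminal device
CN115470358B (zh) Cross-lingual entity linking method, system, device, and terminal
CN113468311B (zh) Knowledge-graph-based question answering method for complex questions, apparatus, and storage medium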
US20230114374A1 (en) Storage medium, machine learning apparatus, and machine learning method
WO2021223882A1 (en) Prediction explanation in machine learning classifiers
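CN107506350A (zh) Method and device for identifying information
CN113656547A (zh) Text matching method, apparatus, device, and storage medium
CN112632264A (zh) Intelligent question answering method, apparatus, electronic device, and storage medium
CN114840680A (zh) Joint entity and relation extraction method, apparatus, storage medium, and terminal
CN114706927B (zh) Artificial-intelligence-based batch data annotation method and related device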
Rooshenas et al. Discriminative structure learning of arithmetic circuits
US20220284172A1 (en) Machine learning technologies for structuring unstructured data
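CN117992573A (zh) Information retrieval method based on text expansion, apparatus, electronic device, and medium
JP7643580B2 (ja) Processing method, processing program, and information processing apparatus
CN112199958A (zh) Concept word sequence generation method, apparatus, computer device, and storage medium
CN112667809A (zh) Text processing method, apparatus, electronic device, and storage medium
CN113988002B (zh) Approximate attention system and method based on a neural clustering method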

Legal Events

Date Code Title Description
2023-12-21 A521 Request for written amendment filed Free format text: JAPANESE INTERMEDIATE CODE: A523
2023-12-21 A621 Written request for application examination Free format text: JAPANESE INTERMEDIATE CODE: A621
2024-12-03 A131 Notification of reasons for refusal Free format text: JAPANESE INTERMEDIATE CODE: A131
2025-01-16 A521 Request for written amendment filed Free format text: JAPANESE INTERMEDIATE CODE: A523
TRDD Decision of grant or rejection written
2025-01-28 A01 Written decision to grant a patent or to grant a registration (utility model) Free format text: JAPANESE INTERMEDIATE CODE: A01
2025-02-10 A61 First payment of annual fees (during grant procedure) Free format text: JAPANESE INTERMEDIATE CODE: A61
R150 Certificate of patent or registration of utility model Ref document number: 7643580; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R150