AU2021467326B2 - Processing method, processing program, and information processing apparatus - Google Patents

Processing method, processing program, and information processing apparatus Download PDF

Info

Publication number
AU2021467326B2
AU2021467326B2 AU2021467326A AU2021467326A AU2021467326B2 AU 2021467326 B2 AU2021467326 B2 AU 2021467326B2 AU 2021467326 A AU2021467326 A AU 2021467326A AU 2021467326 A AU2021467326 A AU 2021467326A AU 2021467326 B2 AU2021467326 B2 AU 2021467326B2
Authority
AU
Australia
Prior art keywords
cluster
vector
sentence
vectors
record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
AU2021467326A
Other languages
English (en)
Other versions
AU2021467326A1 (en
Inventor
Masahiro Kataoka
Wai Thant MYO
Ryohei NAGAURA
Satoshi ONOUE
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Publication of AU2021467326A1 publication Critical patent/AU2021467326A1/en
Application granted granted Critical
Publication of AU2021467326B2 publication Critical patent/AU2021467326B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2237Vectors, bitmaps or matrices
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
AU2021467326A 2021-10-04 2021-10-04 Processing method, processing program, and information processing apparatus Active AU2021467326B2 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2021/036696 WO2023058099A1 (ja) 2021-10-04 2021-10-04 処理方法、処理プログラムおよび情報処理装置

Publications (2)

Publication Number Publication Date
AU2021467326A1 AU2021467326A1 (en) 2024-04-04
AU2021467326B2 true AU2021467326B2 (en) 2025-06-05

Family

ID=85804003

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2021467326A Active AU2021467326B2 (en) 2021-10-04 2021-10-04 Processing method, processing program, and information processing apparatus

Country Status (6)

Country Link
US (1) US12517927B2 (https=)
EP (1) EP4414861A4 (https=)
JP (1) JP7643580B2 (https=)
CN (1) CN118043801A (https=)
AU (1) AU2021467326B2 (https=)
WO (1) WO2023058099A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20260105037A1 (en) * 2024-10-13 2026-04-16 Oracle International Corporation Partitioning of inverted file (ivf) vector indexes in a database system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120209847A1 (en) * 2011-02-16 2012-08-16 Clearwell Systems, Inc. Methods and systems for automatically generating semantic/concept searches
US20130006636A1 (en) * 2010-03-26 2013-01-03 Nec Corporation Meaning extraction system, meaning extraction method, and recording medium
US20180052876A1 (en) * 2016-08-16 2018-02-22 Ebay Inc. Search index trimming
CN112256880A (zh) * 2020-11-11 2021-01-22 腾讯科技(深圳)有限公司 文本识别方法和装置、存储介质及电子设备

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4423841B2 (ja) 2002-08-14 2010-03-03 日本電気株式会社 キーワード決定装置、決定方法、文書検索装置、検索方法、文書分類装置及び分類方法並びにプログラム
JP2005258910A (ja) 2004-03-12 2005-09-22 Yamatake Corp 階層キーワード抽出装置、方法、およびプログラム
JP6635587B2 (ja) 2015-12-14 2020-01-29 日本放送協会 広告文選択装置及びプログラム
US11036764B1 (en) 2017-01-12 2021-06-15 Parallels International Gmbh Document classification filter for search queries
JP2021022133A (ja) 2019-07-26 2021-02-18 京セラドキュメントソリューションズ株式会社 フォルダー分けシステムおよびフォルダー分けプログラム

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130006636A1 (en) * 2010-03-26 2013-01-03 Nec Corporation Meaning extraction system, meaning extraction method, and recording medium
US20120209847A1 (en) * 2011-02-16 2012-08-16 Clearwell Systems, Inc. Methods and systems for automatically generating semantic/concept searches
US20180052876A1 (en) * 2016-08-16 2018-02-22 Ebay Inc. Search index trimming
CN112256880A (zh) * 2020-11-11 2021-01-22 腾讯科技(深圳)有限公司 文本识别方法和装置、存储介质及电子设备

Also Published As

Publication number Publication date
EP4414861A4 (en) 2025-03-05
AU2021467326A1 (en) 2024-04-04
JPWO2023058099A1 (https=) 2023-04-13
JP7643580B2 (ja) 2025-03-11
WO2023058099A1 (ja) 2023-04-13
CN118043801A (zh) 2024-05-14
US12517927B2 (en) 2026-01-06
EP4414861A1 (en) 2024-08-14
US20240241891A1 (en) 2024-07-18

Similar Documents

Publication Publication Date Title
CN113591457B (zh) 文本纠错方法、装置、设备及存储介质
Dotan et al. Effect of tokenization on transformers for biological sequences
CN109344250B (zh) 基于医保数据的单病种诊断信息快速结构化方法
Weitschek et al. Supervised DNA Barcodes species classification: analysis, comparisons and results
Wang et al. Sketching image gist: Human-mimetic hierarchical scene graph generation
Heyne et al. GraphClust: alignment-free structural clustering of local RNA secondary structures
CN115563287B (zh) 一种获取关联对象的数据处理系统
CN111444320A (zh) 文本检索方法、装置、计算机设备和存储介质
US20190147038A1 (en) Preserving and processing ambiguity in natural language
US20230114374A1 (en) Storage medium, machine learning apparatus, and machine learning method
JP5373998B1 (ja) 辞書生成装置、方法、及びプログラム
CN112632264A (zh) 智能问答方法、装置、电子设备及存储介质
CN113076748A (zh) 弹幕敏感词的处理方法、装置、设备及存储介质
Dellert Combining information-weighted sequence alignment and sound correspondence models for improved cognate detection
Rooshenas et al. Discriminative structure learning of arithmetic circuits
AU2021467326B2 (en) Processing method, processing program, and information processing apparatus
Biharie et al. Cell type matching across species using protein embeddings and transfer learning
Sharma et al. Review of clustering methods: toward phylogenetic tree constructions
CN115687314B (zh) 一种唐卡文化知识图谱展示系统及其构建方法
CN112199958A (zh) 概念词序列生成方法、装置、计算机设备及存储介质
CN110413749B (zh) 确定标准问题的方法及装置
Jain et al. MASA: Motif-aware state assignment in noisy time series data
Wang Rule-based protein term identification with help from automatic species tagging
EP4276606A1 (en) Information processing program, information processing method, and information processing device
Lamurias et al. Identifying interactions between chemical entities in biomedical text

Legal Events

Date Code Title Description
DA3 Amendments made section 104

Free format text: THE NATURE OF THE AMENDMENT IS: AMEND THE INVENTION TITLE TO READ PROCESSING METHOD, PROCESSING PROGRAM, AND INFORMATION PROCESSING APPARATUS

FGA Letters patent sealed or granted (standard patent)