AU2021467326B2 - Processing method, processing program, and information processing apparatus - Google Patents
Processing method, processing program, and information processing apparatus Download PDFInfo
- Publication number
- AU2021467326B2 AU2021467326B2 AU2021467326A AU2021467326A AU2021467326B2 AU 2021467326 B2 AU2021467326 B2 AU 2021467326B2 AU 2021467326 A AU2021467326 A AU 2021467326A AU 2021467326 A AU2021467326 A AU 2021467326A AU 2021467326 B2 AU2021467326 B2 AU 2021467326B2
- Authority
- AU
- Australia
- Prior art keywords
- cluster
- vector
- sentence
- vectors
- record
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
- G06F16/2237—Vectors, bitmaps or matrices
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2021/036696 WO2023058099A1 (ja) | 2021-10-04 | 2021-10-04 | 処理方法、処理プログラムおよび情報処理装置 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| AU2021467326A1 AU2021467326A1 (en) | 2024-04-04 |
| AU2021467326B2 true AU2021467326B2 (en) | 2025-06-05 |
Family
ID=85804003
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU2021467326A Active AU2021467326B2 (en) | 2021-10-04 | 2021-10-04 | Processing method, processing program, and information processing apparatus |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US12517927B2 (https=) |
| EP (1) | EP4414861A4 (https=) |
| JP (1) | JP7643580B2 (https=) |
| CN (1) | CN118043801A (https=) |
| AU (1) | AU2021467326B2 (https=) |
| WO (1) | WO2023058099A1 (https=) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20260105037A1 (en) * | 2024-10-13 | 2026-04-16 | Oracle International Corporation | Partitioning of inverted file (ivf) vector indexes in a database system |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120209847A1 (en) * | 2011-02-16 | 2012-08-16 | Clearwell Systems, Inc. | Methods and systems for automatically generating semantic/concept searches |
| US20130006636A1 (en) * | 2010-03-26 | 2013-01-03 | Nec Corporation | Meaning extraction system, meaning extraction method, and recording medium |
| US20180052876A1 (en) * | 2016-08-16 | 2018-02-22 | Ebay Inc. | Search index trimming |
| CN112256880A (zh) * | 2020-11-11 | 2021-01-22 | 腾讯科技(深圳)有限公司 | 文本识别方法和装置、存储介质及电子设备 |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4423841B2 (ja) | 2002-08-14 | 2010-03-03 | 日本電気株式会社 | キーワード決定装置、決定方法、文書検索装置、検索方法、文書分類装置及び分類方法並びにプログラム |
| JP2005258910A (ja) | 2004-03-12 | 2005-09-22 | Yamatake Corp | 階層キーワード抽出装置、方法、およびプログラム |
| JP6635587B2 (ja) | 2015-12-14 | 2020-01-29 | 日本放送協会 | 広告文選択装置及びプログラム |
| US11036764B1 (en) | 2017-01-12 | 2021-06-15 | Parallels International Gmbh | Document classification filter for search queries |
| JP2021022133A (ja) | 2019-07-26 | 2021-02-18 | 京セラドキュメントソリューションズ株式会社 | フォルダー分けシステムおよびフォルダー分けプログラム |
-
2021
- 2021-10-04 CN CN202180102907.7A patent/CN118043801A/zh active Pending
- 2021-10-04 WO PCT/JP2021/036696 patent/WO2023058099A1/ja not_active Ceased
- 2021-10-04 EP EP21959842.2A patent/EP4414861A4/en active Pending
- 2021-10-04 AU AU2021467326A patent/AU2021467326B2/en active Active
- 2021-10-04 JP JP2023552424A patent/JP7643580B2/ja active Active
-
2024
- 2024-03-27 US US18/617,818 patent/US12517927B2/en active Active
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130006636A1 (en) * | 2010-03-26 | 2013-01-03 | Nec Corporation | Meaning extraction system, meaning extraction method, and recording medium |
| US20120209847A1 (en) * | 2011-02-16 | 2012-08-16 | Clearwell Systems, Inc. | Methods and systems for automatically generating semantic/concept searches |
| US20180052876A1 (en) * | 2016-08-16 | 2018-02-22 | Ebay Inc. | Search index trimming |
| CN112256880A (zh) * | 2020-11-11 | 2021-01-22 | 腾讯科技(深圳)有限公司 | 文本识别方法和装置、存储介质及电子设备 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4414861A4 (en) | 2025-03-05 |
| AU2021467326A1 (en) | 2024-04-04 |
| JPWO2023058099A1 (https=) | 2023-04-13 |
| JP7643580B2 (ja) | 2025-03-11 |
| WO2023058099A1 (ja) | 2023-04-13 |
| CN118043801A (zh) | 2024-05-14 |
| US12517927B2 (en) | 2026-01-06 |
| EP4414861A1 (en) | 2024-08-14 |
| US20240241891A1 (en) | 2024-07-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN113591457B (zh) | 文本纠错方法、装置、设备及存储介质 | |
| Dotan et al. | Effect of tokenization on transformers for biological sequences | |
| CN109344250B (zh) | 基于医保数据的单病种诊断信息快速结构化方法 | |
| Weitschek et al. | Supervised DNA Barcodes species classification: analysis, comparisons and results | |
| Wang et al. | Sketching image gist: Human-mimetic hierarchical scene graph generation | |
| Heyne et al. | GraphClust: alignment-free structural clustering of local RNA secondary structures | |
| CN115563287B (zh) | 一种获取关联对象的数据处理系统 | |
| CN111444320A (zh) | 文本检索方法、装置、计算机设备和存储介质 | |
| US20190147038A1 (en) | Preserving and processing ambiguity in natural language | |
| US20230114374A1 (en) | Storage medium, machine learning apparatus, and machine learning method | |
| JP5373998B1 (ja) | 辞書生成装置、方法、及びプログラム | |
| CN112632264A (zh) | 智能问答方法、装置、电子设备及存储介质 | |
| CN113076748A (zh) | 弹幕敏感词的处理方法、装置、设备及存储介质 | |
| Dellert | Combining information-weighted sequence alignment and sound correspondence models for improved cognate detection | |
| Rooshenas et al. | Discriminative structure learning of arithmetic circuits | |
| AU2021467326B2 (en) | Processing method, processing program, and information processing apparatus | |
| Biharie et al. | Cell type matching across species using protein embeddings and transfer learning | |
| Sharma et al. | Review of clustering methods: toward phylogenetic tree constructions | |
| CN115687314B (zh) | 一种唐卡文化知识图谱展示系统及其构建方法 | |
| CN112199958A (zh) | 概念词序列生成方法、装置、计算机设备及存储介质 | |
| CN110413749B (zh) | 确定标准问题的方法及装置 | |
| Jain et al. | MASA: Motif-aware state assignment in noisy time series data | |
| Wang | Rule-based protein term identification with help from automatic species tagging | |
| EP4276606A1 (en) | Information processing program, information processing method, and information processing device | |
| Lamurias et al. | Identifying interactions between chemical entities in biomedical text |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| DA3 | Amendments made section 104 |
Free format text: THE NATURE OF THE AMENDMENT IS: AMEND THE INVENTION TITLE TO READ PROCESSING METHOD, PROCESSING PROGRAM, AND INFORMATION PROCESSING APPARATUS |
|
| FGA | Letters patent sealed or granted (standard patent) |