JP7771196B2 - 自然言語プロセッサのための複数特徴均衡化 - Google Patents

自然言語プロセッサのための複数特徴均衡化

Info

Publication number
JP7771196B2
JP7771196B2 JP2023543405A JP2023543405A JP7771196B2 JP 7771196 B2 JP7771196 B2 JP 7771196B2 JP 2023543405 A JP2023543405 A JP 2023543405A JP 2023543405 A JP2023543405 A JP 2023543405A JP 7771196 B2 JP7771196 B2 JP 7771196B2
Authority
JP
Japan
Prior art keywords
natural language
dataset
features
machine learning
contextual
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2023543405A
Other languages
English (en)
Japanese (ja)
Other versions
JP2024503519A (ja
JP2024503519A5 (https=
Inventor
ドゥオング,タン・ロング
ビシュノイ,ビシャル
ジョンソン,マーク・エドワード
ジャラルッディン,エリアス・ルクマン
ファム,トゥエン・クアン
ホアン,コン・ズイ・ブー
ザレムーディ,ポーヤ
ガッデ,シュリニバーサ・ファニ・クマール
カヌガ,アシュナ・デバング
リー,ズーカイ
ウー,ユエンシュ
Original Assignee
オラクル・インターナショナル・コーポレイション
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by オラクル・インターナショナル・コーポレイション filed Critical オラクル・インターナショナル・コーポレイション
Publication of JP2024503519A publication Critical patent/JP2024503519A/ja
Publication of JP2024503519A5 publication Critical patent/JP2024503519A5/ja
Priority to JP2025185483A priority Critical patent/JP2026027326A/ja
Application granted granted Critical
Publication of JP7771196B2 publication Critical patent/JP7771196B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/263Language identification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/02User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail using automatic reactions or user delegation, e.g. automatic replies or chatbot-generated messages

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
JP2023543405A 2021-01-20 2022-01-20 自然言語プロセッサのための複数特徴均衡化 Active JP7771196B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2025185483A JP2026027326A (ja) 2021-01-20 2025-11-04 自然言語プロセッサのための複数特徴均衡化

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163139695P 2021-01-20 2021-01-20
US63/139,695 2021-01-20
PCT/US2022/013060 WO2022159544A1 (en) 2021-01-20 2022-01-20 Multi-feature balancing for natural language processors

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2025185483A Division JP2026027326A (ja) 2021-01-20 2025-11-04 自然言語プロセッサのための複数特徴均衡化

Publications (3)

Publication Number Publication Date
JP2024503519A JP2024503519A (ja) 2024-01-25
JP2024503519A5 JP2024503519A5 (https=) 2025-01-09
JP7771196B2 true JP7771196B2 (ja) 2025-11-17

Family

ID=82406292

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2023543405A Active JP7771196B2 (ja) 2021-01-20 2022-01-20 自然言語プロセッサのための複数特徴均衡化
JP2025185483A Pending JP2026027326A (ja) 2021-01-20 2025-11-04 自然言語プロセッサのための複数特徴均衡化

Family Applications After (1)

Application Number Title Priority Date Filing Date
JP2025185483A Pending JP2026027326A (ja) 2021-01-20 2025-11-04 自然言語プロセッサのための複数特徴均衡化

Country Status (5)

Country Link
US (2) US12153885B2 (https=)
EP (1) EP4281880A4 (https=)
JP (2) JP7771196B2 (https=)
CN (1) CN116724306A (https=)
WO (1) WO2022159544A1 (https=)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022047214A2 (en) * 2020-08-27 2022-03-03 Carnelian Laboratories Llc Digital assistant control of applications
US12175968B1 (en) * 2021-03-26 2024-12-24 Amazon Technologies, Inc. Skill selection for responding to natural language inputs
JP2024517656A (ja) * 2021-04-28 2024-04-23 コーニンクレッカ フィリップス エヌ ヴェ 医用イメージングシステム用のチャットボット
US11729121B2 (en) * 2021-04-29 2023-08-15 Bank Of America Corporation Executing a network of chatbots using a combination approach
US11914644B2 (en) * 2021-10-11 2024-02-27 Microsoft Technology Licensing, Llc Suggested queries for transcript search
US20230401385A1 (en) * 2022-06-13 2023-12-14 Oracle International Corporation Hierarchical named entity recognition with multi-task setup
US12511140B2 (en) * 2022-11-28 2025-12-30 Sap Se Performance controller for machine learning based digital assistant
US12608562B2 (en) * 2023-09-21 2026-04-21 Google Llc Providing personalized prompts to users based on documents in cloud storage
TWI897448B (zh) * 2024-05-29 2025-09-11 神通資訊科技股份有限公司 提供多模態人機交互導引的多媒體事務機之系統及其方法
US12314305B1 (en) * 2024-11-24 2025-05-27 Signet Health Corporation System and method for generating an updated terminal node projection

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150127594A1 (en) 2013-11-04 2015-05-07 Google Inc. Transfer learning for deep neural network based hotword detection
JP2019514045A (ja) 2016-03-21 2019-05-30 アマゾン テクノロジーズ インコーポレイテッド 話者照合方法及びシステム
WO2020037217A1 (en) 2018-08-16 2020-02-20 Oracle International Corporation Techniques for building a knowledge graph in limited knowledge domains

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160293167A1 (en) * 2013-10-10 2016-10-06 Google Inc. Speaker recognition using neural networks
US9672814B2 (en) * 2015-05-08 2017-06-06 International Business Machines Corporation Semi-supervised learning of word embeddings
US11694072B2 (en) * 2017-05-19 2023-07-04 Nvidia Corporation Machine learning technique for automatic modeling of multiple-valued outputs
US10453454B2 (en) * 2017-10-26 2019-10-22 Hitachi, Ltd. Dialog system with self-learning natural language understanding
US10579733B2 (en) * 2018-05-10 2020-03-03 Google Llc Identifying codemixed text
US10861439B2 (en) 2018-10-22 2020-12-08 Ca, Inc. Machine learning model for identifying offensive, computer-generated natural-language text or speech
CN109783337B (zh) * 2018-12-19 2022-08-30 北京达佳互联信息技术有限公司 模型服务方法、系统、装置和计算机可读存储介质
CN109858018A (zh) * 2018-12-25 2019-06-07 中国科学院信息工程研究所 一种面向威胁情报的实体识别方法及系统
CN109918503B (zh) * 2019-01-29 2020-12-22 华南理工大学 基于动态窗口自注意力机制提取语义特征的槽填充方法
CN109918648B (zh) * 2019-01-31 2020-04-21 内蒙古工业大学 一种基于动态滑动窗口特征评分的谣言深度检测方法
WO2020219203A1 (en) 2019-04-26 2020-10-29 Oracle International Corporation Insights into performance of a bot system
US11481388B2 (en) * 2019-12-18 2022-10-25 Roy Fugère SIANEZ Methods and apparatus for using machine learning to securely and efficiently retrieve and present search results
US11250839B2 (en) * 2020-04-16 2022-02-15 Microsoft Technology Licensing, Llc Natural language processing models for conversational computing
US11450310B2 (en) * 2020-08-10 2022-09-20 Adobe Inc. Spoken language understanding
CN111949770A (zh) * 2020-08-24 2020-11-17 国网浙江省电力有限公司信息通信分公司 一种文档分类方法及装置
US11893354B2 (en) * 2021-03-25 2024-02-06 Cognizant Technology Solutions India Pvt. Ltd. System and method for improving chatbot training dataset

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150127594A1 (en) 2013-11-04 2015-05-07 Google Inc. Transfer learning for deep neural network based hotword detection
JP2019514045A (ja) 2016-03-21 2019-05-30 アマゾン テクノロジーズ インコーポレイテッド 話者照合方法及びシステム
WO2020037217A1 (en) 2018-08-16 2020-02-20 Oracle International Corporation Techniques for building a knowledge graph in limited knowledge domains

Also Published As

Publication number Publication date
US12153885B2 (en) 2024-11-26
US20220229991A1 (en) 2022-07-21
JP2024503519A (ja) 2024-01-25
JP2026027326A (ja) 2026-02-18
EP4281880A4 (en) 2024-12-18
EP4281880A1 (en) 2023-11-29
US20240419910A1 (en) 2024-12-19
WO2022159544A1 (en) 2022-07-28
CN116724306A (zh) 2023-09-08

Similar Documents

Publication Publication Date Title
JP7682202B2 (ja) ドメイン外(ood)検出のための改良された技術
JP7703667B2 (ja) 固有表現認識モデルを用いたコンテキストタグ統合
US12099816B2 (en) Multi-factor modelling for natural language processing
JP7561836B2 (ja) 自然言語処理のためのストップワードデータ拡張
JP7721559B2 (ja) 自然言語処理のためのノイズデータ拡張
JP7789778B2 (ja) 自然言語処理のためのドメイン外データ拡張
JP7771196B2 (ja) 自然言語プロセッサのための複数特徴均衡化
JP7843760B2 (ja) 自然言語処理のための距離ベースのロジット値
JP7726995B2 (ja) 自然言語処理のための強化されたロジット
JP2025118956A (ja) 堅牢な固有表現認識のためのチャットボットにおけるエンティティレベルデータ拡張
JP7828346B2 (ja) 自然言語処理のためのキーワードデータ拡張ツール
US12367352B2 (en) Deep learning techniques for extraction of embedded data from documents
KR102821062B1 (ko) 사전-트레이닝된 언어 모델들에 대한 긴 텍스트를 핸들링하기 위한 시스템 및 기술들
JP2024543062A (ja) 自然言語処理のパスのドロップアウト
US20230205999A1 (en) Gazetteer integration for neural named entity recognition
WO2023091436A1 (en) System and techniques for handling long text for pre-trained language models

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241225

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20241225

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20251007

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20251105

R150 Certificate of patent or registration of utility model

Ref document number: 7771196

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150