BR112023027439A2 - Rotulagem automática de dados de texto - Google Patents

Rotulagem automática de dados de texto

Info

Publication number
BR112023027439A2
BR112023027439A2 BR112023027439A BR112023027439A BR112023027439A2 BR 112023027439 A2 BR112023027439 A2 BR 112023027439A2 BR 112023027439 A BR112023027439 A BR 112023027439A BR 112023027439 A BR112023027439 A BR 112023027439A BR 112023027439 A2 BR112023027439 A2 BR 112023027439A2
Authority
BR
Brazil
Prior art keywords
label
text
technology
candidate text
produce
Prior art date
Application number
BR112023027439A
Other languages
English (en)
Portuguese (pt)
Inventor
Christian Rudnick
Abraham Betser Michael
Milenko Drinic
Mohit Sewak
On Chan Pak
Kiran Reddy Poluri Ravi
Shirish Acharya Sharada
Sihong Liu
Weisheng Li
William Blum
Original Assignee
Microsoft Technology Licensing Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US17/711,506 external-priority patent/US12197486B2/en
Application filed by Microsoft Technology Licensing Llc filed Critical Microsoft Technology Licensing Llc
Publication of BR112023027439A2 publication Critical patent/BR112023027439A2/pt

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3346Query execution using probabilistic model
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/383Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • G06N3/0455Auto-encoder networks; Encoder-decoder networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0475Generative networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/096Transfer learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Library & Information Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • User Interface Of Digital Computer (AREA)
BR112023027439A 2021-06-29 2022-05-23 Rotulagem automática de dados de texto BR112023027439A2 (pt)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
IN202141029147 2021-06-29
US17/711,506 US12197486B2 (en) 2021-06-29 2022-04-01 Automatic labeling of text data
PCT/US2022/030464 WO2023278070A1 (en) 2021-06-29 2022-05-23 Automatic labeling of text data

Publications (1)

Publication Number Publication Date
BR112023027439A2 true BR112023027439A2 (pt) 2024-03-12

Family

ID=82156528

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023027439A BR112023027439A2 (pt) 2021-06-29 2022-05-23 Rotulagem automática de dados de texto

Country Status (9)

Country Link
US (1) US20240370484A1 (enExample)
EP (1) EP4364000A1 (enExample)
JP (1) JP2024524060A (enExample)
KR (1) KR20240023535A (enExample)
AU (1) AU2022304683A1 (enExample)
BR (1) BR112023027439A2 (enExample)
CA (1) CA3225020A1 (enExample)
WO (1) WO2023278070A1 (enExample)
ZA (1) ZA202400308B (enExample)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230385966A1 (en) * 2022-05-31 2023-11-30 Docusign, Inc. Predictive text for contract generation in a document management system
US20240054285A1 (en) * 2022-08-10 2024-02-15 TOTVS, Inc. Sentence pair ranking in natural language processing for a virtual assistant
CN116415154B (zh) * 2023-06-12 2023-08-22 江西五十铃汽车有限公司 一种基于gpt的车辆故障解决方案生成方法及装置
JP2025036355A (ja) * 2023-08-30 2025-03-14 宏達國際電子股▲ふん▼有限公司 外れた文字データをスクリーニングするためのデータ分類方法
CN116910279B (zh) * 2023-09-13 2024-01-05 深圳市智慧城市科技发展集团有限公司 标签提取方法、设备及计算机可读存储介质
CN121970062A (zh) * 2023-10-24 2026-05-01 株式会社半导体能源研究所 信息处理系统、信息处理方法
KR102763213B1 (ko) * 2024-04-04 2025-02-07 주식회사 리턴제로 도메인에 따른 템플릿 기반 데이터 라벨링을 수행하는 전자 장치 및 방법
US12530377B2 (en) 2024-05-22 2026-01-20 Shopify Inc. Additional searching based on confidence in a classification performed by a generative language machine learning model
CN118689468A (zh) * 2024-06-19 2024-09-24 北京百度网讯科技有限公司 基于大模型的代码生成方法、装置、电子设备及存储介质
KR102823763B1 (ko) * 2024-12-10 2025-06-23 한화시스템 주식회사 문장 구문 해석 기반 전투체계 데이터 생성 시스템 및 방법
CN120430300B (zh) * 2025-07-09 2025-09-23 中国民用航空飞行学院 一种航行通告文本自动纠错方法、系统、存储介质及终端
CN120541194B (zh) * 2025-07-25 2025-10-24 浪潮通用软件有限公司 基于多维标签的知识检索方法、系统及计算机设备
CN121303112A (zh) * 2025-09-28 2026-01-09 北京首发展智能科技有限公司 一种基于llm模型的标签获取方法、设备及介质

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10635727B2 (en) * 2016-08-16 2020-04-28 Ebay Inc. Semantic forward search indexing of publication corpus

Also Published As

Publication number Publication date
KR20240023535A (ko) 2024-02-22
AU2022304683A1 (en) 2024-01-04
JP2024524060A (ja) 2024-07-05
CA3225020A1 (en) 2023-01-05
WO2023278070A1 (en) 2023-01-05
ZA202400308B (en) 2025-10-29
US20240370484A1 (en) 2024-11-07
EP4364000A1 (en) 2024-05-08

Similar Documents

Publication Publication Date Title
BR112023027439A2 (pt) Rotulagem automática de dados de texto
Pintzuk Phrase structures in competition: Variation and change in Old English word order
Sulubacak et al. IMST: A revisited Turkish dependency treebank
BR112022016997A2 (pt) Quantização adaptativa para execução de modelos de aprendizagem de máquina
BR112018074148A8 (pt) Sistema configurado e método para modelar documentos clínicos de texto livre e sistema destinado a executar o método
BR112015010802A2 (pt) modelo gramatical para consultas de busca estruturadas
BR112016028797A2 (pt) modelagem de contexto de sessão para sistemas de entendimento de conversação
BR112015026451A2 (pt) gerenciamento de dados sujos para unidades híbridas
Ouvrard et al. Nudging acceptability for wood ash recycling in forests: a choice experiment
BR112022004014A2 (pt) Pré-processamento automático para tradução de caixa preta
Ledgeway Parallels in Romance nominal and clausal microvariation
Adeyanju Generating weather forecast texts with case based reasoning
Kaneko et al. TMU transformer system using BERT for re-ranking at BEA 2019 grammatical error correction on restricted track
Benmamoun VSO word order, primarily in Arabic languages
Haugen Configurationality in classical Nahuatl
Her Historical development of ba and jiang in the Tang dynasty
Mozgovoy Dependency-based rules for grammar checking with LanguageTool
LaTerza Adjectives and determiners
BR112022003312A2 (pt) Sutura magnética
Hu et al. Complexity in the acquisition of relative clauses: Evidence from school-age sequential Mandarin–Italian bilingual children
Çöltekin Using predictability for lexical segmentation
Bungum et al. A survey of domain adaptation in machine translation: Towards a refinement of domain space
Hammerly What ‘other people’mean to ‘us’
Giorgi et al. On the temporal and aspectual value of modern eastern armenian aorist: A comparative perspective
Poeppel Commentary on chapter 10