CN114722196A - 基于注意力机制的企业文本多标签标注方法及系统 - Google Patents
基于注意力机制的企业文本多标签标注方法及系统 Download PDFInfo
- Publication number
- CN114722196A CN114722196A CN202210319228.1A CN202210319228A CN114722196A CN 114722196 A CN114722196 A CN 114722196A CN 202210319228 A CN202210319228 A CN 202210319228A CN 114722196 A CN114722196 A CN 114722196A
- Authority
- CN
- China
- Prior art keywords
- enterprise
- text
- label
- marking
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000002372 labelling Methods 0.000 title claims abstract description 48
- 230000007246 mechanism Effects 0.000 title claims abstract description 31
- 238000000034 method Methods 0.000 claims abstract description 46
- 238000007781 pre-processing Methods 0.000 claims abstract description 12
- 230000008569 process Effects 0.000 claims abstract description 10
- 239000013598 vector Substances 0.000 claims description 36
- 238000012549 training Methods 0.000 claims description 23
- 238000012360 testing method Methods 0.000 claims description 14
- 238000013527 convolutional neural network Methods 0.000 claims description 13
- 239000011159 matrix material Substances 0.000 claims description 12
- 230000015654 memory Effects 0.000 claims description 12
- 238000012545 processing Methods 0.000 claims description 11
- 238000013528 artificial neural network Methods 0.000 claims description 9
- 238000003860 storage Methods 0.000 claims description 7
- 238000004140 cleaning Methods 0.000 claims description 6
- 239000000284 extract Substances 0.000 claims description 6
- 230000000694 effects Effects 0.000 claims description 5
- 230000008030 elimination Effects 0.000 claims description 4
- 238000003379 elimination reaction Methods 0.000 claims description 4
- 230000006870 function Effects 0.000 claims description 4
- 230000006399 behavior Effects 0.000 claims description 3
- 238000012795 verification Methods 0.000 claims description 2
- 238000009826 distribution Methods 0.000 abstract description 2
- 238000004590 computer program Methods 0.000 description 5
- 238000000605 extraction Methods 0.000 description 4
- 238000007689 inspection Methods 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000004913 activation Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
- G06F2216/03—Data mining
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Evolutionary Computation (AREA)
- Biophysics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210319228.1A CN114722196A (zh) | 2022-03-29 | 2022-03-29 | 基于注意力机制的企业文本多标签标注方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210319228.1A CN114722196A (zh) | 2022-03-29 | 2022-03-29 | 基于注意力机制的企业文本多标签标注方法及系统 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114722196A true CN114722196A (zh) | 2022-07-08 |
Family
ID=82240770
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210319228.1A Pending CN114722196A (zh) | 2022-03-29 | 2022-03-29 | 基于注意力机制的企业文本多标签标注方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114722196A (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115422920A (zh) * | 2022-11-03 | 2022-12-02 | 南京信息工程大学 | 基于bert和gat的裁判文书争议焦点识别方法 |
CN116860979A (zh) * | 2023-09-04 | 2023-10-10 | 上海柯林布瑞信息技术有限公司 | 基于标签知识库的医疗文本标注方法及装置 |
-
2022
- 2022-03-29 CN CN202210319228.1A patent/CN114722196A/zh active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115422920A (zh) * | 2022-11-03 | 2022-12-02 | 南京信息工程大学 | 基于bert和gat的裁判文书争议焦点识别方法 |
CN115422920B (zh) * | 2022-11-03 | 2023-02-28 | 南京信息工程大学 | 基于bert和gat的裁判文书争议焦点识别方法 |
CN116860979A (zh) * | 2023-09-04 | 2023-10-10 | 上海柯林布瑞信息技术有限公司 | 基于标签知识库的医疗文本标注方法及装置 |
CN116860979B (zh) * | 2023-09-04 | 2023-12-08 | 上海柯林布瑞信息技术有限公司 | 基于标签知识库的医疗文本标注方法及装置 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6908628B2 (ja) | 画像分類及びラベリング | |
CN113822494A (zh) | 风险预测方法、装置、设备及存储介质 | |
CA3225621A1 (en) | Ai-augmented auditing platform including techniques for automated document processing | |
CN112070138B (zh) | 多标签混合分类模型的构建方法、新闻分类方法及系统 | |
CN107291822A (zh) | 基于深度学习的问题分类模型训练方法、分类方法及装置 | |
CN114722196A (zh) | 基于注意力机制的企业文本多标签标注方法及系统 | |
US20220180624A1 (en) | Method and device for automatic identification of labels of an image | |
CN111368175B (zh) | 一种事件抽取方法和系统及实体分类模型 | |
CN111767725A (zh) | 一种基于情感极性分析模型的数据处理方法及装置 | |
US11250299B2 (en) | Learning representations of generalized cross-modal entailment tasks | |
CN110728182B (zh) | 基于ai面试系统的面试方法、装置和计算机设备 | |
CN113204967B (zh) | 简历命名实体识别方法及系统 | |
KR102280490B1 (ko) | 상담 의도 분류용 인공지능 모델을 위한 훈련 데이터를 자동으로 생성하는 훈련 데이터 구축 방법 | |
CN114417785A (zh) | 知识点标注方法、模型的训练方法、计算机设备及存储介质 | |
CN110766460A (zh) | 一种用户画像的方法、装置、存储介质及计算机设备 | |
CN113360654A (zh) | 文本分类方法、装置、电子设备及可读存储介质 | |
CN115905538A (zh) | 基于知识图谱的事件多标签分类方法、装置、设备及介质 | |
CN116150367A (zh) | 一种基于方面的情感分析方法及系统 | |
Asha et al. | Artificial Neural Networks based DIGI Writing | |
CN114398480A (zh) | 基于关键信息抽取的金融舆情细分方面检测方法和设备 | |
CN112347739A (zh) | 适用规则分析方法、装置、电子设备及存储介质 | |
KR102663632B1 (ko) | 인공지능 기반의 미술품 거래의 트랜드 예측 장치 및 방법 | |
CN116263784A (zh) | 面向图片文本的粗粒度情感分析方法及装置 | |
CN115017894A (zh) | 一种舆情风险识别方法及装置 | |
US20210216721A1 (en) | System and method to quantify subject-specific sentiment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Country or region after: China Address after: 250014 No. 19, ASTRI Road, Lixia District, Shandong, Ji'nan Applicant after: SHANDONG COMPUTER SCIENCE CENTER(NATIONAL SUPERCOMPUTER CENTER IN JINAN) Applicant after: Qilu University of Technology (Shandong Academy of Sciences) Applicant after: Shandong Shanke Intelligent Technology Co.,Ltd. Address before: 250014 No. 19, ASTRI Road, Lixia District, Shandong, Ji'nan Applicant before: SHANDONG COMPUTER SCIENCE CENTER(NATIONAL SUPERCOMPUTER CENTER IN JINAN) Country or region before: China Applicant before: Qilu University of Technology Applicant before: Shandong Shanke Intelligent Technology Co.,Ltd. |