CN112988733B - Method and device for improving and enhancing data quality - Google Patents
- Publication number
- CN112988733B (application number CN202110410090.1A)
- Authority
- CN
- China
- Prior art keywords
- data
- sample data
- trained
- training
- label
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computing Systems (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Evolutionary Biology (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Databases & Information Systems (AREA)
- Quality & Reliability (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Medical Treatment And Welfare Office Work (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
Description
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110410090.1A CN112988733B (zh) | 2021-04-16 | 2021-04-16 | Method and device for improving and enhancing data quality |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110410090.1A CN112988733B (zh) | 2021-04-16 | 2021-04-16 | Method and device for improving and enhancing data quality |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112988733A (zh) | 2021-06-18 |
CN112988733B (zh) | 2021-08-27 |
Family
ID=76340747
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110410090.1A Active CN112988733B (zh) | Method and device for improving and enhancing data quality | 2021-04-16 | 2021-04-16 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112988733B (zh) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
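CN1908960A (zh) * | 2005-08-02 | 2007-02-07 | Institute of Computing Technology, Chinese Academy of Sciences | Multi-classifier combined face recognition method based on feature grouping |
CN109446369A (zh) * | 2018-09-28 | 2019-03-08 | Wuhan Zhonghaiting Data Technology Co., Ltd. | Interaction method and system for semi-automatic image annotation |
CN109784391A (zh) * | 2019-01-04 | 2019-05-21 | Hangzhou Bizhi Technology Co., Ltd. | Multi-model-based sample labeling method and device |
CN110457675A (zh) * | 2019-06-26 | 2019-11-15 | Ping An Technology (Shenzhen) Co., Ltd. | Prediction model training method and device, storage medium, and computer equipment |
CN110826332A (zh) * | 2019-11-02 | 2020-02-21 | Shanxi University | GP-based automatic named entity recognition method for traditional Chinese medicine patents |
US20200143248A1 (en) * | 2017-07-12 | 2020-05-07 | Tencent Technology (Shenzhen) Company Limited | Machine learning model training method and device, and expression image classification method and device |
CN111652256A (zh) * | 2019-03-18 | 2020-09-11 | Shanghai Laisi Information Technology Co., Ltd. | Method and system for acquiring multidimensional data |
CN112560912A (zh) * | 2020-12-03 | 2021-03-26 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Classification model training method and device, electronic equipment, and storage medium |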
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
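CN105426826A (zh) * | 2015-11-09 | 2016-03-23 | Zhang Jing | Method for improving the quality of crowdsourced annotation data based on label noise correction |
CN107153822A (zh) * | 2017-05-19 | 2017-09-12 | Beihang University | Semi-automatic fine image annotation method based on deep learning |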
- 2021-04-16: application CN202110410090.1A filed in CN; patent CN112988733B (zh); status: Active
Also Published As
Publication number | Publication date |
---|---|
CN112988733A (zh) | 2021-06-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
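WO2021212612A1 (zh) | | Intelligent text error correction method and device, electronic equipment, and readable storage medium |
EP3486838A1 (en) | | System and method for semi-supervised conditional generative modeling using adversarial networks |
CN110795938B (zh) | | Text sequence word segmentation method, device, and storage medium |
CN108090043B (zh) | | Artificial-intelligence-based error correction report processing method, device, and readable medium |
CN110222330B (zh) | | Semantic recognition method and device, storage medium, and computer equipment |
CN112988963B (zh) | | User intention prediction method, device, equipment, and medium based on multi-process nodes |
CN113704429A (zh) | | Intention recognition method, device, equipment, and medium based on semi-supervised learning |
CN110390110B (zh) | | Method and device for pre-training to generate sentence vectors for semantic matching |
CN110543637A (zh) | | Chinese word segmentation method and device |
CN110119353A (zh) | | Test data generation method and device, controller, and medium |
CN113780365B (zh) | | Sample generation method and device |
US20090182757A1 (en) | | Method for automatically computing proficiency of programming skills |
CN109800776A (zh) | | Material annotation method, device, terminal, and computer-readable storage medium |
CN114780701A (zh) | | Automatic question-answer matching method and device, computer equipment, and storage medium |
CN114610855A (zh) | | Dialogue reply generation method and device, electronic equipment, and storage medium |
CN112988733B (zh) | | Method and device for improving and enhancing data quality |
CN110489727A (zh) | | Person name recognition method and related device |
CN113407676A (zh) | | Question grading method and system, electronic equipment, and computer-readable medium |
CN112861519A (zh) | | Medical text error correction method, device, and storage medium |
CN110032714B (zh) | | Corpus annotation feedback method and device |
CN115169330B (zh) | | Chinese text error correction and verification method, device, equipment, and storage medium |
CN113515591B (zh) | | Method and device for identifying harmful information in text, electronic equipment, and storage medium |
CN108597602A (zh) | | Label error correction method for dermatological data |
CN111382750A (zh) | | Graphic CAPTCHA recognition method and device |
CN112364640A (zh) | | Entity noun linking method and device, computer equipment, and storage medium |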
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
CB03 | Change of inventor or designer information | Inventors after: Liu Bangchang, Kong Fei, Chang Dejie, Liu Chaozhen, Wang Hai, Zhao Hongwen, Gu Shufeng, Zhao Jin, Luo Xiaobin. Inventors before: Liu Bangchang, Kong Fei, Chang Dejie, Liu Chaozhen, Wang Hai, Zhao Hongwen, Gu Shufeng, Zhao Jin, Luo Xiaobin. |