CN111652267B - 对抗样本的生成方法、装置、电子设备及存储介质 - Google Patents

对抗样本的生成方法、装置、电子设备及存储介质 Download PDF

Info

Publication number
CN111652267B
CN111652267B CN202010317965.9A CN202010317965A CN111652267B CN 111652267 B CN111652267 B CN 111652267B CN 202010317965 A CN202010317965 A CN 202010317965A CN 111652267 B CN111652267 B CN 111652267B
Authority
CN
China
Prior art keywords
particle
sample
word
original text
optimal solution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010317965.9A
Other languages
English (en)
Chinese (zh)
Other versions
CN111652267A (zh
Inventor
岂凡超
臧原
刘知远
孙茂松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN202010317965.9A priority Critical patent/CN111652267B/zh
Priority to PCT/CN2020/103219 priority patent/WO2021212675A1/fr
Publication of CN111652267A publication Critical patent/CN111652267A/zh
Application granted granted Critical
Publication of CN111652267B publication Critical patent/CN111652267B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
CN202010317965.9A 2020-04-21 2020-04-21 对抗样本的生成方法、装置、电子设备及存储介质 Active CN111652267B (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202010317965.9A CN111652267B (zh) 2020-04-21 2020-04-21 对抗样本的生成方法、装置、电子设备及存储介质
PCT/CN2020/103219 WO2021212675A1 (fr) 2020-04-21 2020-07-21 Procédé et appareil permettant de générer un échantillon antagoniste, dispositif électronique et support de stockage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010317965.9A CN111652267B (zh) 2020-04-21 2020-04-21 对抗样本的生成方法、装置、电子设备及存储介质

Publications (2)

Publication Number Publication Date
CN111652267A CN111652267A (zh) 2020-09-11
CN111652267B true CN111652267B (zh) 2023-01-31

Family

ID=72346469

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010317965.9A Active CN111652267B (zh) 2020-04-21 2020-04-21 对抗样本的生成方法、装置、电子设备及存储介质

Country Status (2)

Country Link
CN (1) CN111652267B (fr)
WO (1) WO2021212675A1 (fr)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112216273B (zh) * 2020-10-30 2024-04-16 东南数字经济发展研究院 一种针对语音关键词分类网络的对抗样本攻击方法
CN112380845B (zh) * 2021-01-15 2021-04-09 鹏城实验室 句子噪声设计方法、设备及计算机存储介质
CN113723506B (zh) * 2021-08-30 2022-08-05 南京星环智能科技有限公司 一种对抗样本的生成方法、设备及存储介质
CN113806490B (zh) * 2021-09-27 2023-06-13 中国人民解放军国防科技大学 一种基于bert采样的文本通用触发器生成系统和方法
CN113935481B (zh) * 2021-10-12 2023-04-18 中国人民解放军国防科技大学 针对自然语言处理模型在有限次数条件下的对抗测试方法
CN113642678B (zh) * 2021-10-12 2022-01-07 南京山猫齐动信息技术有限公司 一种对抗消息样本生成的方法、装置及存储介质
CN113946687B (zh) * 2021-10-20 2022-09-23 中国人民解放军国防科技大学 一种标签一致的文本后门攻击方法
CN114169443B (zh) * 2021-12-08 2024-02-06 西安交通大学 词级文本对抗样本检测方法
CN114238661B (zh) * 2021-12-22 2024-03-19 西安交通大学 一种基于可解释模型的文本歧视性样本检测生成系统与方法
CN114444476B (zh) * 2022-01-25 2024-03-01 腾讯科技(深圳)有限公司 信息处理方法、装置和计算机可读存储介质
CN115034318B (zh) * 2022-06-17 2024-05-17 中国平安人寿保险股份有限公司 标题判别模型的生成方法和装置、设备、介质
CN115333869B (zh) * 2022-10-14 2022-12-13 四川大学 一种分布式网络对抗攻击自训练学习方法
CN116151392B (zh) * 2023-02-28 2024-01-09 北京百度网讯科技有限公司 训练样本生成方法、训练方法、推荐方法以及装置
CN117808095B (zh) * 2024-02-26 2024-05-28 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) 一种对抗攻击样本生成方法和装置、电子设备

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110619292A (zh) * 2019-08-31 2019-12-27 浙江工业大学 基于二进制粒子群通道优化的对抗防御方法
CN110767216A (zh) * 2019-09-10 2020-02-07 浙江工业大学 一种基于pso算法的语音识别攻击防御方法
CN110930182A (zh) * 2019-11-08 2020-03-27 中国农业大学 基于改进粒子群优化算法的客户分类方法及装置

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11468234B2 (en) * 2017-06-26 2022-10-11 International Business Machines Corporation Identifying linguistic replacements to improve textual message effectiveness
CN109214327B (zh) * 2018-08-29 2021-08-03 浙江工业大学 一种基于pso的反人脸识别方法
CN109599109B (zh) * 2018-12-26 2022-03-25 浙江大学 针对白盒场景的对抗音频生成方法及系统
CN109887496A (zh) * 2019-01-22 2019-06-14 浙江大学 一种黑盒场景下的定向对抗音频生成方法及系统

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110619292A (zh) * 2019-08-31 2019-12-27 浙江工业大学 基于二进制粒子群通道优化的对抗防御方法
CN110767216A (zh) * 2019-09-10 2020-02-07 浙江工业大学 一种基于pso算法的语音识别攻击防御方法
CN110930182A (zh) * 2019-11-08 2020-03-27 中国农业大学 基于改进粒子群优化算法的客户分类方法及装置

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Modeling Semantic Compositionality with Sememe Knowledge;Fanchao Qi等;《Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics》;20190802;第5706–5715页 *
Open the Boxes of Words: Incorporating Sememes into Textual Adversarial Attack;Yuan Zang等;《arXiv:1910.12196v1[cs.CL]》;20191027;第1-5页 *
Textual Adversarial Attack as Combinatorial Optimization;Yuan Zang等;《arXiv:1910.12196v2[cs.CL]》;20191110;第1-6页 *
粒子群优化算法及差分进行算法研究;张庆科;《中国优秀博硕士学位论文全文数据库(博士)信息科技辑》;20170815;第58-61页 *

Also Published As

Publication number Publication date
WO2021212675A1 (fr) 2021-10-28
CN111652267A (zh) 2020-09-11

Similar Documents

Publication Publication Date Title
CN111652267B (zh) 对抗样本的生成方法、装置、电子设备及存储介质
US11734329B2 (en) System and method for text categorization and sentiment analysis
US11144581B2 (en) Verifying and correcting training data for text classification
US10262272B2 (en) Active machine learning
US9633002B1 (en) Systems and methods for coreference resolution using selective feature activation
KR20220025026A (ko) 자연어 이해(nlu) 프레임워크를 이용하여 의미 검색을 수행하기 위한 시스템 및 방법
US20190377793A1 (en) Method and apparatus for establishing a hierarchical intent system
US20120262461A1 (en) System and Method for the Normalization of Text
CN109948140B (zh) 一种词向量嵌入方法及装置
CN111523314B (zh) 模型对抗训练、命名实体识别方法及装置
CN112906392A (zh) 一种文本增强方法、文本分类方法及相关装置
CN111859964A (zh) 一种语句中命名实体的识别方法及装置
CN109791570B (zh) 高效且精确的命名实体识别方法和装置
CN112256842A (zh) 用于文本聚类的方法、电子设备和存储介质
CN114995903B (zh) 一种基于预训练语言模型的类别标签识别方法及装置
CN114756675A (zh) 文本分类方法、相关设备及可读存储介质
CN111680291A (zh) 一种对抗样本生成方法、装置、电子设备及存储介质
CN115062621A (zh) 标签提取方法、装置、电子设备和存储介质
CN113934848A (zh) 一种数据分类方法、装置和电子设备
WO2024051196A1 (fr) Procédé et appareil de détection de code malveillant, dispositif électronique et support de stockage
CN115858776B (zh) 一种变体文本分类识别方法、系统、存储介质和电子设备
CN115035890B (zh) 语音识别模型的训练方法、装置、电子设备及存储介质
CN116578700A (zh) 日志分类方法、日志分类装置、设备及介质
CN115906797A (zh) 文本实体对齐方法、装置、设备及介质
CN115909376A (zh) 文本识别方法、文本识别模型训练方法、装置及存储介质

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant