CN111226274B - 自动阻止音频流中包含的敏感数据 - Google Patents

自动阻止音频流中包含的敏感数据 Download PDF

Info

Publication number
CN111226274B
CN111226274B CN201880067472.5A CN201880067472A CN111226274B CN 111226274 B CN111226274 B CN 111226274B CN 201880067472 A CN201880067472 A CN 201880067472A CN 111226274 B CN111226274 B CN 111226274B
Authority
CN
China
Prior art keywords
sensitive
information
text
audio stream
computer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201880067472.5A
Other languages
English (en)
Chinese (zh)
Other versions
CN111226274A (zh
Inventor
J.A.施密特
A.D.布雷厄姆
J.尼古莱
J.桑托斯沃索
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CN111226274A publication Critical patent/CN111226274A/zh
Application granted granted Critical
Publication of CN111226274B publication Critical patent/CN111226274B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • H04M3/5166Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing in combination with interactive voice response systems or voice portals, e.g. as front-ends
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/60Aspects of automatic or semi-automatic exchanges related to security aspects in telephonic communication systems
    • H04M2203/6009Personal information, e.g. profiles or personal directories being only provided to authorised persons
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/436Arrangements for screening incoming calls, i.e. evaluating the characteristics of a call before deciding whether to answer it
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Probability & Statistics with Applications (AREA)
  • Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Telephonic Communication Services (AREA)
  • Document Processing Apparatus (AREA)
CN201880067472.5A 2017-11-28 2018-11-26 自动阻止音频流中包含的敏感数据 Active CN111226274B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US15/824,051 US10453447B2 (en) 2017-11-28 2017-11-28 Filtering data in an audio stream
US15/824,051 2017-11-28
PCT/IB2018/059300 WO2019106517A1 (en) 2017-11-28 2018-11-26 Automatic blocking of sensitive data contained in an audio stream

Publications (2)

Publication Number Publication Date
CN111226274A CN111226274A (zh) 2020-06-02
CN111226274B true CN111226274B (zh) 2023-09-22

Family

ID=66633386

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201880067472.5A Active CN111226274B (zh) 2017-11-28 2018-11-26 自动阻止音频流中包含的敏感数据

Country Status (6)

Country Link
US (2) US10453447B2 (https=)
JP (1) JP7255811B2 (https=)
CN (1) CN111226274B (https=)
DE (1) DE112018005421B4 (https=)
GB (1) GB2583281B (https=)
WO (1) WO2019106517A1 (https=)

Families Citing this family (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10861476B2 (en) 2017-05-24 2020-12-08 Modulate, Inc. System and method for building a voice database
WO2019010250A1 (en) * 2017-07-05 2019-01-10 Interactions Llc REAL-TIME CONFIDENTIALITY FILTER
US10861463B2 (en) * 2018-01-09 2020-12-08 Sennheiser Electronic Gmbh & Co. Kg Method for speech processing and speech processing device
US11822885B1 (en) * 2019-06-03 2023-11-21 Amazon Technologies, Inc. Contextual natural language censoring
US11706337B1 (en) * 2019-08-29 2023-07-18 United Services Automobile Association (Usaa) Artificial intelligence assistant for customer service representatives
CN112560472B (zh) * 2019-09-26 2023-07-11 腾讯科技(深圳)有限公司 一种识别敏感信息的方法及装置
US12093414B1 (en) * 2019-12-09 2024-09-17 Amazon Technologies, Inc. Efficient detection of in-memory data accesses and context information
CN111105788B (zh) * 2019-12-20 2023-03-24 北京三快在线科技有限公司 敏感词分数检测方法、装置、电子设备及存储介质
CN111753539B (zh) * 2020-06-30 2023-12-26 北京搜狗科技发展有限公司 一种识别敏感文本的方法及装置
US11349983B2 (en) 2020-07-06 2022-05-31 At&T Intellectual Property I, L.P. Protecting user data during audio interactions
CN111883128B (zh) * 2020-07-31 2024-08-13 中国工商银行股份有限公司 语音处理方法及系统、语音处理装置
CN112183079A (zh) * 2020-09-07 2021-01-05 绿瘦健康产业集团有限公司 一种语音监测方法、装置、介质及终端设备
CN112333321A (zh) * 2020-09-24 2021-02-05 咪咕文化科技有限公司 语音检测方法、装置、电子设备及存储介质
WO2022076923A1 (en) 2020-10-08 2022-04-14 Modulate, Inc. Multi-stage adaptive system for content moderation
RO135860A2 (ro) * 2020-12-02 2022-06-30 Repsmate Software S.R.L. Sistem şi metodă pentru anonimizarea datelor de identificare a persoanelor aflate într-o convorbire audio/video
CN112559776A (zh) * 2020-12-21 2021-03-26 绿瘦健康产业集团有限公司 一种敏感信息的定位方法及系统
US11900927B2 (en) 2020-12-23 2024-02-13 Optum Technology, Inc. Cybersecurity for sensitive-information utterances in interactive voice sessions using risk profiles
US11854553B2 (en) * 2020-12-23 2023-12-26 Optum Technology, Inc. Cybersecurity for sensitive-information utterances in interactive voice sessions
CN112634881B (zh) * 2020-12-30 2023-08-11 广州博士信息技术研究院有限公司 一种基于科技成果数据库的语音智能识别方法及系统
CN112885371B (zh) * 2021-01-13 2021-11-23 北京爱数智慧科技有限公司 音频脱敏的方法、装置、电子设备以及可读存储介质
CN116868268A (zh) * 2021-02-15 2023-10-10 皇家飞利浦有限公司 用于处理语音音频以分离个人健康信息的方法和系统
US12167210B2 (en) * 2021-02-25 2024-12-10 Carnegie Mellon University Enabling environmental sound recognition in intelligent vehicles
US20220399009A1 (en) * 2021-06-09 2022-12-15 International Business Machines Corporation Protecting sensitive information in conversational exchanges
CN113851132B (zh) * 2021-09-22 2024-12-06 北京兆维电子(集团)有限责任公司 用于语音识别输入的敏感词过滤的方法、装置、控制板、设备
CN113840247A (zh) * 2021-10-12 2021-12-24 深圳追一科技有限公司 音频通信方法、装置、系统、电子设备及存储介质
CN114007131B (zh) * 2021-10-29 2023-04-25 平安科技(深圳)有限公司 视频监控方法、装置及相关设备
CN114238614B (zh) * 2021-11-17 2024-12-20 广州市百果园网络科技有限公司 一种违规变形文字检测方法、系统、设备及存储介质
CN114420129B (zh) * 2022-01-20 2026-03-31 上海喜马拉雅科技有限公司 一种音频检测方法、装置、电子设备及计算机可读存储介质
US12189817B2 (en) * 2022-02-14 2025-01-07 Twilio Inc. Personal information redaction and voice deidentification
US12003575B2 (en) 2022-02-22 2024-06-04 Optum, Inc. Routing of sensitive-information utterances through secure channels in interactive voice sessions
US12355781B2 (en) * 2022-04-01 2025-07-08 Comcast Cable Communications, Llc Method of authenticating a caller
WO2023196624A1 (en) * 2022-04-08 2023-10-12 Modulate, Inc. Predictive audio redaction for realtime communication
CN114786035A (zh) * 2022-05-25 2022-07-22 上海氪信信息技术有限公司 直播场景的合规质检和互动问答系统及方法
WO2023235517A1 (en) 2022-06-01 2023-12-07 Modulate, Inc. Scoring system for content moderation
CN115081440B (zh) * 2022-07-22 2022-11-01 湖南湘生网络信息有限公司 文本中变种词的识别及提取原敏感词的方法、装置及设备
US12014224B2 (en) 2022-08-31 2024-06-18 Bank Of America Corporation System and method for processing of event data real time in an electronic communication via an artificial intelligence engine
US12027177B2 (en) * 2022-09-08 2024-07-02 Roblox Corporation Artificial latency for moderating voice communication
CN116072123B (zh) * 2023-03-06 2023-06-23 南昌航天广信科技有限责任公司 广播信息播放方法、装置、可读存储介质及电子设备
US20240379107A1 (en) * 2023-05-09 2024-11-14 Sony Interactive Entertainment Inc. Real-time ai screening and auto-moderation of audio comments in a livestream
US12536367B2 (en) 2023-05-30 2026-01-27 Sony Interactive Entertainment Inc. Using artificial intelligence to generate customized summaries of conversations
CN116913327A (zh) * 2023-08-01 2023-10-20 北京声智科技有限公司 音频播放方法、装置、音频播放设备以及电子设备
CN117273054B (zh) * 2023-09-28 2024-06-25 江苏八点八智能科技有限公司 一种应用不同场景的虚拟人交互方法与系统
CN120434414B (zh) * 2025-07-08 2025-09-19 湖南芒果智媒科技发展有限公司 一种音频直播内容实时管控方法及系统

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006178203A (ja) * 2004-12-22 2006-07-06 Nec Corp 音声情報加工システム、音声情報加工方法及び音声情報加工プログラム
JP2009501942A (ja) * 2005-07-13 2009-01-22 ハイパークオリティー,インク. 音声認識技術を利用した録音した音声内の選択的セキュリティマスキング
JP2015055653A (ja) * 2013-09-10 2015-03-23 セイコーエプソン株式会社 音声認識装置及び方法、並びに、電子機器
CN104679729A (zh) * 2015-02-13 2015-06-03 广州市讯飞樽鸿信息技术有限公司 录音留言有效性处理方法及系统
CN105335483A (zh) * 2015-10-14 2016-02-17 广州市畅运信息科技有限公司 一种文本敏感词过滤系统和方法
CN105843950A (zh) * 2016-04-12 2016-08-10 乐视控股(北京)有限公司 敏感词过滤方法及装置
CN106504744A (zh) * 2016-10-26 2017-03-15 科大讯飞股份有限公司 一种语音处理方法及装置
US9787835B1 (en) * 2013-04-11 2017-10-10 Noble Systems Corporation Protecting sensitive information provided by a party to a contact center

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8473451B1 (en) 2004-07-30 2013-06-25 At&T Intellectual Property I, L.P. Preserving privacy in natural language databases
US7650628B2 (en) * 2004-10-21 2010-01-19 Escription, Inc. Transcription data security
US7502741B2 (en) * 2005-02-23 2009-03-10 Multimodal Technologies, Inc. Audio signal de-identification
US8433915B2 (en) * 2006-06-28 2013-04-30 Intellisist, Inc. Selective security masking within recorded speech
US20080208579A1 (en) 2007-02-27 2008-08-28 Verint Systems Ltd. Session recording and playback with selective information masking
US20080221882A1 (en) * 2007-03-06 2008-09-11 Bundock Donald S System for excluding unwanted data from a voice recording
US8140012B1 (en) 2007-10-25 2012-03-20 At&T Mobility Ii Llc Bluetooth security profile
JP5688279B2 (ja) * 2010-12-08 2015-03-25 ニュアンス コミュニケーションズ,インコーポレイテッド 秘匿情報をフィルタリングする情報処理装置、方法およびプログラム
WO2014028524A1 (en) 2012-08-15 2014-02-20 Visa International Service Association Searchable encrypted data
US9131369B2 (en) 2013-01-24 2015-09-08 Nuance Communications, Inc. Protection of private information in a client/server automatic speech recognition system
US9437207B2 (en) 2013-03-12 2016-09-06 Pullstring, Inc. Feature extraction for anonymized speech recognition
US9514741B2 (en) 2013-03-13 2016-12-06 Nuance Communications, Inc. Data shredding for speech recognition acoustic model training under data retention restrictions
US9407758B1 (en) * 2013-04-11 2016-08-02 Noble Systems Corporation Using a speech analytics system to control a secure audio bridge during a payment transaction
WO2015105994A1 (en) 2014-01-08 2015-07-16 Callminer, Inc. Real-time conversational analytics facility
US10754978B2 (en) * 2016-07-29 2020-08-25 Intellisist Inc. Computer-implemented system and method for storing and retrieving sensitive information
CN106528731A (zh) 2016-10-27 2017-03-22 新疆大学 一种敏感词过滤方法及系统
US10762221B2 (en) * 2016-11-14 2020-09-01 Paymentus Corporation Method and apparatus for multi-channel secure communication and data transfer
GB2559130B (en) * 2017-01-25 2020-05-27 Syntec Holdings Ltd Secure data exchange by voice in telephone calls
WO2019010250A1 (en) * 2017-07-05 2019-01-10 Interactions Llc REAL-TIME CONFIDENTIALITY FILTER

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006178203A (ja) * 2004-12-22 2006-07-06 Nec Corp 音声情報加工システム、音声情報加工方法及び音声情報加工プログラム
JP2009501942A (ja) * 2005-07-13 2009-01-22 ハイパークオリティー,インク. 音声認識技術を利用した録音した音声内の選択的セキュリティマスキング
US9787835B1 (en) * 2013-04-11 2017-10-10 Noble Systems Corporation Protecting sensitive information provided by a party to a contact center
JP2015055653A (ja) * 2013-09-10 2015-03-23 セイコーエプソン株式会社 音声認識装置及び方法、並びに、電子機器
CN104679729A (zh) * 2015-02-13 2015-06-03 广州市讯飞樽鸿信息技术有限公司 录音留言有效性处理方法及系统
CN105335483A (zh) * 2015-10-14 2016-02-17 广州市畅运信息科技有限公司 一种文本敏感词过滤系统和方法
CN105843950A (zh) * 2016-04-12 2016-08-10 乐视控股(北京)有限公司 敏感词过滤方法及装置
CN106504744A (zh) * 2016-10-26 2017-03-15 科大讯飞股份有限公司 一种语音处理方法及装置

Also Published As

Publication number Publication date
US11024295B2 (en) 2021-06-01
GB2583281B (en) 2022-09-21
JP7255811B2 (ja) 2023-04-11
JP2021505032A (ja) 2021-02-15
WO2019106517A1 (en) 2019-06-06
DE112018005421B4 (de) 2022-07-21
US20190164539A1 (en) 2019-05-30
CN111226274A (zh) 2020-06-02
US20200005773A1 (en) 2020-01-02
GB202009699D0 (en) 2020-08-12
GB2583281A (en) 2020-10-21
DE112018005421T5 (de) 2020-07-16
US10453447B2 (en) 2019-10-22

Similar Documents

Publication Publication Date Title
CN111226274B (zh) 自动阻止音频流中包含的敏感数据
US11842728B2 (en) Training neural networks to predict acoustic sequences using observed prosody info
US12136414B2 (en) Integrating dialog history into end-to-end spoken language understanding systems
JP7805070B2 (ja) 音声認識トランスクリプションの改善
US12148443B2 (en) Speaker-specific voice amplification
US10062385B2 (en) Automatic speech-to-text engine selection
US10755719B2 (en) Speaker identification assisted by categorical cues
US10089978B2 (en) Detecting customers with low speech recognition accuracy by investigating consistency of conversation in call-center
US9959887B2 (en) Multi-pass speech activity detection strategy to improve automatic speech recognition
US11605385B2 (en) Project issue tracking via automated voice recognition
US9972308B1 (en) Splitting utterances for quick responses
GB2604675A (en) Improving speech recognition transcriptions
US10991370B2 (en) Speech to text conversion engine for non-standard speech
US20220188525A1 (en) Dynamic, real-time collaboration enhancement
US20220319494A1 (en) End to end spoken language understanding model
US20230237987A1 (en) Data sorting for generating rnn-t models
JP2022055347A (ja) コンピュータ実装方法、コンピュータシステム及びコンピュータプログラム(スピーチ認識トランスクリプションの改善)
US20200286464A1 (en) Feature and feature variant reconstruction for recurrent model accuracy improvement in speech recognition
US20180122404A1 (en) Determining a behavior of a user utilizing audio data
US12112767B2 (en) Acoustic data augmentation with mixed normalization factors
US12417759B2 (en) Speech recognition using cadence patterns

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant