KR102101044B1 - Audio human interactive proof based on text-to-speech and semantics - Google Patents

Audio human interactive proof based on text-to-speech and semantics

Info

Publication number
KR102101044B1
Authority
KR
South Korea
Prior art keywords
audio
speech
computer
text
task
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
KR1020147022837A
Other languages
English (en)
Korean (ko)
Other versions
KR20140134653A (ko)
Inventor
Yao Qian
Bin Benjamin Zhu
Frank Kao-Ping Soong
Original Assignee
Microsoft Technology Licensing, LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing, LLC
Publication of KR20140134653A
Application granted
Publication of KR102101044B1
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033 Voice editing, e.g. manipulating the voice of the synthesiser
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2221/00 Indexing scheme relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F2221/21 Indexing scheme relating to G06F21/00 and subgroups addressing additional information or applications relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F2221/2133 Verifying human interaction, e.g., Captcha
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
KR1020147022837A 2012-02-17 2013-02-01 Audio human interactive proof based on text-to-speech and semantics Expired - Fee Related KR102101044B1 (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/399,496 2012-02-17
US13/399,496 US10319363B2 (en) 2012-02-17 2012-02-17 Audio human interactive proof based on text-to-speech and semantics
PCT/US2013/024245 WO2013122750A1 (en) 2012-02-17 2013-02-01 Audio human interactive proof based on text-to-speech and semantics

Publications (2)

Publication Number Publication Date
KR20140134653A (ko) 2014-11-24
KR102101044B1 (ko) 2020-04-14

Family

ID=48982943

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020147022837A Expired - Fee Related KR102101044B1 (ko) 2012-02-17 2013-02-01 Audio human interactive proof based on text-to-speech and semantics

Country Status (7)

Country Link
US (1) US10319363B2 (en)
EP (1) EP2815398B1 (en)
JP (1) JP6238312B2 (en)
KR (1) KR102101044B1 (en)
CN (1) CN104115221B (en)
ES (1) ES2628901T3 (en)
WO (1) WO2013122750A1 (en)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140067394A1 (en) * 2012-08-28 2014-03-06 King Abdulaziz City For Science And Technology System and method for decoding speech
US10149077B1 (en) * 2012-10-04 2018-12-04 Amazon Technologies, Inc. Audio themes
US9338162B2 (en) * 2014-06-13 2016-05-10 International Business Machines Corporation CAPTCHA challenge incorporating obfuscated characters
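CN105047192B (zh) * 2015-05-25 2018-08-17 上海交通大学 Statistical speech synthesis method and device based on hidden Markov model
CN105185379B (zh) * 2015-06-17 2017-08-18 百度在线网络技术(北京)有限公司 Voiceprint authentication method and device
CN105161105A (zh) * 2015-07-31 2015-12-16 北京奇虎科技有限公司 Speech recognition method and device for an interactive system
CN105161098A (zh) * 2015-07-31 2015-12-16 北京奇虎科技有限公司 Speech recognition method and device for an interactive system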
US10277581B2 (en) * 2015-09-08 2019-04-30 Oath, Inc. Audio verification
US9466299B1 (en) 2015-11-18 2016-10-11 International Business Machines Corporation Speech source classification
US10347247B2 (en) * 2016-12-30 2019-07-09 Google Llc Modulation of packetized audio signals
US10332520B2 (en) 2017-02-13 2019-06-25 Qualcomm Incorporated Enhanced speech generation
CN108630193B (zh) * 2017-03-21 2020-10-02 北京嘀嘀无限科技发展有限公司 Speech recognition method and device
WO2018183290A1 (en) * 2017-03-27 2018-10-04 Orion Labs Bot group messaging using general voice libraries
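CN107609389B (zh) * 2017-08-24 2020-10-30 南京理工大学 Verification method and system based on image content correlation
JP6791825B2 (ja) * 2017-09-26 2020-11-25 株式会社日立製作所 Information processing device, dialogue processing method, and dialogue system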
WO2019077013A1 (en) 2017-10-18 2019-04-25 Soapbox Labs Ltd. METHODS AND SYSTEMS FOR PROCESSING AUDIO SIGNALS CONTAINING VOICE DATA
KR20190057687A (ko) * 2017-11-20 2019-05-29 삼성전자주식회사 Electronic device for changing a chatbot and control method thereof
US11355125B2 (en) 2018-08-06 2022-06-07 Google Llc Captcha automated assistant
CN111048062B (zh) * 2018-10-10 2022-10-04 华为技术有限公司 Speech synthesis method and device
US11423073B2 (en) 2018-11-16 2022-08-23 Microsoft Technology Licensing, Llc System and management of semantic indicators during document presentations
US11126794B2 (en) * 2019-04-11 2021-09-21 Microsoft Technology Licensing, Llc Targeted rewrites
CN110390104B (zh) * 2019-07-23 2023-05-05 思必驰科技股份有限公司 Irregular text transcription method and system for a voice dialogue platform
KR102663669B1 (ko) * 2019-11-01 2024-05-08 엘지전자 주식회사 Speech synthesis in a noisy environment
US20220035898A1 (en) * 2020-07-31 2022-02-03 Nuance Communications, Inc. Audio CAPTCHA Using Echo
FR3122508A1 (fr) * 2021-04-29 2022-11-04 Orange Characterization of a user by association of a sound with an interactive element
US20230142081A1 (en) * 2021-11-10 2023-05-11 Nuance Communications, Inc. Voice captcha
CN114299919B (zh) * 2021-12-27 2025-06-03 完美世界(北京)软件科技发展有限公司 Text-to-speech method and device, storage medium, and computer equipment
US20240363119A1 (en) * 2023-04-28 2024-10-31 Pindrop Security, Inc. Active voice liveness detection system
WO2024259486A1 (en) * 2023-06-19 2024-12-26 Macquarie University Scam call system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040254793A1 (en) * 2003-06-12 2004-12-16 Cormac Herley System and method for providing an audio challenge to distinguish a human from a computer
US20050015257A1 (en) * 2003-07-14 2005-01-20 Alexandre Bronstein Human test based on human conceptual capabilities
JP2006106741A (ja) * 2004-10-01 2006-04-20 At & T Corp Method and apparatus for preventing speech comprehension by interactive voice response systems
US20090319270A1 (en) * 2008-06-23 2009-12-24 John Nicholas Gross CAPTCHA Using Challenges Optimized for Distinguishing Between Humans and Machines

Family Cites Families (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63231496A (ja) * 1987-03-20 1988-09-27 富士通株式会社 Speech recognition response system
US6195698B1 (en) 1998-04-13 2001-02-27 Compaq Computer Corporation Method for selectively restricting access to computer systems
US7054811B2 (en) 2002-11-06 2006-05-30 Cellmax Systems Ltd. Method and system for verifying and enabling user access based on voice parameters
US7039949B2 (en) 2001-12-10 2006-05-02 Brian Ross Cartmell Method and system for blocking unwanted communications
JP2003302999A (ja) * 2002-04-11 2003-10-24 Advanced Media Inc Personal authentication system by voice
CN1246826C (zh) * 2004-06-01 2006-03-22 安徽中科大讯飞信息科技有限公司 Method for mixing and outputting background sound and text speech in a speech synthesis system
US8255223B2 (en) 2004-12-03 2012-08-28 Microsoft Corporation User authentication by combining speaker verification and reverse turing test
US7945952B1 (en) * 2005-06-30 2011-05-17 Google Inc. Methods and apparatuses for presenting challenges to tell humans and computers apart
US8145914B2 (en) 2005-12-15 2012-03-27 Microsoft Corporation Client-side CAPTCHA ceremony for user verification
US20070165811A1 (en) 2006-01-19 2007-07-19 John Reumann System and method for spam detection
US7680891B1 (en) * 2006-06-19 2010-03-16 Google Inc. CAPTCHA-based spam control for content creation systems
US8036902B1 (en) * 2006-06-21 2011-10-11 Tellme Networks, Inc. Audio human verification
US20090055193A1 (en) * 2007-02-22 2009-02-26 Pudding Holdings Israel Ltd. Method, apparatus and computer code for selectively providing access to a service in accordance with spoken content received from a user
BRPI0808289A2 (pt) * 2007-03-21 2015-06-16 Vivotext Ltd "Speech sample library for converting text to speech, and methods and instruments for generating and using the same"
CN101059830A (zh) 2007-06-01 2007-10-24 华南理工大学 Method for identifying game bot plug-ins that can incorporate game features
US8495727B2 (en) 2007-08-07 2013-07-23 Microsoft Corporation Spam reduction in real time communications by human interaction proof
US20090249477A1 (en) * 2008-03-28 2009-10-01 Yahoo! Inc. Method and system for determining whether a computer user is human
US8752141B2 (en) * 2008-06-27 2014-06-10 John Nicholas Methods for presenting and determining the efficacy of progressive pictorial and motion-based CAPTCHAs
US8793135B2 (en) * 2008-08-25 2014-07-29 At&T Intellectual Property I, L.P. System and method for auditory captchas
US8925057B1 (en) * 2009-02-06 2014-12-30 New Jersey Institute Of Technology Automated tests to distinguish computers from humans
US9342508B2 (en) * 2009-03-19 2016-05-17 Microsoft Technology Licensing, Llc Data localization templates and parsing
US8315871B2 (en) * 2009-06-04 2012-11-20 Microsoft Corporation Hidden Markov model based text to speech systems employing rope-jumping algorithm
WO2012010743A1 (en) * 2010-07-23 2012-01-26 Nokia Corporation Method and apparatus for authorizing a user or a user device based on location information
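WO2012029519A1 (ja) * 2010-08-31 2012-03-08 楽天株式会社 Response determination device, response determination method, response determination program, recording medium, and response determination system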
US8719930B2 (en) * 2010-10-12 2014-05-06 Sonus Networks, Inc. Real-time network attack detection and mitigation infrastructure
CA2819473A1 (en) * 2010-11-30 2012-06-07 Towson University Audio based human-interaction proof
JP2012163692A (ja) * 2011-02-04 2012-08-30 Nec Corp Audio signal processing system, audio signal processing method, and audio signal processing method program
US20120232907A1 (en) * 2011-03-09 2012-09-13 Christopher Liam Ivey System and Method for Delivering a Human Interactive Proof to the Visually Impaired by Means of Semantic Association of Objects
US8810368B2 (en) * 2011-03-29 2014-08-19 Nokia Corporation Method and apparatus for providing biometric authentication using distributed computations
US8904517B2 (en) * 2011-06-28 2014-12-02 International Business Machines Corporation System and method for contexually interpreting image sequences
US9146917B2 (en) * 2011-07-15 2015-09-29 International Business Machines Corporation Validating that a user is human

Also Published As

Publication number Publication date
CN104115221B (zh) 2017-09-01
EP2815398B1 (en) 2017-03-29
WO2013122750A1 (en) 2013-08-22
JP6238312B2 (ja) 2017-11-29
US10319363B2 (en) 2019-06-11
CN104115221A (zh) 2014-10-22
ES2628901T3 (es) 2017-08-04
EP2815398A1 (en) 2014-12-24
US20130218566A1 (en) 2013-08-22
KR20140134653A (ko) 2014-11-24
JP2015510147A (ja) 2015-04-02
EP2815398A4 (en) 2015-05-06

Similar Documents

Publication Publication Date Title
KR102101044B1 (ko) Audio human interactive proof based on text-to-speech and semantics
Chen et al. Automated scoring of nonnative speech using the SpeechRater SM v. 5.0 engine
US7289950B2 (en) Extended finite state grammar for speech recognition systems
Athanaselis et al. ASR for emotional speech: clarifying the issues and enhancing performance
US8036894B2 (en) Multi-unit approach to text-to-speech synthesis
CN110782880B (zh) Training method and device for a prosody generation model
Watts Unsupervised learning for text-to-speech synthesis
US9437195B2 (en) Biometric password security
US20110213610A1 (en) Processor Implemented Systems and Methods for Measuring Syntactic Complexity on Spontaneous Non-Native Speech Data by Using Structural Event Detection
US20190206386A1 (en) Method and system for text-to-speech synthesis
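CN105280177A (zh) Speech synthesis dictionary creation device, speech synthesizer, and speech synthesis dictionary creation method
CN110782918B (zh) Speech prosody evaluation method and device based on artificial intelligence
JP6810580B2 (ja) Language model learning device and program therefor
JPWO2016103652A1 (ja) Speech processing device, speech processing method, and program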
US12118898B2 (en) Voice visualization system for english learning, and method therefor
US11250837B2 (en) Speech synthesis system, method and non-transitory computer readable medium with language option selection and acoustic models
HaCohen-Kerner et al. Language and gender classification of speech files using supervised machine learning methods
Dielen Improving the automatic speech recognition model whisper with voice activity detection
Motyka et al. Information technology of transcribing Ukrainian-language content based on deep learning
Kirkedal Danish stød and automatic speech recognition
Carson-Berndsen Multilingual time maps: portable phonotactic models for speech technology
Kumar et al. Formalizing expert knowledge for developing accurate speech recognizers.
Sayed et al. Convolutional Neural Networks to Facilitate the Continuous Recognition of Arabic Speech with Independent Speakers
Drašković Integration of Ai Tools into an Ai-Driven Software System to Make Learning Programming Easier
Zhang et al. AcousticScope: Understanding Biases in Voice Interaction via Automated Acoustic Testing

Legal Events

Date Code Title Description
PA0105 International application

St.27 status event code: A-0-1-A10-A15-nap-PA0105

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

N231 Notification of change of applicant
PN2301 Change of applicant

St.27 status event code: A-3-3-R10-R13-asn-PN2301

St.27 status event code: A-3-3-R10-R11-asn-PN2301

AMND Amendment
P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

AMND Amendment
P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

E601 Decision to refuse application
PE0601 Decision on rejection of patent

St.27 status event code: N-2-6-B10-B15-exm-PE0601

T11-X000 Administrative time limit extension requested

St.27 status event code: U-3-3-T10-T11-oth-X000

T13-X000 Administrative time limit extension granted

St.27 status event code: U-3-3-T10-T13-oth-X000

AMND Amendment
E13-X000 Pre-grant limitation requested

St.27 status event code: A-2-3-E10-E13-lim-X000

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PX0901 Re-examination

St.27 status event code: A-2-3-E10-E12-rex-PX0901

PX0701 Decision of registration after re-examination

St.27 status event code: A-3-4-F10-F13-rex-PX0701

X701 Decision to grant (after re-examination)
GRNT Written decision to grant
PR0701 Registration of establishment

St.27 status event code: A-2-4-F10-F11-exm-PR0701

PR1002 Payment of registration fee

St.27 status event code: A-2-2-U10-U12-oth-PR1002

Fee payment year number: 1

PG1601 Publication of registration

St.27 status event code: A-4-4-Q10-Q13-nap-PG1601

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 4

PC1903 Unpaid annual fee

St.27 status event code: A-4-4-U10-U13-oth-PC1903

Not in force date: 20240409

Payment event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE

PC1903 Unpaid annual fee

St.27 status event code: N-4-6-H10-H13-oth-PC1903

Ip right cessation event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE

Not in force date: 20240409