KR102101044B1 - Audio human interactive proof based on text-to-speech and semantics - Google Patents
Audio human interactive proof based on text-to-speech and semantics
- Publication number
- KR102101044B1
- Authority
- KR
- South Korea
- Prior art keywords
- audio
- speech
- computer
- text
- task
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2221/00—Indexing scheme relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F2221/21—Indexing scheme relating to G06F21/00 and subgroups addressing additional information or applications relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F2221/2133—Verifying human interaction, e.g., Captcha
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/399,496 | 2012-02-17 | ||
| US13/399,496 US10319363B2 (en) | 2012-02-17 | 2012-02-17 | Audio human interactive proof based on text-to-speech and semantics |
| PCT/US2013/024245 WO2013122750A1 (en) | 2012-02-17 | 2013-02-01 | Audio human interactive proof based on text-to-speech and semantics |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20140134653A (ko) | 2014-11-24 |
| KR102101044B1 (ko) | 2020-04-14 |
Family
ID=48982943
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020147022837A, Expired - Fee Related, KR102101044B1 (ko) | 2012-02-17 | 2013-02-01 | Audio human interactive proof based on text-to-speech and semantics |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US10319363B2 (en) |
| EP (1) | EP2815398B1 (en) |
| JP (1) | JP6238312B2 (en) |
| KR (1) | KR102101044B1 (en) |
| CN (1) | CN104115221B (en) |
| ES (1) | ES2628901T3 (en) |
| WO (1) | WO2013122750A1 (en) |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20140067394A1 (en) * | 2012-08-28 | 2014-03-06 | King Abdulaziz City For Science And Technology | System and method for decoding speech |
| US10149077B1 (en) * | 2012-10-04 | 2018-12-04 | Amazon Technologies, Inc. | Audio themes |
| US9338162B2 (en) * | 2014-06-13 | 2016-05-10 | International Business Machines Corporation | CAPTCHA challenge incorporating obfuscated characters |
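| CN105047192B (zh) * | 2015-05-25 | 2018-08-17 | 上海交通大学 | Statistical speech synthesis method and device based on hidden Markov model |
| CN105185379B (zh) * | 2015-06-17 | 2017-08-18 | 百度在线网络技术(北京)有限公司 | Voiceprint authentication method and device |
| CN105161105A (zh) * | 2015-07-31 | 2015-12-16 | 北京奇虎科技有限公司 | Speech recognition method and device for an interactive system |
| CN105161098A (zh) * | 2015-07-31 | 2015-12-16 | 北京奇虎科技有限公司 | Speech recognition method and device for an interactive system |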
| US10277581B2 (en) * | 2015-09-08 | 2019-04-30 | Oath, Inc. | Audio verification |
| US9466299B1 (en) | 2015-11-18 | 2016-10-11 | International Business Machines Corporation | Speech source classification |
| US10347247B2 (en) * | 2016-12-30 | 2019-07-09 | Google Llc | Modulation of packetized audio signals |
| US10332520B2 (en) | 2017-02-13 | 2019-06-25 | Qualcomm Incorporated | Enhanced speech generation |
| CN108630193B (zh) * | 2017-03-21 | 2020-10-02 | 北京嘀嘀无限科技发展有限公司 | Speech recognition method and device |
| WO2018183290A1 (en) * | 2017-03-27 | 2018-10-04 | Orion Labs | Bot group messaging using general voice libraries |
| CN107609389B (zh) * | 2017-08-24 | 2020-10-30 | 南京理工大学 | Verification method and system based on image content correlation |
| JP6791825B2 (ja) * | 2017-09-26 | 2020-11-25 | 株式会社日立製作所 | Information processing device, dialogue processing method, and dialogue system |
| WO2019077013A1 (en) | 2017-10-18 | 2019-04-25 | Soapbox Labs Ltd. | Methods and systems for processing audio signals containing voice data |
| KR20190057687A (ko) * | 2017-11-20 | 2019-05-29 | 삼성전자주식회사 | Electronic device for changing a chatbot and control method thereof |
| US11355125B2 (en) | 2018-08-06 | 2022-06-07 | Google Llc | Captcha automated assistant |
| CN111048062B (zh) * | 2018-10-10 | 2022-10-04 | 华为技术有限公司 | Speech synthesis method and device |
| US11423073B2 (en) | 2018-11-16 | 2022-08-23 | Microsoft Technology Licensing, Llc | System and management of semantic indicators during document presentations |
| US11126794B2 (en) * | 2019-04-11 | 2021-09-21 | Microsoft Technology Licensing, Llc | Targeted rewrites |
| CN110390104B (zh) * | 2019-07-23 | 2023-05-05 | 思必驰科技股份有限公司 | Irregular text transcription method and system for a voice dialogue platform |
| KR102663669B1 (ko) * | 2019-11-01 | 2024-05-08 | 엘지전자 주식회사 | Speech synthesis in a noisy environment |
| US20220035898A1 (en) * | 2020-07-31 | 2022-02-03 | Nuance Communications, Inc. | Audio CAPTCHA Using Echo |
| FR3122508A1 (fr) * | 2021-04-29 | 2022-11-04 | Orange | Characterization of a user by associating a sound with an interactive element |
| US20230142081A1 (en) * | 2021-11-10 | 2023-05-11 | Nuance Communications, Inc. | Voice captcha |
| CN114299919B (zh) * | 2021-12-27 | 2025-06-03 | 完美世界(北京)软件科技发展有限公司 | Text-to-speech method, device, storage medium, and computer equipment |
| US20240363119A1 (en) * | 2023-04-28 | 2024-10-31 | Pindrop Security, Inc. | Active voice liveness detection system |
| WO2024259486A1 (en) * | 2023-06-19 | 2024-12-26 | Macquarie University | Scam call system |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20040254793A1 (en) * | 2003-06-12 | 2004-12-16 | Cormac Herley | System and method for providing an audio challenge to distinguish a human from a computer |
| US20050015257A1 (en) * | 2003-07-14 | 2005-01-20 | Alexandre Bronstein | Human test based on human conceptual capabilities |
| JP2006106741A (ja) * | 2004-10-01 | 2006-04-20 | At & T Corp | Method and apparatus for preventing speech comprehension by interactive voice response systems |
| US20090319270A1 (en) * | 2008-06-23 | 2009-12-24 | John Nicholas Gross | CAPTCHA Using Challenges Optimized for Distinguishing Between Humans and Machines |
Family Cites Families (31)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS63231496A (ja) * | 1987-03-20 | 1988-09-27 | 富士通株式会社 | Speech recognition response system |
| US6195698B1 (en) | 1998-04-13 | 2001-02-27 | Compaq Computer Corporation | Method for selectively restricting access to computer systems |
| US7054811B2 (en) | 2002-11-06 | 2006-05-30 | Cellmax Systems Ltd. | Method and system for verifying and enabling user access based on voice parameters |
| US7039949B2 (en) | 2001-12-10 | 2006-05-02 | Brian Ross Cartmell | Method and system for blocking unwanted communications |
| JP2003302999A (ja) * | 2002-04-11 | 2003-10-24 | Advanced Media Inc | Personal authentication system by voice |
| CN1246826C (zh) * | 2004-06-01 | 2006-03-22 | 安徽中科大讯飞信息科技有限公司 | Method for mixing and outputting background sound and text speech in a speech synthesis system |
| US8255223B2 (en) | 2004-12-03 | 2012-08-28 | Microsoft Corporation | User authentication by combining speaker verification and reverse turing test |
| US7945952B1 (en) * | 2005-06-30 | 2011-05-17 | Google Inc. | Methods and apparatuses for presenting challenges to tell humans and computers apart |
| US8145914B2 (en) | 2005-12-15 | 2012-03-27 | Microsoft Corporation | Client-side CAPTCHA ceremony for user verification |
| US20070165811A1 (en) | 2006-01-19 | 2007-07-19 | John Reumann | System and method for spam detection |
| US7680891B1 (en) * | 2006-06-19 | 2010-03-16 | Google Inc. | CAPTCHA-based spam control for content creation systems |
| US8036902B1 (en) * | 2006-06-21 | 2011-10-11 | Tellme Networks, Inc. | Audio human verification |
| US20090055193A1 (en) * | 2007-02-22 | 2009-02-26 | Pudding Holdings Israel Ltd. | Method, apparatus and computer code for selectively providing access to a service in accordance with spoken content received from a user |
| BRPI0808289A2 (pt) * | 2007-03-21 | 2015-06-16 | Vivotext Ltd | Speech sample library for text-to-speech conversion, and methods and apparatus for generating and using the same |
| CN101059830A (zh) | 2007-06-01 | 2007-10-24 | 华南理工大学 | Method for identifying game bot plug-ins that can incorporate game features |
| US8495727B2 (en) | 2007-08-07 | 2013-07-23 | Microsoft Corporation | Spam reduction in real time communications by human interaction proof |
| US20090249477A1 (en) * | 2008-03-28 | 2009-10-01 | Yahoo! Inc. | Method and system for determining whether a computer user is human |
| US8752141B2 (en) * | 2008-06-27 | 2014-06-10 | John Nicholas | Methods for presenting and determining the efficacy of progressive pictorial and motion-based CAPTCHAs |
| US8793135B2 (en) * | 2008-08-25 | 2014-07-29 | At&T Intellectual Property I, L.P. | System and method for auditory captchas |
| US8925057B1 (en) * | 2009-02-06 | 2014-12-30 | New Jersey Institute Of Technology | Automated tests to distinguish computers from humans |
| US9342508B2 (en) * | 2009-03-19 | 2016-05-17 | Microsoft Technology Licensing, Llc | Data localization templates and parsing |
| US8315871B2 (en) * | 2009-06-04 | 2012-11-20 | Microsoft Corporation | Hidden Markov model based text to speech systems employing rope-jumping algorithm |
| WO2012010743A1 (en) * | 2010-07-23 | 2012-01-26 | Nokia Corporation | Method and apparatus for authorizing a user or a user device based on location information |
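| WO2012029519A1 (ja) * | 2010-08-31 | 2012-03-08 | 楽天株式会社 | Response determination device, response determination method, response determination program, recording medium, and response determination system |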
| US8719930B2 (en) * | 2010-10-12 | 2014-05-06 | Sonus Networks, Inc. | Real-time network attack detection and mitigation infrastructure |
| CA2819473A1 (en) * | 2010-11-30 | 2012-06-07 | Towson University | Audio based human-interaction proof |
| JP2012163692A (ja) * | 2011-02-04 | 2012-08-30 | Nec Corp | Audio signal processing system, audio signal processing method, and audio signal processing method program |
| US20120232907A1 (en) * | 2011-03-09 | 2012-09-13 | Christopher Liam Ivey | System and Method for Delivering a Human Interactive Proof to the Visually Impaired by Means of Semantic Association of Objects |
| US8810368B2 (en) * | 2011-03-29 | 2014-08-19 | Nokia Corporation | Method and apparatus for providing biometric authentication using distributed computations |
| US8904517B2 (en) * | 2011-06-28 | 2014-12-02 | International Business Machines Corporation | System and method for contextually interpreting image sequences |
| US9146917B2 (en) * | 2011-07-15 | 2015-09-29 | International Business Machines Corporation | Validating that a user is human |
2012
- 2012-02-17 US application US13/399,496, patent US10319363B2 (en): Active
2013
- 2013-02-01 EP application EP13749405.0A, patent EP2815398B1 (en): Active
- 2013-02-01 KR application KR1020147022837A, patent KR102101044B1 (ko): Expired - Fee Related
- 2013-02-01 CN application CN201380009453.4A, patent CN104115221B (zh): Active
- 2013-02-01 WO application PCT/US2013/024245, publication WO2013122750A1 (en): Ceased
- 2013-02-01 ES application ES13749405.0T, patent ES2628901T3 (es): Active
- 2013-02-01 JP application JP2014557674A, patent JP6238312B2 (ja): Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| CN104115221B (zh) | 2017-09-01 |
| EP2815398B1 (en) | 2017-03-29 |
| WO2013122750A1 (en) | 2013-08-22 |
| JP6238312B2 (ja) | 2017-11-29 |
| US10319363B2 (en) | 2019-06-11 |
| CN104115221A (zh) | 2014-10-22 |
| ES2628901T3 (es) | 2017-08-04 |
| EP2815398A1 (en) | 2014-12-24 |
| US20130218566A1 (en) | 2013-08-22 |
| KR20140134653A (ko) | 2014-11-24 |
| JP2015510147A (ja) | 2015-04-02 |
| EP2815398A4 (en) | 2015-05-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR102101044B1 (ko) | | Audio human interactive proof based on text-to-speech and semantics |
| Chen et al. | | Automated scoring of nonnative speech using the SpeechRater℠ v. 5.0 engine |
| US7289950B2 (en) | | Extended finite state grammar for speech recognition systems |
| Athanaselis et al. | | ASR for emotional speech: clarifying the issues and enhancing performance |
| US8036894B2 (en) | | Multi-unit approach to text-to-speech synthesis |
| CN110782880B (zh) | | Training method and device for a prosody generation model |
| Watts | | Unsupervised learning for text-to-speech synthesis |
| US9437195B2 (en) | | Biometric password security |
| US20110213610A1 (en) | | Processor Implemented Systems and Methods for Measuring Syntactic Complexity on Spontaneous Non-Native Speech Data by Using Structural Event Detection |
| US20190206386A1 (en) | | Method and system for text-to-speech synthesis |
| CN105280177A (zh) | | Speech synthesis dictionary creation device, speech synthesizer, and speech synthesis dictionary creation method |
| CN110782918B (zh) | | Artificial-intelligence-based speech prosody evaluation method and device |
| JP6810580B2 (ja) | | Language model learning device and program therefor |
| JPWO2016103652A1 (ja) | | Speech processing device, speech processing method, and program |
| US12118898B2 (en) | | Voice visualization system for English learning, and method therefor |
| US11250837B2 (en) | | Speech synthesis system, method and non-transitory computer readable medium with language option selection and acoustic models |
| HaCohen-Kerner et al. | | Language and gender classification of speech files using supervised machine learning methods |
| Dielen | | Improving the automatic speech recognition model whisper with voice activity detection |
| Motyka et al. | | Information technology of transcribing Ukrainian-language content based on deep learning |
| Kirkedal | | Danish stød and automatic speech recognition |
| Carson-Berndsen | | Multilingual time maps: portable phonotactic models for speech technology |
| Kumar et al. | | Formalizing expert knowledge for developing accurate speech recognizers. |
| Sayed et al. | | Convolutional Neural Networks to Facilitate the Continuous Recognition of Arabic Speech with Independent Speakers |
| Drašković | | Integration of AI Tools into an AI-Driven Software System to Make Learning Programming Easier |
| Zhang et al. | | AcousticScope: Understanding Biases in Voice Interaction via Automated Acoustic Testing |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PA0105 | International application | St.27 status event code: A-0-1-A10-A15-nap-PA0105 |
| | PG1501 | Laying open of application | St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
| | N231 | Notification of change of applicant | |
| | PN2301 | Change of applicant | St.27 status event code: A-3-3-R10-R13-asn-PN2301; St.27 status event code: A-3-3-R10-R11-asn-PN2301 |
| | AMND | Amendment | |
| | P11-X000 | Amendment of application requested | St.27 status event code: A-2-2-P10-P11-nap-X000 |
| | P13-X000 | Application amended | St.27 status event code: A-2-2-P10-P13-nap-X000 |
| | PA0201 | Request for examination | St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
| | E902 | Notification of reason for refusal | |
| | PE0902 | Notice of grounds for rejection | St.27 status event code: A-1-2-D10-D21-exm-PE0902 |
| | AMND | Amendment | |
| | P11-X000 | Amendment of application requested | St.27 status event code: A-2-2-P10-P11-nap-X000 |
| | P13-X000 | Application amended | St.27 status event code: A-2-2-P10-P13-nap-X000 |
| | E601 | Decision to refuse application | |
| | PE0601 | Decision on rejection of patent | St.27 status event code: N-2-6-B10-B15-exm-PE0601 |
| | T11-X000 | Administrative time limit extension requested | St.27 status event code: U-3-3-T10-T11-oth-X000 |
| | T13-X000 | Administrative time limit extension granted | St.27 status event code: U-3-3-T10-T13-oth-X000 |
| | AMND | Amendment | |
| | E13-X000 | Pre-grant limitation requested | St.27 status event code: A-2-3-E10-E13-lim-X000 |
| | P11-X000 | Amendment of application requested | St.27 status event code: A-2-2-P10-P11-nap-X000 |
| | P13-X000 | Application amended | St.27 status event code: A-2-2-P10-P13-nap-X000 |
| | PX0901 | Re-examination | St.27 status event code: A-2-3-E10-E12-rex-PX0901 |
| | PX0701 | Decision of registration after re-examination | St.27 status event code: A-3-4-F10-F13-rex-PX0701 |
| | X701 | Decision to grant (after re-examination) | |
| | GRNT | Written decision to grant | |
| | PR0701 | Registration of establishment | St.27 status event code: A-2-4-F10-F11-exm-PR0701 |
| | PR1002 | Payment of registration fee | St.27 status event code: A-2-2-U10-U12-oth-PR1002; Fee payment year number: 1 |
| | PG1601 | Publication of registration | St.27 status event code: A-4-4-Q10-Q13-nap-PG1601 |
| | PR1001 | Payment of annual fee | St.27 status event code: A-4-4-U10-U11-oth-PR1001; Fee payment year number: 4 |
| | PC1903 | Unpaid annual fee | St.27 status event code: A-4-4-U10-U13-oth-PC1903; Not in force date: 20240409; Payment event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE |
| | PC1903 | Unpaid annual fee | St.27 status event code: N-4-6-H10-H13-oth-PC1903; Ip right cessation event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE; Not in force date: 20240409 |