JP6238312B2 - テキストの音声化及び意味に基づくオーディオhip - Google Patents

テキストの音声化及び意味に基づくオーディオhip Download PDF

Info

Publication number
JP6238312B2
JP6238312B2 JP2014557674A JP2014557674A JP6238312B2 JP 6238312 B2 JP6238312 B2 JP 6238312B2 JP 2014557674 A JP2014557674 A JP 2014557674A JP 2014557674 A JP2014557674 A JP 2014557674A JP 6238312 B2 JP6238312 B2 JP 6238312B2
Authority
JP
Japan
Prior art keywords
text
speech
audio
response
challenge
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2014557674A
Other languages
English (en)
Japanese (ja)
Other versions
JP2015510147A5 (enExample
JP2015510147A (ja
Inventor
チエン,ヤオ
ベンジャミン ジュウ,ビン
ベンジャミン ジュウ,ビン
カオ−ピーン スーン,フランク
カオ−ピーン スーン,フランク
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Corp
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp, Microsoft Technology Licensing LLC filed Critical Microsoft Corp
Publication of JP2015510147A publication Critical patent/JP2015510147A/ja
Publication of JP2015510147A5 publication Critical patent/JP2015510147A5/ja
Application granted granted Critical
Publication of JP6238312B2 publication Critical patent/JP6238312B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2221/00Indexing scheme relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F2221/21Indexing scheme relating to G06F21/00 and subgroups addressing additional information or applications relating to security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F2221/2133Verifying human interaction, e.g., Captcha
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
JP2014557674A 2012-02-17 2013-02-01 テキストの音声化及び意味に基づくオーディオhip Expired - Fee Related JP6238312B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/399,496 2012-02-17
US13/399,496 US10319363B2 (en) 2012-02-17 2012-02-17 Audio human interactive proof based on text-to-speech and semantics
PCT/US2013/024245 WO2013122750A1 (en) 2012-02-17 2013-02-01 Audio human interactive proof based on text-to-speech and semantics

Publications (3)

Publication Number Publication Date
JP2015510147A JP2015510147A (ja) 2015-04-02
JP2015510147A5 JP2015510147A5 (enExample) 2016-02-25
JP6238312B2 true JP6238312B2 (ja) 2017-11-29

Family

ID=48982943

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2014557674A Expired - Fee Related JP6238312B2 (ja) 2012-02-17 2013-02-01 テキストの音声化及び意味に基づくオーディオhip

Country Status (7)

Country Link
US (1) US10319363B2 (enExample)
EP (1) EP2815398B1 (enExample)
JP (1) JP6238312B2 (enExample)
KR (1) KR102101044B1 (enExample)
CN (1) CN104115221B (enExample)
ES (1) ES2628901T3 (enExample)
WO (1) WO2013122750A1 (enExample)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140067394A1 (en) * 2012-08-28 2014-03-06 King Abdulaziz City For Science And Technology System and method for decoding speech
US10149077B1 (en) * 2012-10-04 2018-12-04 Amazon Technologies, Inc. Audio themes
US9338162B2 (en) * 2014-06-13 2016-05-10 International Business Machines Corporation CAPTCHA challenge incorporating obfuscated characters
CN105047192B (zh) * 2015-05-25 2018-08-17 上海交通大学 基于隐马尔科夫模型的统计语音合成方法及装置
CN105185379B (zh) * 2015-06-17 2017-08-18 百度在线网络技术(北京)有限公司 声纹认证方法和装置
CN105161105A (zh) * 2015-07-31 2015-12-16 北京奇虎科技有限公司 一种交互系统的语音识别方法和装置
CN105161098A (zh) * 2015-07-31 2015-12-16 北京奇虎科技有限公司 一种交互系统的语音识别方法和装置
US10277581B2 (en) * 2015-09-08 2019-04-30 Oath, Inc. Audio verification
US9466299B1 (en) 2015-11-18 2016-10-11 International Business Machines Corporation Speech source classification
US10347247B2 (en) * 2016-12-30 2019-07-09 Google Llc Modulation of packetized audio signals
US10332520B2 (en) 2017-02-13 2019-06-25 Qualcomm Incorporated Enhanced speech generation
CN108630193B (zh) * 2017-03-21 2020-10-02 北京嘀嘀无限科技发展有限公司 语音识别方法及装置
WO2018183290A1 (en) * 2017-03-27 2018-10-04 Orion Labs Bot group messaging using general voice libraries
CN107609389B (zh) * 2017-08-24 2020-10-30 南京理工大学 一种基于图像内容相关性的验证方法及系统
JP6791825B2 (ja) * 2017-09-26 2020-11-25 株式会社日立製作所 情報処理装置、対話処理方法及び対話システム
WO2019077013A1 (en) 2017-10-18 2019-04-25 Soapbox Labs Ltd. METHODS AND SYSTEMS FOR PROCESSING AUDIO SIGNALS CONTAINING VOICE DATA
KR20190057687A (ko) * 2017-11-20 2019-05-29 삼성전자주식회사 챗봇 변경을 위한 위한 전자 장치 및 이의 제어 방법
US11355125B2 (en) 2018-08-06 2022-06-07 Google Llc Captcha automated assistant
CN111048062B (zh) * 2018-10-10 2022-10-04 华为技术有限公司 语音合成方法及设备
US11423073B2 (en) 2018-11-16 2022-08-23 Microsoft Technology Licensing, Llc System and management of semantic indicators during document presentations
US11126794B2 (en) * 2019-04-11 2021-09-21 Microsoft Technology Licensing, Llc Targeted rewrites
CN110390104B (zh) * 2019-07-23 2023-05-05 思必驰科技股份有限公司 用于语音对话平台的不规则文本转写方法及系统
KR102663669B1 (ko) * 2019-11-01 2024-05-08 엘지전자 주식회사 소음 환경에서의 음성 합성
US20220035898A1 (en) * 2020-07-31 2022-02-03 Nuance Communications, Inc. Audio CAPTCHA Using Echo
FR3122508A1 (fr) * 2021-04-29 2022-11-04 Orange Caractérisation d’un utilisateur par association d’un son à un élément interactif
US20230142081A1 (en) * 2021-11-10 2023-05-11 Nuance Communications, Inc. Voice captcha
CN114299919B (zh) * 2021-12-27 2025-06-03 完美世界(北京)软件科技发展有限公司 文字转语音方法、装置、存储介质及计算机设备
US20240363119A1 (en) * 2023-04-28 2024-10-31 Pindrop Security, Inc. Active voice liveness detection system
WO2024259486A1 (en) * 2023-06-19 2024-12-26 Macquarie University Scam call system

Family Cites Families (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63231496A (ja) * 1987-03-20 1988-09-27 富士通株式会社 音声認識応答システム
US6195698B1 (en) 1998-04-13 2001-02-27 Compaq Computer Corporation Method for selectively restricting access to computer systems
US7054811B2 (en) 2002-11-06 2006-05-30 Cellmax Systems Ltd. Method and system for verifying and enabling user access based on voice parameters
US7039949B2 (en) 2001-12-10 2006-05-02 Brian Ross Cartmell Method and system for blocking unwanted communications
JP2003302999A (ja) * 2002-04-11 2003-10-24 Advanced Media Inc 音声による個人認証システム
US20040254793A1 (en) * 2003-06-12 2004-12-16 Cormac Herley System and method for providing an audio challenge to distinguish a human from a computer
US7841940B2 (en) * 2003-07-14 2010-11-30 Astav, Inc Human test based on human conceptual capabilities
CN1246826C (zh) * 2004-06-01 2006-03-22 安徽中科大讯飞信息科技有限公司 在语音合成系统中将背景音与文本语音混合输出的方法
US7558389B2 (en) 2004-10-01 2009-07-07 At&T Intellectual Property Ii, L.P. Method and system of generating a speech signal with overlayed random frequency signal
US8255223B2 (en) 2004-12-03 2012-08-28 Microsoft Corporation User authentication by combining speaker verification and reverse turing test
US7945952B1 (en) * 2005-06-30 2011-05-17 Google Inc. Methods and apparatuses for presenting challenges to tell humans and computers apart
US8145914B2 (en) 2005-12-15 2012-03-27 Microsoft Corporation Client-side CAPTCHA ceremony for user verification
US20070165811A1 (en) 2006-01-19 2007-07-19 John Reumann System and method for spam detection
US7680891B1 (en) * 2006-06-19 2010-03-16 Google Inc. CAPTCHA-based spam control for content creation systems
US8036902B1 (en) * 2006-06-21 2011-10-11 Tellme Networks, Inc. Audio human verification
US20090055193A1 (en) * 2007-02-22 2009-02-26 Pudding Holdings Israel Ltd. Method, apparatus and computer code for selectively providing access to a service in accordance with spoken content received from a user
BRPI0808289A2 (pt) * 2007-03-21 2015-06-16 Vivotext Ltd "biblioteca de amostras de fala para transformar texto em falta e métodos e instrumentos para gerar e utilizar o mesmo"
CN101059830A (zh) 2007-06-01 2007-10-24 华南理工大学 一种可结合游戏特征的机器人外挂识别方法
US8495727B2 (en) 2007-08-07 2013-07-23 Microsoft Corporation Spam reduction in real time communications by human interaction proof
US20090249477A1 (en) * 2008-03-28 2009-10-01 Yahoo! Inc. Method and system for determining whether a computer user is human
US8489399B2 (en) 2008-06-23 2013-07-16 John Nicholas and Kristin Gross Trust System and method for verifying origin of input through spoken language analysis
US8752141B2 (en) * 2008-06-27 2014-06-10 John Nicholas Methods for presenting and determining the efficacy of progressive pictorial and motion-based CAPTCHAs
US8793135B2 (en) * 2008-08-25 2014-07-29 At&T Intellectual Property I, L.P. System and method for auditory captchas
US8925057B1 (en) * 2009-02-06 2014-12-30 New Jersey Institute Of Technology Automated tests to distinguish computers from humans
US9342508B2 (en) * 2009-03-19 2016-05-17 Microsoft Technology Licensing, Llc Data localization templates and parsing
US8315871B2 (en) * 2009-06-04 2012-11-20 Microsoft Corporation Hidden Markov model based text to speech systems employing rope-jumping algorithm
WO2012010743A1 (en) * 2010-07-23 2012-01-26 Nokia Corporation Method and apparatus for authorizing a user or a user device based on location information
WO2012029519A1 (ja) * 2010-08-31 2012-03-08 楽天株式会社 応答判定装置、応答判定方法、応答判定プログラム、記録媒体、および、応答判定システム
US8719930B2 (en) * 2010-10-12 2014-05-06 Sonus Networks, Inc. Real-time network attack detection and mitigation infrastructure
CA2819473A1 (en) * 2010-11-30 2012-06-07 Towson University Audio based human-interaction proof
JP2012163692A (ja) * 2011-02-04 2012-08-30 Nec Corp 音声信号処理システム、音声信号処理方法および音声信号処理方法プログラム
US20120232907A1 (en) * 2011-03-09 2012-09-13 Christopher Liam Ivey System and Method for Delivering a Human Interactive Proof to the Visually Impaired by Means of Semantic Association of Objects
US8810368B2 (en) * 2011-03-29 2014-08-19 Nokia Corporation Method and apparatus for providing biometric authentication using distributed computations
US8904517B2 (en) * 2011-06-28 2014-12-02 International Business Machines Corporation System and method for contexually interpreting image sequences
US9146917B2 (en) * 2011-07-15 2015-09-29 International Business Machines Corporation Validating that a user is human

Also Published As

Publication number Publication date
CN104115221B (zh) 2017-09-01
EP2815398B1 (en) 2017-03-29
WO2013122750A1 (en) 2013-08-22
US10319363B2 (en) 2019-06-11
KR102101044B1 (ko) 2020-04-14
CN104115221A (zh) 2014-10-22
ES2628901T3 (es) 2017-08-04
EP2815398A1 (en) 2014-12-24
US20130218566A1 (en) 2013-08-22
KR20140134653A (ko) 2014-11-24
JP2015510147A (ja) 2015-04-02
EP2815398A4 (en) 2015-05-06

Similar Documents

Publication Publication Date Title
JP6238312B2 (ja) テキストの音声化及び意味に基づくオーディオhip
US11837216B2 (en) Speech recognition using unspoken text and speech synthesis
US11335324B2 (en) Synthesized data augmentation using voice conversion and speech recognition models
Reddy et al. Speech-to-text and text-to-speech recognition using deep learning
US10685644B2 (en) Method and system for text-to-speech synthesis
US9437195B2 (en) Biometric password security
US12112740B2 (en) Creative work systems and methods thereof
CN113192483B (zh) 一种文本转换为语音的方法、装置、存储介质和设备
Iriondo et al. Automatic refinement of an expressive speech corpus assembling subjective perception and automatic classification
Dai [Retracted] An Automatic Pronunciation Error Detection and Correction Mechanism in English Teaching Based on an Improved Random Forest Model
Bailey et al. Addressing Bias in Spoken Language Systems Used in the Development and Implementation of Automated Child Language‐Based Assessment
Zahariev et al. An approach to speech ambiguities eliminating using semantically-acoustical analysis
Ajayi et al. Indigenuous Vocabulary Reformulation For Continuousyorùbá Speech Recognition In M-Commerce Using Acoustic Nudging-Based Gaussian Mixture Model
Drašković Integration of Ai Tools into an Ai-Driven Software System to Make Learning Programming Easier
Nakashe AUTOMATIC SPEECH RECOGNITION FOR AIR TRAFFIC CONTROL USING CONVOLUTIONAL LSTM
CN120472890A (zh) 语音处理方法、装置及电子设备
Lawyer Underspecification in the Mental Lexicon
Ford Jr Spoken Language Identification from Processing and Pattern Analysis of Spectrograms

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20160105

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20160105

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20170113

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20170124

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20170419

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20170926

A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20171003

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20171025

R150 Certificate of patent or registration of utility model

Ref document number: 6238312

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees