SG11202001429XA - Information processing apparatus and information processing method - Google Patents

Information processing apparatus and information processing method

Info

Publication number
SG11202001429XA
SG11202001429XA SG11202001429XA SG11202001429XA SG11202001429XA SG 11202001429X A SG11202001429X A SG 11202001429XA SG 11202001429X A SG11202001429X A SG 11202001429XA SG 11202001429X A SG11202001429X A SG 11202001429XA SG 11202001429X A SG11202001429X A SG 11202001429XA
Authority
SG
Singapore
Prior art keywords
information processing
processing apparatus
processing method
information
processing
Prior art date
Application number
SG11202001429XA
Other languages
English (en)
Inventor
Yasuaki Yamagishi
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Publication of SG11202001429XA publication Critical patent/SG11202001429XA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/043Distributed expert systems; Blackboards
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H20/00Arrangements for broadcast or for distribution combined with broadcast
    • H04H20/28Arrangements for simultaneous broadcast of plural pieces of information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)
SG11202001429XA 2017-09-15 2018-08-31 Information processing apparatus and information processing method SG11202001429XA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2017177754 2017-09-15
PCT/JP2018/032323 WO2019054199A1 (ja) 2017-09-15 2018-08-31 情報処理装置、及び情報処理方法

Publications (1)

Publication Number Publication Date
SG11202001429XA true SG11202001429XA (en) 2020-04-29

Family

ID=65722792

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202001429XA SG11202001429XA (en) 2017-09-15 2018-08-31 Information processing apparatus and information processing method

Country Status (10)

Country Link
US (1) US11600270B2 (ko)
EP (1) EP3683792B1 (ko)
JP (1) JP7227140B2 (ko)
KR (1) KR102607192B1 (ko)
CN (1) CN111052231B (ko)
AU (1) AU2018333668B2 (ko)
CA (1) CA3075249A1 (ko)
MX (1) MX2020002591A (ko)
SG (1) SG11202001429XA (ko)
WO (1) WO2019054199A1 (ko)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020128552A1 (ja) * 2018-12-18 2020-06-25 日産自動車株式会社 音声認識装置、音声認識装置の制御方法、コンテンツ再生装置、及びコンテンツ送受信システム
JP2020185618A (ja) * 2019-05-10 2020-11-19 株式会社スター精機 機械動作方法,機械動作設定方法及び機械動作確認方法
WO2021100555A1 (ja) * 2019-11-21 2021-05-27 ソニーグループ株式会社 情報処理システム、情報処理装置、情報処理方法及びプログラム
US20240038249A1 (en) * 2022-07-27 2024-02-01 Cerence Operating Company Tamper-robust watermarking of speech signals

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7720249B2 (en) * 1993-11-18 2010-05-18 Digimarc Corporation Watermark embedder and reader
US6937984B1 (en) * 1998-12-17 2005-08-30 International Business Machines Corporation Speech command input recognition system for interactive computer display with speech controlled display of recognized commands
KR100552468B1 (ko) 2001-07-19 2006-02-15 삼성전자주식회사 음성인식에 따른 오동작을 방지 및 음성인식율을 향상 할수 있는 전자기기 및 방법
JP2005338454A (ja) 2004-05-27 2005-12-08 Toshiba Tec Corp 音声対話装置
US9955205B2 (en) 2005-06-10 2018-04-24 Hewlett-Packard Development Company, L.P. Method and system for improving interactive media response systems using visual cues
JP5103479B2 (ja) * 2006-10-18 2012-12-19 デスティニー ソフトウェア プロダクションズ インコーポレイテッド メディアデータに電子透かしを付与する方法
JP5042799B2 (ja) * 2007-04-16 2012-10-03 ソニー株式会社 音声チャットシステム、情報処理装置およびプログラム
JP5144196B2 (ja) 2007-05-08 2013-02-13 ソフトバンクBb株式会社 分散処理により膨大なコンテンツの検査を行う装置と方法、およびコンテンツの検査結果にもとづいて利用者間の自律的なコンテンツ流通とコンテンツ利用を制御するコンテンツ配信システム
JP5332602B2 (ja) * 2008-12-26 2013-11-06 ヤマハ株式会社 サービス提供装置
JP2010164992A (ja) * 2010-03-19 2010-07-29 Toshiba Tec Corp 音声対話装置
JP5982791B2 (ja) * 2011-11-16 2016-08-31 ソニー株式会社 情報処理装置及び情報処理方法、情報提供装置、並びに、情報提供システム
JP6221202B2 (ja) 2012-02-03 2017-11-01 ヤマハ株式会社 通信システム
CN104956436B (zh) 2012-12-28 2018-05-29 株式会社索思未来 带有语音识别功能的设备以及语音识别方法
US9715875B2 (en) 2014-05-30 2017-07-25 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US9548053B1 (en) * 2014-09-19 2017-01-17 Amazon Technologies, Inc. Audible command filtering
US9924224B2 (en) * 2015-04-03 2018-03-20 The Nielsen Company (Us), Llc Methods and apparatus to determine a state of a media presentation device
US9818414B2 (en) * 2015-06-04 2017-11-14 Intel Corporation Dialogue system with audio watermark
US10079024B1 (en) * 2016-08-19 2018-09-18 Amazon Technologies, Inc. Detecting replay attacks in voice-based authentication
US10395650B2 (en) * 2017-06-05 2019-08-27 Google Llc Recorded media hotword trigger suppression

Also Published As

Publication number Publication date
EP3683792B1 (en) 2024-07-03
JPWO2019054199A1 (ja) 2020-10-22
US11600270B2 (en) 2023-03-07
AU2018333668A1 (en) 2020-03-26
AU2018333668B2 (en) 2023-12-21
EP3683792A4 (en) 2020-11-11
KR102607192B1 (ko) 2023-11-29
WO2019054199A1 (ja) 2019-03-21
CA3075249A1 (en) 2019-03-21
EP3683792A1 (en) 2020-07-22
MX2020002591A (es) 2020-07-13
KR20200053486A (ko) 2020-05-18
CN111052231A (zh) 2020-04-21
CN111052231B (zh) 2024-04-12
JP7227140B2 (ja) 2023-02-21
US20200211549A1 (en) 2020-07-02

Similar Documents

Publication Publication Date Title
ZA201902729B (en) Blockchain data processing method and apparatus
SG11202002560PA (en) Data processing method and apparatus
SG10201706691UA (en) Information processing apparatus and information processing method
IL255644B (en) Data processing apparatus and method
EP3624530A4 (en) INFORMATION PROCESSING PROCESS AND RELATED DEVICE
EP3893180C0 (en) SERVICE DATA PROCESSING METHOD AND DEVICE
SG11202103291YA (en) Information processing apparatus and information processing method
ZA201905493B (en) Information processing method and communications apparatus
EP3624551A4 (en) INFORMATION PROCESSING PROCESS AND APPARATUS
EP3557534A4 (en) INFORMATION PROCESSING METHOD AND APPARATUS
SG11202006203QA (en) Location information processing method and apparatus
GB201700081D0 (en) Data processing method and apparatus
EP3480965A4 (en) METHOD AND APPARATUS FOR PROCESSING INFORMATION
EP3565228A4 (en) INFORMATION PROCESSING PROCESS AND APPARATUS
ZA201906314B (en) Information processing method and communication apparatus
SG11201710887RA (en) Information processing apparatus and information processing method
GB201705488D0 (en) Information processing apparatus and control method thereof
EP3267739A4 (en) Information processing apparatus and information processing method
EP3269302A4 (en) Information processing apparatus and information processing method
GB201903138D0 (en) Information processing apparatus and information processing method
SG11202001429XA (en) Information processing apparatus and information processing method
SG11201913532YA (en) Media information processing method and apparatus
GB201704320D0 (en) Data processing apparatus and methods
SG11201705043PA (en) Information processing apparatus and information processing method
EP3550798A4 (en) INFORMATION PROCESSING AND DEVICE