JP2012508903A5 - - Google Patents

Download PDF

Info

Publication number
JP2012508903A5
JP2012508903A5 JP2011536467A JP2011536467A JP2012508903A5 JP 2012508903 A5 JP2012508903 A5 JP 2012508903A5 JP 2011536467 A JP2011536467 A JP 2011536467A JP 2011536467 A JP2011536467 A JP 2011536467A JP 2012508903 A5 JP2012508903 A5 JP 2012508903A5
Authority
JP
Japan
Prior art keywords
ensemble
decision
weak
classifier
classifiers
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2011536467A
Other languages
English (en)
Other versions
JP2012508903A (ja
JP5850747B2 (ja
Filing date
Publication date
Priority claimed from US12/616,723 external-priority patent/US8566088B2/en
Application filed filed Critical
Publication of JP2012508903A publication Critical patent/JP2012508903A/ja
Publication of JP2012508903A5 publication Critical patent/JP2012508903A5/ja
Application granted granted Critical
Publication of JP5850747B2 publication Critical patent/JP5850747B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Description

弱い分類器とは、偶然より高い確率で決定を実行する決定関数である。アンサンブル分類器は、多数の弱い分類器の結果を結合することによって形成される。ブースティングとは、アンサンブルの決定が、何れの弱い分類器による決定より良好となるように、弱い分類器を選択しかつ重みづけすることによって、アンサンブル分類器を自動的に構築する公知の方法である。この選択は、相対的に多数の弱い分類器の組から各弱い分類器を繰り返し評価し、かつラベルが付されたトレーニング例の重み付け分布で最善のパフォーマンスを有するものを選択することによって、なされる。この選択された弱い分類器はアンサンブルに追加され、かつその決定には、その誤り率に基づいた重みが割り当てられる。次いで、この分布重みは、アンサンブルによってなされたエラーを強調するように調整され、そして次の反復処理が開始される。正しく分類されなかった例が、分布内で強調されるので、アンサンブルのエラーを訂正する傾向を持つ弱い分類器が、続くステップで追加され、そしてアンサンブル全体の決定が改善される。
ブースティングは、良好な一般化特性を有する分類器を生成するために示された。この弱い分類器は、それらのパフォーマンスが偶然より高い確率で行われる限り、いかなる形も取ることができる。
JP2011536467A 2008-11-12 2009-11-12 自動音声−テキスト変換のためのシステムと方法 Active JP5850747B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US11391008P 2008-11-12 2008-11-12
US61/113,910 2008-11-12
US12/616,723 2009-11-11
US12/616,723 US8566088B2 (en) 2008-11-12 2009-11-11 System and method for automatic speech to text conversion
PCT/US2009/064214 WO2010056868A1 (en) 2008-11-12 2009-11-12 System and method for automatic speach to text conversion

Publications (3)

Publication Number Publication Date
JP2012508903A JP2012508903A (ja) 2012-04-12
JP2012508903A5 true JP2012508903A5 (ja) 2013-11-14
JP5850747B2 JP5850747B2 (ja) 2016-02-03

Family

ID=42166012

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2011536467A Active JP5850747B2 (ja) 2008-11-12 2009-11-12 自動音声−テキスト変換のためのシステムと方法

Country Status (7)

Country Link
US (1) US8566088B2 (ja)
EP (1) EP2347408A4 (ja)
JP (1) JP5850747B2 (ja)
KR (1) KR101688240B1 (ja)
CN (1) CN102227767B (ja)
BR (1) BRPI0922035B1 (ja)
WO (1) WO2010056868A1 (ja)

Families Citing this family (94)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8719004B2 (en) * 2009-03-19 2014-05-06 Ditech Networks, Inc. Systems and methods for punctuating voicemail transcriptions
US8712774B2 (en) * 2009-03-30 2014-04-29 Nuance Communications, Inc. Systems and methods for generating a hybrid text string from two or more text strings generated by multiple automated speech recognition systems
US8412525B2 (en) * 2009-04-30 2013-04-02 Microsoft Corporation Noise robust speech classifier ensemble
US8281231B2 (en) * 2009-09-11 2012-10-02 Digitalsmiths, Inc. Timeline alignment for closed-caption text using speech recognition transcripts
US10224036B2 (en) * 2010-10-05 2019-03-05 Infraware, Inc. Automated identification of verbal records using boosted classifiers to improve a textual transcript
US8676574B2 (en) 2010-11-10 2014-03-18 Sony Computer Entertainment Inc. Method for tone/intonation recognition using auditory attention cues
US9031839B2 (en) * 2010-12-01 2015-05-12 Cisco Technology, Inc. Conference transcription based on conference data
US9558738B2 (en) * 2011-03-08 2017-01-31 At&T Intellectual Property I, L.P. System and method for speech recognition modeling for mobile voice search
WO2012134877A2 (en) * 2011-03-25 2012-10-04 Educational Testing Service Computer-implemented systems and methods evaluating prosodic features of speech
US8756061B2 (en) 2011-04-01 2014-06-17 Sony Computer Entertainment Inc. Speech syllable/vowel/phone boundary detection using auditory attention cues
US20120259638A1 (en) * 2011-04-08 2012-10-11 Sony Computer Entertainment Inc. Apparatus and method for determining relevance of input speech
US8185448B1 (en) 2011-06-10 2012-05-22 Myslinski Lucas J Fact checking method and system
US9087048B2 (en) 2011-06-10 2015-07-21 Linkedin Corporation Method of and system for validating a fact checking system
US9015037B2 (en) 2011-06-10 2015-04-21 Linkedin Corporation Interactive fact checking system
US9176957B2 (en) 2011-06-10 2015-11-03 Linkedin Corporation Selective fact checking method and system
US9053750B2 (en) 2011-06-17 2015-06-09 At&T Intellectual Property I, L.P. Speaker association with a visual representation of spoken content
US8719031B2 (en) 2011-06-17 2014-05-06 At&T Intellectual Property I, L.P. Dynamic access to external media content based on speaker content
US20130094567A1 (en) * 2011-10-18 2013-04-18 Lsi Corporation Apparatus and methods for performing block matching on a video stream
US20130132079A1 (en) * 2011-11-17 2013-05-23 Microsoft Corporation Interactive speech recognition
US8849666B2 (en) * 2012-02-23 2014-09-30 International Business Machines Corporation Conference call service with speech processing for heavily accented speakers
CN102682766A (zh) * 2012-05-12 2012-09-19 黄莹 可自学习的情侣声音对换机
US9529793B1 (en) 2012-06-01 2016-12-27 Google Inc. Resolving pronoun ambiguity in voice queries
US9336302B1 (en) 2012-07-20 2016-05-10 Zuci Realty Llc Insight and algorithmic clustering for automated synthesis
US8484022B1 (en) 2012-07-27 2013-07-09 Google Inc. Adaptive auto-encoders
US8484025B1 (en) * 2012-10-04 2013-07-09 Google Inc. Mapping an audio utterance to an action using a classifier
CN102903361A (zh) * 2012-10-15 2013-01-30 Itp创新科技有限公司 一种通话即时翻译系统和方法
US9557818B2 (en) * 2012-10-16 2017-01-31 Google Inc. Contextually-specific automatic separators
US9020822B2 (en) 2012-10-19 2015-04-28 Sony Computer Entertainment Inc. Emotion recognition using auditory attention cues extracted from users voice
US9031293B2 (en) 2012-10-19 2015-05-12 Sony Computer Entertainment Inc. Multi-modal sensor based emotion recognition and emotional interface
US9570076B2 (en) * 2012-10-30 2017-02-14 Google Technology Holdings LLC Method and system for voice recognition employing multiple voice-recognition techniques
US9240184B1 (en) 2012-11-15 2016-01-19 Google Inc. Frame-level combination of deep neural network and gaussian mixture models
RU2530268C2 (ru) 2012-11-28 2014-10-10 Общество с ограниченной ответственностью "Спиктуит" Способ обучения информационной диалоговой системы пользователем
US9672811B2 (en) 2012-11-29 2017-06-06 Sony Interactive Entertainment Inc. Combining auditory attention cues with phoneme posterior scores for phone/vowel/syllable boundary detection
US9483159B2 (en) 2012-12-12 2016-11-01 Linkedin Corporation Fact checking graphical user interface including fact checking icons
US8977555B2 (en) * 2012-12-20 2015-03-10 Amazon Technologies, Inc. Identification of utterance subjects
US9390380B2 (en) * 2013-03-15 2016-07-12 Intel Corporation Continuous interaction learning and detection in real-time
CN104143331B (zh) * 2013-05-24 2015-12-09 腾讯科技(深圳)有限公司 一种添加标点的方法和系统
CN104142915B (zh) * 2013-05-24 2016-02-24 腾讯科技(深圳)有限公司 一种添加标点的方法和系统
US9601130B2 (en) * 2013-07-18 2017-03-21 Mitsubishi Electric Research Laboratories, Inc. Method for processing speech signals using an ensemble of speech enhancement procedures
US9728202B2 (en) 2013-08-07 2017-08-08 Vonage America Inc. Method and apparatus for voice modification during a call
US9299358B2 (en) * 2013-08-07 2016-03-29 Vonage America Inc. Method and apparatus for voice modification during a call
US10169424B2 (en) 2013-09-27 2019-01-01 Lucas J. Myslinski Apparatus, systems and methods for scoring and distributing the reliability of online information
US20150095320A1 (en) 2013-09-27 2015-04-02 Trooclick France Apparatus, systems and methods for scoring the reliability of online information
US10311865B2 (en) * 2013-10-14 2019-06-04 The Penn State Research Foundation System and method for automated speech recognition
US8943405B1 (en) * 2013-11-27 2015-01-27 Google Inc. Assisted punctuation of character strings
GB2523984B (en) * 2013-12-18 2017-07-26 Cirrus Logic Int Semiconductor Ltd Processing received speech data
CN103761064A (zh) * 2013-12-27 2014-04-30 圆展科技股份有限公司 自动语音输入系统及其方法
US9269045B2 (en) * 2014-02-14 2016-02-23 Qualcomm Incorporated Auditory source separation in a spiking neural network
US9972055B2 (en) 2014-02-28 2018-05-15 Lucas J. Myslinski Fact checking method and system utilizing social networking information
US9643722B1 (en) 2014-02-28 2017-05-09 Lucas J. Myslinski Drone device security system
US8990234B1 (en) 2014-02-28 2015-03-24 Lucas J. Myslinski Efficient fact checking method and system
US9189514B1 (en) 2014-09-04 2015-11-17 Lucas J. Myslinski Optimized fact checking method and system
US9520128B2 (en) * 2014-09-23 2016-12-13 Intel Corporation Frame skipping with extrapolation and outputs on demand neural network for automatic speech recognition
KR20160058470A (ko) * 2014-11-17 2016-05-25 삼성전자주식회사 음성 합성 장치 및 그 제어 방법
US9659259B2 (en) * 2014-12-20 2017-05-23 Microsoft Corporation Latency-efficient multi-stage tagging mechanism
US10395555B2 (en) * 2015-03-30 2019-08-27 Toyota Motor Engineering & Manufacturing North America, Inc. System and method for providing optimal braille output based on spoken and sign language
US9640177B2 (en) 2015-06-01 2017-05-02 Quest Software Inc. Method and apparatus to extrapolate sarcasm and irony using multi-dimensional machine learning based linguistic analysis
US10529328B2 (en) 2015-06-22 2020-01-07 Carnegie Mellon University Processing speech signals in voice-based profiling
US9978370B2 (en) * 2015-07-31 2018-05-22 Lenovo (Singapore) Pte. Ltd. Insertion of characters in speech recognition
CN105741838B (zh) * 2016-01-20 2019-10-15 百度在线网络技术(北京)有限公司 语音唤醒方法及装置
CN105704538A (zh) * 2016-03-17 2016-06-22 广东小天才科技有限公司 一种音视频字幕生成方法及系统
KR101862337B1 (ko) 2016-03-24 2018-05-31 주식회사 닷 정보 출력 장치, 방법 및 컴퓨터 판독 가능한 기록 매체
CN107886951B (zh) * 2016-09-29 2021-07-23 百度在线网络技术(北京)有限公司 一种语音检测方法、装置及设备
KR102476897B1 (ko) 2016-10-05 2022-12-12 삼성전자주식회사 객체 추적 방법 및 장치, 및 이를 이용한 3d 디스플레이 장치
CN107943405A (zh) * 2016-10-13 2018-04-20 广州市动景计算机科技有限公司 语音播报装置、方法、浏览器及用户终端
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
KR101818980B1 (ko) 2016-12-12 2018-01-16 주식회사 소리자바 다중 화자 음성 인식 수정 시스템
CN107424612B (zh) * 2017-07-28 2021-07-06 北京搜狗科技发展有限公司 处理方法、装置和机器可读介质
JP6891073B2 (ja) * 2017-08-22 2021-06-18 キヤノン株式会社 スキャン画像にファイル名等を設定するための装置、その制御方法及びプログラム
US10423727B1 (en) 2018-01-11 2019-09-24 Wells Fargo Bank, N.A. Systems and methods for processing nuances in natural language
CN108108357B (zh) * 2018-01-12 2022-08-09 京东方科技集团股份有限公司 口音转换方法及装置、电子设备
CN108600773B (zh) * 2018-04-25 2021-08-10 腾讯科技(深圳)有限公司 字幕数据推送方法、字幕展示方法、装置、设备及介质
RU2711153C2 (ru) 2018-05-23 2020-01-15 Общество С Ограниченной Ответственностью "Яндекс" Способы и электронные устройства для определения намерения, связанного с произнесенным высказыванием пользователя
CN108831458A (zh) * 2018-05-29 2018-11-16 广东声将军科技有限公司 一种离线的语音到命令变换方法和系统
CN108831481A (zh) * 2018-08-01 2018-11-16 平安科技(深圳)有限公司 语音识别中符号添加方法、装置、计算机设备及存储介质
US11094326B2 (en) * 2018-08-06 2021-08-17 Cisco Technology, Inc. Ensemble modeling of automatic speech recognition output
CN109192217B (zh) * 2018-08-06 2023-03-31 中国科学院声学研究所 面向多类低速率压缩语音隐写的通用信息隐藏检测方法
TWI698857B (zh) * 2018-11-21 2020-07-11 財團法人工業技術研究院 語音辨識系統及其方法、與電腦程式產品
RU2761940C1 (ru) 2018-12-18 2021-12-14 Общество С Ограниченной Ответственностью "Яндекс" Способы и электронные устройства для идентификации пользовательского высказывания по цифровому аудиосигналу
CN111858861B (zh) * 2019-04-28 2022-07-19 华为技术有限公司 一种基于绘本的问答交互方法及电子设备
CN112036174B (zh) * 2019-05-15 2023-11-07 南京大学 一种标点标注方法及装置
CN110287156B (zh) * 2019-06-28 2021-12-21 维沃移动通信有限公司 文件处理方法及移动终端
US11961511B2 (en) * 2019-11-08 2024-04-16 Vail Systems, Inc. System and method for disambiguation and error resolution in call transcripts
CN111369981B (zh) * 2020-03-02 2024-02-23 北京远鉴信息技术有限公司 一种方言地域识别方法、装置、电子设备及存储介质
CN111931508B (zh) * 2020-08-24 2023-05-12 上海携旅信息技术有限公司 数字转换方法及系统、文本处理方法及系统、设备和介质
KR102562692B1 (ko) * 2020-10-08 2023-08-02 (주)에어사운드 문장 구두점 제공 시스템 및 방법
WO2022085296A1 (ja) * 2020-10-19 2022-04-28 ソニーグループ株式会社 情報処理装置及び情報処理方法、コンピュータプログラム、フォーマット変換装置、オーディオコンテンツ自動転記システム、学習済みモデル、並びに表示装置
CN112331178A (zh) * 2020-10-26 2021-02-05 昆明理工大学 一种用于低信噪比环境下的语种识别特征融合方法
CN112966561B (zh) * 2021-02-03 2024-01-30 成都职业技术学院 一种便携式大学生创新创业多功能记录方法及装置
US11545143B2 (en) 2021-05-18 2023-01-03 Boris Fridman-Mintz Recognition or synthesis of human-uttered harmonic sounds
CN113744368A (zh) * 2021-08-12 2021-12-03 北京百度网讯科技有限公司 动画合成方法、装置、电子设备及存储介质
KR20230102506A (ko) * 2021-12-30 2023-07-07 삼성전자주식회사 전자 장치 및 이의 제어 방법
TWI812070B (zh) * 2022-03-15 2023-08-11 宏碁股份有限公司 錄音檔轉文字稿方法及系統
CN114758645A (zh) * 2022-04-29 2022-07-15 建信金融科技有限责任公司 语音合成模型的训练方法、装置、设备及存储介质

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5749066A (en) * 1995-04-24 1998-05-05 Ericsson Messaging Systems Inc. Method and apparatus for developing a neural network for phoneme recognition
US5799276A (en) 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US6611802B2 (en) 1999-06-11 2003-08-26 International Business Machines Corporation Method and system for proofreading and correcting dictated text
JP2002156997A (ja) 2000-11-21 2002-05-31 Sharp Corp 音声検出制御装置
US7668718B2 (en) 2001-07-17 2010-02-23 Custom Speech Usa, Inc. Synchronized pattern recognition source data processed by manual or automatic means for creation of shared speaker-dependent speech user profile
JP2003177776A (ja) 2001-12-12 2003-06-27 Seiko Instruments Inc 議事録記録システム
JP4675840B2 (ja) 2006-06-29 2011-04-27 三菱電機株式会社 リモートコントローラ並びに家電機器

Similar Documents

Publication Publication Date Title
JP2012508903A5 (ja)
Sigtia et al. Audio Chord Recognition with a Hybrid Recurrent Neural Network.
JP2018529159A5 (ja)
CN105574547B (zh) 适应动态调整基分类器权重的集成学习方法及装置
JP2015536356A5 (ja)
JP2010123090A5 (ja)
JP2011521488A5 (ja)
JP2011509650A5 (ja)
WO2011056421A3 (en) Sifting models of a subsurface structure
FR2939132B1 (fr) Fabrication de chlorure de vinyle monomere a partir de matieres renouvelables, chlorure de vinyle monomere obtenu et utilisation.
WO2017106728A3 (en) Repeat protein architectures
WO2013074001A3 (en) Consumer information aggregator and profile generator
WO2012129328A3 (en) Knowledge-based automatic image segmentation
Cao et al. Feature importance sampling‐based adaptive random forest as a useful tool to screen underlying lead compounds
JP2014520318A5 (ja)
JP2013210230A5 (ja)
CN103246897A (zh) 一种基于AdaBoost的弱分类器内部结构调整方法
Mares et al. Stochastic modeling of the connection between sea level pressure and discharge in the Danube lower basin by means of Hidden Markov Model
Yoon et al. Predicting Daylight Illuminances on Vertical Surfaces Using Luminous Efficacy of SolarIr radiance
Jiranyakul Recent evidence of the validity of the export-led growth hypothesis for Thailand
Del Prete et al. Feature selection on a dataset of protein families: from exploratory data analysis to statistical variable importance
Strakova „Pfidaná hodnota studia na viceletych gymnáziích ve svëtle dostupnych datovych zdrojû."
O’Brien et al. Operational Evaluation of a Wind-farm Forecasting System
Karaseva et al. INFORMATIONAL AND TERMINOLOGICAL BASIS IN MULTILINGUALADAPTIVE-TRAINING TECHNOLOGY
At-Tasneem et al. Numerical simulation of multiple arrays arrangement of micro hydro power turbines