GB2614179A - Speech-to-text auto-scaling for live use cases - Google Patents

Speech-to-text auto-scaling for live use cases

Publication number
GB2614179A
Authority
GB
United Kingdom
Prior art keywords
deltas
current values
latency
computational resources
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
GB2304504.0A
Other languages
English (en)
Other versions
GB202304504D0 (en)
Inventor
Bolanos Daniel
Rogelio Lee Antonio
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp
Publication of GB202304504D0
Publication of GB2614179A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005 Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011 Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • G06F9/5022 Mechanisms to release resources
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005 Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027 Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/505 Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061 Partitioning or combining of resources
    • G06F9/5077 Logical partitioning of resources; Management or configuration of virtualized resources
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083 Techniques for rebalancing the load in a distributed system
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/285 Memory allocation or algorithm optimisation to reduce hardware requirements
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00 Indexing scheme relating to G06F9/00
    • G06F2209/50 Indexing scheme relating to G06F9/50
    • G06F2209/501 Performance criteria

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Transfer Between Computers (AREA)
  • Machine Translation (AREA)
GB2304504.0A 2020-09-03 2021-09-02 Speech-to-text auto-scaling for live use cases Pending GB2614179A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US17/010,866 US11521617B2 (en) 2020-09-03 2020-09-03 Speech-to-text auto-scaling for live use cases
PCT/CN2021/116202 WO2022048595A1 (en) 2020-09-03 2021-09-02 Speech-to-text auto-scaling for live use cases

Publications (2)

Publication Number Publication Date
GB202304504D0 (en) 2023-05-10
GB2614179A (en) 2023-06-28

Family

ID=80358863

Family Applications (1)

Application Number Title Priority Date Filing Date
GB2304504.0A Pending GB2614179A (en) 2020-09-03 2021-09-02 Speech-to-text auto-scaling for live use cases

Country Status (6)

Country Link
US (1) US11521617B2
JP (1) JP7748783B2
CN (1) CN116194987B
DE (1) DE112021003525B4
GB (1) GB2614179A
WO (1) WO2022048595A1

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12008416B2 (en) 2021-06-29 2024-06-11 Capital One Services, Llc Systems and methods for choosing an appropriate scaling technique for allocating computational resources to distributed applications
US12468574B2 (en) * 2021-10-28 2025-11-11 Capital One Services, Llc Systems and methods for dynamically scaling remote resources
US12165646B2 (en) * 2022-04-29 2024-12-10 Zoom Video Communications, Inc. Delta models for providing privatized speech-to-text during virtual meetings
US20230409393A1 (en) * 2022-05-24 2023-12-21 Microsoft Technology Licensing, Llc Systems and methods for autoscaling in datacenters

Citations (4)

Publication number Priority date Publication date Assignee Title
WO2000026902A1 (en) * 1998-11-04 2000-05-11 Syvox Corporation Apparatus and method for improved memory and resource management in a single-user or multi-user speech recognition system
US9269355B1 (en) * 2013-03-14 2016-02-23 Amazon Technologies, Inc. Load balancing for automatic speech recognition
US9514747B1 (en) * 2013-08-28 2016-12-06 Amazon Technologies, Inc. Reducing speech recognition latency
US9646601B1 (en) * 2013-07-26 2017-05-09 Amazon Technologies, Inc. Reduced latency text-to-speech system

Family Cites Families (23)

Publication number Priority date Publication date Assignee Title
US6629075B1 (en) * 2000-06-09 2003-09-30 Speechworks International, Inc. Load-adjusted speech recognition
US6728677B1 (en) * 2001-01-31 2004-04-27 Nuance Communications Method and system for dynamically improving performance of speech recognition or other speech processing systems
JP4536481B2 (ja) * 2004-10-25 2010-09-01 International Business Machines Corporation Computer system, method for supporting correction work, and program
US7953603B2 (en) * 2005-12-21 2011-05-31 International Business Machines Corporation Load balancing based upon speech processing specific factors
US9002713B2 (en) * 2009-06-09 2015-04-07 At&T Intellectual Property I, L.P. System and method for speech personalization by need
WO2011149558A2 (en) 2010-05-28 2011-12-01 Abelow Daniel H Reality alternate
US9098338B2 (en) 2010-12-17 2015-08-04 Verizon Patent And Licensing Inc. Work flow command processing system
US9769085B2 (en) 2012-05-04 2017-09-19 Citrix Systems, Inc. Systems and methods for adaptive application provisioning
US9069606B2 (en) 2012-05-08 2015-06-30 Adobe Systems Incorporated Autonomous application-level auto-scaling in a cloud
WO2014116888A1 (en) 2013-01-25 2014-07-31 REMTCS Inc. Network security system, method, and apparatus
US9064495B1 (en) * 2013-05-07 2015-06-23 Amazon Technologies, Inc. Measurement of user perceived latency in a cloud based speech application
KR20150026405A (ko) * 2013-09-03 2015-03-11 Samsung Electronics Co., Ltd. Method for transmitting and receiving voice packets and electronic device implementing the same
WO2015105994A1 (en) 2014-01-08 2015-07-16 Callminer, Inc. Real-time conversational analytics facility
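CN103942372B (zh) * 2014-04-04 2017-01-04 Tianjin University FPGA-based multi-rate interface method for transient real-time simulation of active distribution networks
CN104182909A (zh) * 2014-08-21 2014-12-03 Dalian University of Technology Multi-core parallel successive approximation method for optimal scheduling of hydropower systems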
US9535738B2 (en) 2015-04-03 2017-01-03 International Business Machines Corporation Migrating virtual machines based on relative priority of virtual machine in the context of a target hypervisor environment
US9848041B2 (en) 2015-05-01 2017-12-19 Amazon Technologies, Inc. Automatic scaling of resource instance groups within compute clusters
EP3494700A4 (en) * 2016-08-03 2020-01-15 Dejero Labs Inc. SYSTEM AND METHOD FOR CONTROLLING DATA CURRENT MODIFICATIONS
US10193822B1 (en) 2016-08-18 2019-01-29 Amazon Technologies, Inc. Predictive auto-scaling and reactive auto-scaling for network accessible messaging services
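JP2019028538A (ja) 2017-07-26 2019-02-21 Nippon Telegraph and Telephone Corporation Autoscale processing device, autoscale method, and program
CN109509465B (zh) * 2017-09-15 2023-07-25 Alibaba Group Holding Limited Speech signal processing method, component, device, and medium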
US11152005B2 (en) * 2019-09-11 2021-10-19 VIQ Solutions Inc. Parallel processing framework for voice to text digital media
US11183178B2 (en) * 2020-01-13 2021-11-23 Microsoft Technology Licensing, Llc Adaptive batching to reduce recognition latency

Also Published As

Publication number Publication date
DE112021003525B4 (de) 2024-10-02
US20220068280A1 (en) 2022-03-03
GB202304504D0 (en) 2023-05-10
US11521617B2 (en) 2022-12-06
CN116194987B (zh) 2025-08-12
JP7748783B2 (ja) 2025-10-03
DE112021003525T5 (de) 2023-07-06
CN116194987A (zh) 2023-05-30
JP2023540495A (ja) 2023-09-25
WO2022048595A1 (en) 2022-03-10

Similar Documents

Publication Publication Date Title
GB2614179A (en) Speech-to-text auto-scaling for live use cases
CN109697522B (zh) Data prediction method and apparatus
CN107330516B (zh) Model parameter training method, apparatus, and system
US10460241B2 (en) Server and cloud computing resource optimization method thereof for cloud big data computing architecture
US20200342322A1 (en) Method and device for training data, storage medium, and electronic device
US11269686B2 (en) Adaptive consumer thread pool
CN110795284B (zh) Data recovery method, apparatus, device, and readable storage medium
US11704158B2 (en) Managing processing system efficiency
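CN107402810B (zh) Thread allocation method and apparatus
CN110941325B (zh) Processor frequency scaling method and apparatus, and computing device
CN107908367A (zh) Method, apparatus, device, and storage medium for data storage in a storage system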
US20190236439A1 (en) Resource allocation based on decomposed utilization data
CN114489942B (zh) Queue task scheduling method and system for application clusters
EP4422145A2 (en) Adaptive management of casting requests and/or user inputs at a rechargeable device
US20060161920A1 (en) Method, system, and computer program for managing a queuing system
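CN106874100A (zh) Computing resource allocation method and apparatus
JP2023540495A5
CN114895773A (zh) Energy consumption optimization method, system, apparatus, and storage medium for heterogeneous multi-core processors
CN104679590A (zh) Map optimization method and apparatus in a distributed computing system
CN115525400A (zh) Method, device, and program product for managing multiple computing tasks on a batch basis
CN107133332B (zh) Query task allocation method and apparatus
CN107402851B (zh) Data recovery control method and apparatus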
US20170286168A1 (en) Balancing thread groups
CN105468461A (zh) Memory partitioning method and system
CN114048010A (zh) Method, apparatus, device, and storage medium for controlling service timeout