GB2614179A - Speech-to-text auto-scaling for live use cases - Google Patents
Speech-to-text auto-scaling for live use cases Download PDFInfo
- Publication number
- GB2614179A GB2614179A GB2304504.0A GB202304504A GB2614179A GB 2614179 A GB2614179 A GB 2614179A GB 202304504 A GB202304504 A GB 202304504A GB 2614179 A GB2614179 A GB 2614179A
- Authority
- GB
- United Kingdom
- Prior art keywords
- deltas
- current values
- latency
- computational resources
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5011—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
- G06F9/5022—Mechanisms to release resources
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5077—Logical partitioning of resources; Management or configuration of virtualized resources
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5083—Techniques for rebalancing the load in a distributed system
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/285—Memory allocation or algorithm optimisation to reduce hardware requirements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2209/00—Indexing scheme relating to G06F9/00
- G06F2209/50—Indexing scheme relating to G06F9/50
- G06F2209/501—Performance criteria
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Transfer Between Computers (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/010,866 US11521617B2 (en) | 2020-09-03 | 2020-09-03 | Speech-to-text auto-scaling for live use cases |
| PCT/CN2021/116202 WO2022048595A1 (en) | 2020-09-03 | 2021-09-02 | Speech-to-text auto-scaling for live use cases |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| GB202304504D0 GB202304504D0 (en) | 2023-05-10 |
| GB2614179A true GB2614179A (en) | 2023-06-28 |
Family
ID=80358863
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| GB2304504.0A Pending GB2614179A (en) | 2020-09-03 | 2021-09-02 | Speech-to-text auto-scaling for live use cases |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US11521617B2 (https=) |
| JP (1) | JP7748783B2 (https=) |
| CN (1) | CN116194987B (https=) |
| DE (1) | DE112021003525B4 (https=) |
| GB (1) | GB2614179A (https=) |
| WO (1) | WO2022048595A1 (https=) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12008416B2 (en) | 2021-06-29 | 2024-06-11 | Capital One Services, Llc | Systems and methods for choosing an appropriate scaling technique for allocating computational resources to distributed applications |
| US12468574B2 (en) * | 2021-10-28 | 2025-11-11 | Capital One Services, Llc | Systems and methods for dynamically scaling remote resources |
| US12165646B2 (en) * | 2022-04-29 | 2024-12-10 | Zoom Video Communications, Inc. | Delta models for providing privatized speech-to-text during virtual meetings |
| US20230409393A1 (en) * | 2022-05-24 | 2023-12-21 | Microsoft Technology Licensing, Llc | Systems and methods for autoscaling in datacenters |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2000026902A1 (en) * | 1998-11-04 | 2000-05-11 | Syvox Corporation | Apparatus and method for improved memory and resource management in a single-user or multi-user speech recognition system |
| US9269355B1 (en) * | 2013-03-14 | 2016-02-23 | Amazon Technologies, Inc. | Load balancing for automatic speech recognition |
| US9514747B1 (en) * | 2013-08-28 | 2016-12-06 | Amazon Technologies, Inc. | Reducing speech recognition latency |
| US9646601B1 (en) * | 2013-07-26 | 2017-05-09 | Amazon Technologies, Inc. | Reduced latency text-to-speech system |
Family Cites Families (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6629075B1 (en) * | 2000-06-09 | 2003-09-30 | Speechworks International, Inc. | Load-adjusted speech recogintion |
| US6728677B1 (en) * | 2001-01-31 | 2004-04-27 | Nuance Communications | Method and system for dynamically improving performance of speech recognition or other speech processing systems |
| JP4536481B2 (ja) * | 2004-10-25 | 2010-09-01 | インターナショナル・ビジネス・マシーンズ・コーポレーション | コンピュータシステム、修正作業を支援するための方法、及びプログラム |
| US7953603B2 (en) * | 2005-12-21 | 2011-05-31 | International Business Machines Corporation | Load balancing based upon speech processing specific factors |
| US9002713B2 (en) * | 2009-06-09 | 2015-04-07 | At&T Intellectual Property I, L.P. | System and method for speech personalization by need |
| WO2011149558A2 (en) | 2010-05-28 | 2011-12-01 | Abelow Daniel H | Reality alternate |
| US9098338B2 (en) | 2010-12-17 | 2015-08-04 | Verizon Patent And Licensing Inc. | Work flow command processing system |
| US9769085B2 (en) | 2012-05-04 | 2017-09-19 | Citrix Systems, Inc. | Systems and methods for adaptive application provisioning |
| US9069606B2 (en) | 2012-05-08 | 2015-06-30 | Adobe Systems Incorporated | Autonomous application-level auto-scaling in a cloud |
| WO2014116888A1 (en) | 2013-01-25 | 2014-07-31 | REMTCS Inc. | Network security system, method, and apparatus |
| US9064495B1 (en) * | 2013-05-07 | 2015-06-23 | Amazon Technologies, Inc. | Measurement of user perceived latency in a cloud based speech application |
| KR20150026405A (ko) * | 2013-09-03 | 2015-03-11 | 삼성전자주식회사 | 음성 패킷 송수신 방법 및 이를 구현하는 전자 장치 |
| WO2015105994A1 (en) | 2014-01-08 | 2015-07-16 | Callminer, Inc. | Real-time conversational analytics facility |
| CN103942372B (zh) * | 2014-04-04 | 2017-01-04 | 天津大学 | 基于fpga的有源配电网暂态实时仿真多速率接口方法 |
| CN104182909A (zh) * | 2014-08-21 | 2014-12-03 | 大连理工大学 | 一种水电系统优化调度的多核并行逐次逼近方法 |
| US9535738B2 (en) | 2015-04-03 | 2017-01-03 | International Business Machines Corporation | Migrating virtual machines based on relative priority of virtual machine in the context of a target hypervisor environment |
| US9848041B2 (en) | 2015-05-01 | 2017-12-19 | Amazon Technologies, Inc. | Automatic scaling of resource instance groups within compute clusters |
| EP3494700A4 (en) * | 2016-08-03 | 2020-01-15 | Dejero Labs Inc. | SYSTEM AND METHOD FOR CONTROLLING DATA CURRENT MODIFICATIONS |
| US10193822B1 (en) | 2016-08-18 | 2019-01-29 | Amazon Technologies, Inc. | Predictive auto-scaling and reactive auto-scaling for network accessible messaging services |
| JP2019028538A (ja) | 2017-07-26 | 2019-02-21 | 日本電信電話株式会社 | オートスケール処理装置、オートスケール方法及びプログラム |
| CN109509465B (zh) * | 2017-09-15 | 2023-07-25 | 阿里巴巴集团控股有限公司 | 语音信号的处理方法、组件、设备及介质 |
| US11152005B2 (en) * | 2019-09-11 | 2021-10-19 | VIQ Solutions Inc. | Parallel processing framework for voice to text digital media |
| US11183178B2 (en) * | 2020-01-13 | 2021-11-23 | Microsoft Technology Licensing, Llc | Adaptive batching to reduce recognition latency |
-
2020
- 2020-09-03 US US17/010,866 patent/US11521617B2/en active Active
-
2021
- 2021-09-02 WO PCT/CN2021/116202 patent/WO2022048595A1/en not_active Ceased
- 2021-09-02 JP JP2023514502A patent/JP7748783B2/ja active Active
- 2021-09-02 DE DE112021003525.8T patent/DE112021003525B4/de active Active
- 2021-09-02 CN CN202180053854.4A patent/CN116194987B/zh active Active
- 2021-09-02 GB GB2304504.0A patent/GB2614179A/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2000026902A1 (en) * | 1998-11-04 | 2000-05-11 | Syvox Corporation | Apparatus and method for improved memory and resource management in a single-user or multi-user speech recognition system |
| US9269355B1 (en) * | 2013-03-14 | 2016-02-23 | Amazon Technologies, Inc. | Load balancing for automatic speech recognition |
| US9646601B1 (en) * | 2013-07-26 | 2017-05-09 | Amazon Technologies, Inc. | Reduced latency text-to-speech system |
| US9514747B1 (en) * | 2013-08-28 | 2016-12-06 | Amazon Technologies, Inc. | Reducing speech recognition latency |
Also Published As
| Publication number | Publication date |
|---|---|
| DE112021003525B4 (de) | 2024-10-02 |
| US20220068280A1 (en) | 2022-03-03 |
| GB202304504D0 (en) | 2023-05-10 |
| US11521617B2 (en) | 2022-12-06 |
| CN116194987B (zh) | 2025-08-12 |
| JP7748783B2 (ja) | 2025-10-03 |
| DE112021003525T5 (de) | 2023-07-06 |
| CN116194987A (zh) | 2023-05-30 |
| JP2023540495A (ja) | 2023-09-25 |
| WO2022048595A1 (en) | 2022-03-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| GB2614179A (en) | Speech-to-text auto-scaling for live use cases | |
| CN109697522B (zh) | 一种数据预测的方法和装置 | |
| CN107330516B (zh) | 模型参数训练方法、装置及系统 | |
| US10460241B2 (en) | Server and cloud computing resource optimization method thereof for cloud big data computing architecture | |
| US20200342322A1 (en) | Method and device for training data, storage medium, and electronic device | |
| US11269686B2 (en) | Adaptive consumer thread pool | |
| CN110795284B (zh) | 一种数据恢复方法、装置、设备及可读存储介质 | |
| US11704158B2 (en) | Managing processing system efficiency | |
| CN107402810B (zh) | 线程分配方法及装置 | |
| CN110941325B (zh) | 处理器的调频方法及装置、计算设备 | |
| CN107908367A (zh) | 存储系统中数据存储的方法、装置、设备及存储介质 | |
| US20190236439A1 (en) | Resource allocation based on decomposed utilization data | |
| CN114489942B (zh) | 一种面向应用集群的队列任务调度方法及系统 | |
| EP4422145A2 (en) | Adaptive management of casting requests and/or user inputs at a rechargeable device | |
| US20060161920A1 (en) | Method, system, and computer program for managing a queuing system | |
| CN106874100A (zh) | 计算资源分配方法及装置 | |
| JP2023540495A5 (https=) | ||
| CN114895773A (zh) | 异构多核处理器的能耗优化方法、系统、装置及存储介质 | |
| CN104679590A (zh) | 分布式计算系统中的Map优化方法及装置 | |
| CN115525400A (zh) | 基于批次来管理多个计算任务的方法、设备和程序产品 | |
| CN107133332B (zh) | 一种查询任务的分配方法及装置 | |
| CN107402851B (zh) | 一种数据恢复控制方法及装置 | |
| US20170286168A1 (en) | Balancing thread groups | |
| CN105468461A (zh) | 一种内存分区的方法及系统 | |
| CN114048010A (zh) | 服务超时时间的控制方法、装置、设备以及存储介质 |