DE112021003525B4 - Automatische sprache-zu-text-skalierung für live-anwendungsfälle - Google Patents
Automatische sprache-zu-text-skalierung für live-anwendungsfälle Download PDFInfo
- Publication number
- DE112021003525B4 DE112021003525B4 DE112021003525.8T DE112021003525T DE112021003525B4 DE 112021003525 B4 DE112021003525 B4 DE 112021003525B4 DE 112021003525 T DE112021003525 T DE 112021003525T DE 112021003525 B4 DE112021003525 B4 DE 112021003525B4
- Authority
- DE
- Germany
- Prior art keywords
- deltas
- latency
- current values
- predefined
- threshold
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5011—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
- G06F9/5022—Mechanisms to release resources
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5077—Logical partitioning of resources; Management or configuration of virtualized resources
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5083—Techniques for rebalancing the load in a distributed system
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/285—Memory allocation or algorithm optimisation to reduce hardware requirements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2209/00—Indexing scheme relating to G06F9/00
- G06F2209/50—Indexing scheme relating to G06F9/50
- G06F2209/501—Performance criteria
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Transfer Between Computers (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/010,866 | 2020-09-03 | ||
| US17/010,866 US11521617B2 (en) | 2020-09-03 | 2020-09-03 | Speech-to-text auto-scaling for live use cases |
| PCT/CN2021/116202 WO2022048595A1 (en) | 2020-09-03 | 2021-09-02 | Speech-to-text auto-scaling for live use cases |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| DE112021003525T5 DE112021003525T5 (de) | 2023-07-06 |
| DE112021003525B4 true DE112021003525B4 (de) | 2024-10-02 |
Family
ID=80358863
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| DE112021003525.8T Active DE112021003525B4 (de) | 2020-09-03 | 2021-09-02 | Automatische sprache-zu-text-skalierung für live-anwendungsfälle |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US11521617B2 (https=) |
| JP (1) | JP7748783B2 (https=) |
| CN (1) | CN116194987B (https=) |
| DE (1) | DE112021003525B4 (https=) |
| GB (1) | GB2614179A (https=) |
| WO (1) | WO2022048595A1 (https=) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12008416B2 (en) | 2021-06-29 | 2024-06-11 | Capital One Services, Llc | Systems and methods for choosing an appropriate scaling technique for allocating computational resources to distributed applications |
| US12468574B2 (en) * | 2021-10-28 | 2025-11-11 | Capital One Services, Llc | Systems and methods for dynamically scaling remote resources |
| US12165646B2 (en) * | 2022-04-29 | 2024-12-10 | Zoom Video Communications, Inc. | Delta models for providing privatized speech-to-text during virtual meetings |
| US20230409393A1 (en) * | 2022-05-24 | 2023-12-21 | Microsoft Technology Licensing, Llc | Systems and methods for autoscaling in datacenters |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070143116A1 (en) | 2005-12-21 | 2007-06-21 | International Business Machines Corporation | Load balancing based upon speech processing specific factors |
| US20100312556A1 (en) | 2009-06-09 | 2010-12-09 | AT & T Intellectual Property I , L.P. | System and method for speech personalization by need |
| US9098338B2 (en) | 2010-12-17 | 2015-08-04 | Verizon Patent And Licensing Inc. | Work flow command processing system |
| US20160292000A1 (en) | 2015-04-03 | 2016-10-06 | International Business Machines Corporation | Migrating virtual machines based on relative priority of virtual machine in the context of a target hypervisor environment |
Family Cites Families (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2000026902A1 (en) * | 1998-11-04 | 2000-05-11 | Syvox Corporation | Apparatus and method for improved memory and resource management in a single-user or multi-user speech recognition system |
| US6629075B1 (en) * | 2000-06-09 | 2003-09-30 | Speechworks International, Inc. | Load-adjusted speech recogintion |
| US6728677B1 (en) * | 2001-01-31 | 2004-04-27 | Nuance Communications | Method and system for dynamically improving performance of speech recognition or other speech processing systems |
| JP4536481B2 (ja) * | 2004-10-25 | 2010-09-01 | インターナショナル・ビジネス・マシーンズ・コーポレーション | コンピュータシステム、修正作業を支援するための方法、及びプログラム |
| WO2011149558A2 (en) | 2010-05-28 | 2011-12-01 | Abelow Daniel H | Reality alternate |
| US9769085B2 (en) | 2012-05-04 | 2017-09-19 | Citrix Systems, Inc. | Systems and methods for adaptive application provisioning |
| US9069606B2 (en) | 2012-05-08 | 2015-06-30 | Adobe Systems Incorporated | Autonomous application-level auto-scaling in a cloud |
| WO2014116888A1 (en) | 2013-01-25 | 2014-07-31 | REMTCS Inc. | Network security system, method, and apparatus |
| US9269355B1 (en) * | 2013-03-14 | 2016-02-23 | Amazon Technologies, Inc. | Load balancing for automatic speech recognition |
| US9064495B1 (en) * | 2013-05-07 | 2015-06-23 | Amazon Technologies, Inc. | Measurement of user perceived latency in a cloud based speech application |
| US9646601B1 (en) * | 2013-07-26 | 2017-05-09 | Amazon Technologies, Inc. | Reduced latency text-to-speech system |
| US9514747B1 (en) * | 2013-08-28 | 2016-12-06 | Amazon Technologies, Inc. | Reducing speech recognition latency |
| KR20150026405A (ko) * | 2013-09-03 | 2015-03-11 | 삼성전자주식회사 | 음성 패킷 송수신 방법 및 이를 구현하는 전자 장치 |
| WO2015105994A1 (en) | 2014-01-08 | 2015-07-16 | Callminer, Inc. | Real-time conversational analytics facility |
| CN103942372B (zh) * | 2014-04-04 | 2017-01-04 | 天津大学 | 基于fpga的有源配电网暂态实时仿真多速率接口方法 |
| CN104182909A (zh) * | 2014-08-21 | 2014-12-03 | 大连理工大学 | 一种水电系统优化调度的多核并行逐次逼近方法 |
| US9848041B2 (en) | 2015-05-01 | 2017-12-19 | Amazon Technologies, Inc. | Automatic scaling of resource instance groups within compute clusters |
| EP3494700A4 (en) * | 2016-08-03 | 2020-01-15 | Dejero Labs Inc. | SYSTEM AND METHOD FOR CONTROLLING DATA CURRENT MODIFICATIONS |
| US10193822B1 (en) | 2016-08-18 | 2019-01-29 | Amazon Technologies, Inc. | Predictive auto-scaling and reactive auto-scaling for network accessible messaging services |
| JP2019028538A (ja) | 2017-07-26 | 2019-02-21 | 日本電信電話株式会社 | オートスケール処理装置、オートスケール方法及びプログラム |
| CN109509465B (zh) * | 2017-09-15 | 2023-07-25 | 阿里巴巴集团控股有限公司 | 语音信号的处理方法、组件、设备及介质 |
| US11152005B2 (en) * | 2019-09-11 | 2021-10-19 | VIQ Solutions Inc. | Parallel processing framework for voice to text digital media |
| US11183178B2 (en) * | 2020-01-13 | 2021-11-23 | Microsoft Technology Licensing, Llc | Adaptive batching to reduce recognition latency |
-
2020
- 2020-09-03 US US17/010,866 patent/US11521617B2/en active Active
-
2021
- 2021-09-02 WO PCT/CN2021/116202 patent/WO2022048595A1/en not_active Ceased
- 2021-09-02 JP JP2023514502A patent/JP7748783B2/ja active Active
- 2021-09-02 DE DE112021003525.8T patent/DE112021003525B4/de active Active
- 2021-09-02 CN CN202180053854.4A patent/CN116194987B/zh active Active
- 2021-09-02 GB GB2304504.0A patent/GB2614179A/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070143116A1 (en) | 2005-12-21 | 2007-06-21 | International Business Machines Corporation | Load balancing based upon speech processing specific factors |
| US20100312556A1 (en) | 2009-06-09 | 2010-12-09 | AT & T Intellectual Property I , L.P. | System and method for speech personalization by need |
| US9098338B2 (en) | 2010-12-17 | 2015-08-04 | Verizon Patent And Licensing Inc. | Work flow command processing system |
| US20160292000A1 (en) | 2015-04-03 | 2016-10-06 | International Business Machines Corporation | Migrating virtual machines based on relative priority of virtual machine in the context of a target hypervisor environment |
Also Published As
| Publication number | Publication date |
|---|---|
| US20220068280A1 (en) | 2022-03-03 |
| GB202304504D0 (en) | 2023-05-10 |
| US11521617B2 (en) | 2022-12-06 |
| CN116194987B (zh) | 2025-08-12 |
| JP7748783B2 (ja) | 2025-10-03 |
| DE112021003525T5 (de) | 2023-07-06 |
| CN116194987A (zh) | 2023-05-30 |
| JP2023540495A (ja) | 2023-09-25 |
| WO2022048595A1 (en) | 2022-03-10 |
| GB2614179A (en) | 2023-06-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DE112021003525B4 (de) | Automatische sprache-zu-text-skalierung für live-anwendungsfälle | |
| DE112021004199B4 (de) | Genauigkeit einer Datenstrom- Umsetzungsfunktion für rekurrente neuronale Netzwerke | |
| DE112012003505T5 (de) | Automatisierte Auswahl von Funktionen zum Verringern der Speicherkapazität auf der Grundlage von Leistungsanforderungen | |
| DE112020005323T5 (de) | Elastische ausführung von machine-learning-arbeitslasten unter verwendung einer anwendungsbasierten profilierung | |
| DE112012004336T5 (de) | System, Verfahren und Programmprodukt für kostenbewusste Auswahl von Vorlagen zum Bereitstellen von gemeinsam genutzten Ressourcen | |
| DE102021131913B4 (de) | Optimieren eines planens einer einheitenaktualisierung | |
| DE112013004805T5 (de) | Unterstützen eines koordinierten Zugriffs auf einen gemeinsam genutzten Speicher eines Dateisystems unter Verwendung einer automatischen Ausrichtung eines Protokolls für einen parallelen Dateizugriff und einer Metadatenverwaltung | |
| DE102013205572A1 (de) | Verwenden von softwarekomponenten-metadaten zum bereitstellen von virtuellen maschinen in einer vernetzten datenverarbeitungsumgebung | |
| DE112021003184T5 (de) | Ermittlung von laufzeitumgebungen für software-container | |
| DE102021123133A1 (de) | Selektive anzeige sensibler daten | |
| DE112020005306T5 (de) | Implementierung von arbeitslasten in einer multi-cloud-umgebung | |
| DE112021000390T5 (de) | Anpassen der leistung eines datenverarbeitungssystems | |
| DE112021004234T5 (de) | Einsetzen von metalernen zum optimieren der automatischen auswahl von pipelinesdes maschinellen lernens | |
| DE112021002246T5 (de) | Symphonisierung der serverlosen funktionen von hybriden diensten | |
| DE112021003401T5 (de) | Schattenexperimente für serverlose multi-tenant-cloud-dienste | |
| DE112021004227B4 (de) | Prädiktive kommunikationskompensation | |
| DE102021130965A1 (de) | Aufrüsten einer sequenz von mikrodiensten in einer cloud-computing-umgebung | |
| DE112022004517T5 (de) | Optimierung von lippensynchronisation in einem in natürliche sprache übersetzten video | |
| DE112020004925T5 (de) | Aktualisieren und umsetzen eines dokuments aus einem audiovorgang | |
| DE112022004584T5 (de) | Verwalten einer neuen version eines integrationsflusses während einer rollierenden aktualisierung | |
| DE102016105062A1 (de) | Nähengestützte Berechtigungsprüfung für einheitenübergreifend verteilte Daten | |
| DE112022001431T5 (de) | Adaptive auswahl von datenmodalitäten für eine effiziente videoerkennung | |
| DE102014116744A1 (de) | Management von Informationstechnologieressourcen | |
| DE112020005801B4 (de) | Erkennen eines datenverlustrisikos bei 5g-fähigen einheiten | |
| DE112022000466T5 (de) | Strategie für chunking und überlappungsdecodierung für im datenstrom übertragende rnn-wandler zur spracherkennung |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| R012 | Request for examination validly filed | ||
| R016 | Response to examination communication | ||
| R018 | Grant decision by examination section/examining division | ||
| R084 | Declaration of willingness to licence | ||
| R020 | Patent grant now final |