SG11201811604UA - System and method for real-time transcription of an audio signal into texts - Google Patents

System and method for real-time transcription of an audio signal into texts

Info

Publication number
SG11201811604UA
SG11201811604UA SG11201811604UA SG11201811604UA SG11201811604UA SG 11201811604U A SG11201811604U A SG 11201811604UA SG 11201811604U A SG11201811604U A SG 11201811604UA SG 11201811604U A SG11201811604U A SG 11201811604UA SG 11201811604U A SG11201811604U A SG 11201811604UA
Authority
SG
Singapore
Prior art keywords
speech
audio signal
international
texts
signal
Prior art date
Application number
SG11201811604UA
Other languages
English (en)
Inventor
Shilong Li
Original Assignee
Beijing Didi Infinity Technology & Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Didi Infinity Technology & Development Co Ltd filed Critical Beijing Didi Infinity Technology & Development Co Ltd
Publication of SG11201811604UA publication Critical patent/SG11201811604UA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42221Conversation recording systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/10Aspects of automatic or semi-automatic exchanges related to the purpose or context of the telephonic communication
    • H04M2203/1058Shopping and product ordering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/30Aspects of automatic or semi-automatic exchanges related to audio recordings in general
    • H04M2203/303Marking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • H04M3/5166Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing in combination with interactive voice response systems or voice portals, e.g. as front-ends

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Display Devices Of Pinball Game Machines (AREA)
SG11201811604UA 2017-04-24 2017-04-24 System and method for real-time transcription of an audio signal into texts SG11201811604UA (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/081659 WO2018195704A1 (en) 2017-04-24 2017-04-24 System and method for real-time transcription of an audio signal into texts

Publications (1)

Publication Number Publication Date
SG11201811604UA true SG11201811604UA (en) 2019-01-30

Family

ID=63918749

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201811604UA SG11201811604UA (en) 2017-04-24 2017-04-24 System and method for real-time transcription of an audio signal into texts

Country Status (9)

Country Link
US (1) US20190130913A1 (de)
EP (1) EP3461304A4 (de)
JP (1) JP6918845B2 (de)
CN (1) CN109417583B (de)
AU (2) AU2017411915B2 (de)
CA (1) CA3029444C (de)
SG (1) SG11201811604UA (de)
TW (1) TW201843674A (de)
WO (1) WO2018195704A1 (de)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102018212902A1 (de) * 2018-08-02 2020-02-06 Bayerische Motoren Werke Aktiengesellschaft Verfahren zum Bestimmen eines digitalen Assistenten zum Ausführen einer Fahrzeugfunktion aus einer Vielzahl von digitalen Assistenten in einem Fahrzeug, computerlesbares Medium, System, und Fahrzeug
CN111292735A (zh) * 2018-12-06 2020-06-16 北京嘀嘀无限科技发展有限公司 信号处理装置、方法、电子设备及计算机存储介质
KR20210043995A (ko) * 2019-10-14 2021-04-22 삼성전자주식회사 모델 학습 방법 및 장치, 및 시퀀스 인식 방법
US10848618B1 (en) * 2019-12-31 2020-11-24 Youmail, Inc. Dynamically providing safe phone numbers for responding to inbound communications
US11431658B2 (en) 2020-04-02 2022-08-30 Paymentus Corporation Systems and methods for aggregating user sessions for interactive transactions using virtual assistants
CN114464170A (zh) * 2020-10-21 2022-05-10 阿里巴巴集团控股有限公司 语音交互及语音识别方法、装置、设备和存储介质
CN113035188A (zh) * 2021-02-25 2021-06-25 平安普惠企业管理有限公司 通话文本生成方法、装置、设备及存储介质
CN113421572B (zh) * 2021-06-23 2024-02-02 平安科技(深圳)有限公司 实时音频对话报告生成方法、装置、电子设备及存储介质
CN114827100B (zh) * 2022-04-26 2023-10-13 郑州锐目通信设备有限公司 一种出租车电召方法及系统

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6738784B1 (en) * 2000-04-06 2004-05-18 Dictaphone Corporation Document and information processing system
US20080227438A1 (en) * 2007-03-15 2008-09-18 International Business Machines Corporation Conferencing using publish/subscribe communications
US8279861B2 (en) * 2009-12-08 2012-10-02 International Business Machines Corporation Real-time VoIP communications using n-Way selective language processing
CN102262665A (zh) * 2011-07-26 2011-11-30 西南交通大学 基于关键词提取的应答支持系统
US9368116B2 (en) * 2012-09-07 2016-06-14 Verint Systems Ltd. Speaker separation in diarization
CN102903361A (zh) * 2012-10-15 2013-01-30 Itp创新科技有限公司 一种通话即时翻译系统和方法
WO2015014409A1 (en) * 2013-08-02 2015-02-05 Telefonaktiebolaget L M Ericsson (Publ) Transcription of communication sessions
CN103533129B (zh) * 2013-10-23 2017-06-23 上海斐讯数据通信技术有限公司 实时的语音翻译通信方法、系统及所适用的通讯设备
CN103680134B (zh) * 2013-12-31 2016-08-24 北京东方车云信息技术有限公司 一种提供打车服务的方法、装置及系统
US9614969B2 (en) * 2014-05-27 2017-04-04 Microsoft Technology Licensing, Llc In-call translation
US20150347399A1 (en) * 2014-05-27 2015-12-03 Microsoft Technology Licensing, Llc In-Call Translation
CN104216972A (zh) * 2014-08-28 2014-12-17 小米科技有限责任公司 一种发送打车业务请求的方法和装置

Also Published As

Publication number Publication date
EP3461304A1 (de) 2019-04-03
EP3461304A4 (de) 2019-05-22
CA3029444C (en) 2021-08-31
AU2017411915B2 (en) 2020-01-30
TW201843674A (zh) 2018-12-16
CN109417583A (zh) 2019-03-01
JP2019537041A (ja) 2019-12-19
CN109417583B (zh) 2022-01-28
US20190130913A1 (en) 2019-05-02
AU2020201997A1 (en) 2020-04-09
JP6918845B2 (ja) 2021-08-11
AU2017411915A1 (en) 2019-01-24
WO2018195704A1 (en) 2018-11-01
CA3029444A1 (en) 2018-11-01
AU2020201997B2 (en) 2021-03-11

Similar Documents

Publication Publication Date Title
SG11201811604UA (en) System and method for real-time transcription of an audio signal into texts
SG11201811240XA (en) Systems and methods for route planning
SG11201811174XA (en) Systems and methods for determining estimated time of arrival
SG11201811659PA (en) Systems and methods for determining an estimated time of arrival
SG11201811283PA (en) System and method for determining safety score of driver
SG11201811740WA (en) Systems and methods for identifying risky driving behavior
CA3015496A1 (en) Voice control of a media playback system
SG11201903738QA (en) Offshore gnss reference station apparatus, offshore gnss positioning system, and method of generating positioning reference data offshore
SG11201811535RA (en) Systems and methods for allocating service requests
SG11201906875RA (en) Ultra-reliable low-latency communication indication channelization designs
SG11201903130WA (en) Sequence to sequence transformations for speech synthesis via recurrent neural networks
SG11201903604PA (en) Iot security service
SG11201806806YA (en) System and method for processing simultaneous carpool requests
SG11201811765TA (en) Methods and systems for providing transportation service
SG11201902667UA (en) Methods and systems for chromatography data analysis
SG11201900293PA (en) Method and device for displaying application information
SG11201902457UA (en) Decoupling of synchronization raster and channel raster
SG11201805950UA (en) Self-assembled nanostructures and separation membranes comprising aquaporin water channels and methods of making and using them
SG11201811690TA (en) Systems and methods for cheat examination
SG11201811742YA (en) Systems and methods for information processing
SG11201811723QA (en) Using a mobile phone for monitoring a medical device
SG11201803998PA (en) Systems and methods for updating sequence of services
SG11201807325UA (en) Optimizing range of aircraft docking system
SG11201909561RA (en) Octree-based convolutional neural network
SG11201900665VA (en) Cannabis composition