WO2000046787A3 - Systeme et procede d'automatisation de services de transcription - Google Patents

Systeme et procede d'automatisation de services de transcription Download PDF

Info

Publication number
WO2000046787A3
WO2000046787A3 PCT/US2000/002808 US0002808W WO0046787A3 WO 2000046787 A3 WO2000046787 A3 WO 2000046787A3 US 0002808 W US0002808 W US 0002808W WO 0046787 A3 WO0046787 A3 WO 0046787A3
Authority
WO
WIPO (PCT)
Prior art keywords
training
file
speech recognition
status
recognition program
Prior art date
Application number
PCT/US2000/002808
Other languages
English (en)
Other versions
WO2000046787A2 (fr
Inventor
Jonathan Kahn
Charles Qin
Thomas P Flynn
Robert J Tippe
Original Assignee
Custom Speech Usa Inc
Jonathan Kahn
Charles Qin
Thomas P Flynn
Robert J Tippe
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Custom Speech Usa Inc, Jonathan Kahn, Charles Qin, Thomas P Flynn, Robert J Tippe filed Critical Custom Speech Usa Inc
Priority to CA002362462A priority Critical patent/CA2362462A1/fr
Priority to US09/889,870 priority patent/US7006967B1/en
Priority to AU35882/00A priority patent/AU3588200A/en
Priority to GB0118231A priority patent/GB2361569B/en
Publication of WO2000046787A2 publication Critical patent/WO2000046787A2/fr
Publication of WO2000046787A3 publication Critical patent/WO2000046787A3/fr
Priority to US10/014,677 priority patent/US20020095290A1/en
Priority to HK02101880.9A priority patent/HK1041086A1/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221Announcement of recognition results

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)
  • Telephonic Communication Services (AREA)

Abstract

La présente invention concerne un système d'automatisation substantielle de services de transcription pour utilisateurs vocaux multiples comprenant une station de transcription manuelle, un programme de reconnaissance vocale et un programme d'acheminement. Ce système établit le profil de chaque utilisateur vocal contenant un indicateur de niveau de formation à choisir entre les niveaux d'embauche, de formation, automatisé et d'arrêt d'automatisation. En outre, ce système produit un fichier de dictée vocale à identification unique établi à partir d'un utilisateur vocal courant et, en s'appuyant sur le niveau connu de formation, le système achemine le fichier de dictée vocale à identification unique à une station de transcription manuelle et au programme de reconnaissance vocale. Un transcripteur humain crée des fichiers transcrits pour chaque fichier de dictée vocale reçu. Le programme de reconnaissance vocale crée automatiquement un texte écrit pour chaque dictée vocale reçue, si l'utilisateur courant est du niveau formation ou automatisé. Un fichier mot à mot est manuellement établi si l'utilisateur courant est du niveau embauche ou formation, le programme de reconnaissance vocale subissant un entraînement au moyen d'un modèle acoustique adapté pour l'utilisateur courant, avec utilisation du fichier mot à mot et du fichier de dictée vocale si l'utilisateur courant est du niveau embauche ou formation. En outre, le fichier transcrit est renvoyé à l'utilisateur courant s'il est du niveau embauche ou formation, ou le texte écrit est renvoyé à l'utilisateur s'il est du niveau automatisé. En l'occurrence, cette invention concerne un appareil et un procédé permettant de tester les aptitudes d'un transcripteur humain. On peut également utiliser ces appareil et procédé pour établir un nouveau modèle d'enseignement linguistique largement diffusé.
PCT/US2000/002808 1999-02-05 2000-02-04 Systeme et procede d'automatisation de services de transcription WO2000046787A2 (fr)

Priority Applications (6)

Application Number Priority Date Filing Date Title
CA002362462A CA2362462A1 (fr) 1999-02-05 2000-02-04 Systeme et procede d'automatisation de services de transcription
US09/889,870 US7006967B1 (en) 1999-02-05 2000-02-04 System and method for automating transcription services
AU35882/00A AU3588200A (en) 1999-02-05 2000-02-04 System and method for automating transcription services
GB0118231A GB2361569B (en) 1999-02-05 2000-02-04 System and method for automating transcription services
US10/014,677 US20020095290A1 (en) 1999-02-05 2001-12-11 Speech recognition program mapping tool to align an audio file to verbatim text
HK02101880.9A HK1041086A1 (zh) 1999-02-05 2002-03-12 自動轉錄服務系統及方法

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11894999P 1999-02-05 1999-02-05
US60/118,949 1999-02-05

Publications (2)

Publication Number Publication Date
WO2000046787A2 WO2000046787A2 (fr) 2000-08-10
WO2000046787A3 true WO2000046787A3 (fr) 2000-12-14

Family

ID=22381731

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/002808 WO2000046787A2 (fr) 1999-02-05 2000-02-04 Systeme et procede d'automatisation de services de transcription

Country Status (5)

Country Link
AU (1) AU3588200A (fr)
CA (1) CA2362462A1 (fr)
GB (1) GB2361569B (fr)
HK (1) HK1041086A1 (fr)
WO (1) WO2000046787A2 (fr)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7383187B2 (en) 2001-01-24 2008-06-03 Bevocal, Inc. System, method and computer program product for a distributed speech recognition tuning platform
US7174296B2 (en) 2001-03-16 2007-02-06 Koninklijke Philips Electronics N.V. Transcription service stopping automatic transcription
DE10126020A1 (de) * 2001-05-28 2003-01-09 Olaf Berberich Hybrides Diktier-/Dialogsystem für Spracheingabe und Tastaturbestätigung
GB2381688B (en) 2001-11-03 2004-09-22 Dremedia Ltd Time ordered indexing of audio-visual data
GB2381638B (en) 2001-11-03 2004-02-04 Dremedia Ltd Identifying audio characteristics
US20080086305A1 (en) * 2006-10-02 2008-04-10 Bighand Ltd. Digital dictation workflow system and method
WO2009016474A2 (fr) 2007-07-31 2009-02-05 Bighand Ltd. Système et procédé pour fournir efficacement du contenu sur un réseau de clients légers
CN109285548A (zh) * 2017-07-19 2019-01-29 阿里巴巴集团控股有限公司 信息处理方法、系统、电子设备、和计算机存储介质
CN116074150B (zh) * 2023-03-02 2023-06-09 广东浩博特科技股份有限公司 智能家居的开关控制方法、装置以及智能家居

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5799273A (en) * 1996-09-24 1998-08-25 Allvoice Computing Plc Automated proofreading using interface linking recognized words to their audio data while text is being changed
US5875448A (en) * 1996-10-08 1999-02-23 Boys; Donald R. Data stream editing system including a hand-held voice-editing apparatus having a position-finding enunciator

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5799273A (en) * 1996-09-24 1998-08-25 Allvoice Computing Plc Automated proofreading using interface linking recognized words to their audio data while text is being changed
US5875448A (en) * 1996-10-08 1999-02-23 Boys; Donald R. Data stream editing system including a hand-held voice-editing apparatus having a position-finding enunciator

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"DRAGON DICTATE FOR WINDOWS", DRAGON DICTATE USER'S GUIDE, XX, XX, 1 January 1995 (1995-01-01), XX, pages 01A - 01L + 01, XP002929983 *

Also Published As

Publication number Publication date
GB2361569A (en) 2001-10-24
CA2362462A1 (fr) 2000-08-10
AU3588200A (en) 2000-08-25
HK1041086A1 (zh) 2002-06-28
GB0118231D0 (en) 2001-09-19
WO2000046787A2 (fr) 2000-08-10
GB2361569B (en) 2003-12-24

Similar Documents

Publication Publication Date Title
CA2351705A1 (fr) Systeme et procede pour services de transcription automatique
AP2001002243A0 (en) Automated transcription system and method using two speech converting instances and computer-assisted correction.
JP3282075B2 (ja) 連続音声認識において句読点を自動的に生成する装置および方法
Traunmüller Conventional, biological and environmental factors in speech communication: a modulation theory
Morgan et al. Meetings about meetings: research at ICSI on speech in multiparty conversations
WO2002054033A3 (fr) Systeme et procede informatiques de reconnaissance de langage bases sur une lecture multiple
ATE297588T1 (de) Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
KR100321841B1 (ko) 스피치 애플리케이션의 언어 모델 갱신 방법
EP1022722A3 (fr) Adaptation au locuteur basée sur des vecteurs propres de voix
CN103903627A (zh) 一种语音数据的传输方法及装置
CN105304080A (zh) 语音合成装置及方法
HK1054813A1 (en) Language independent voice-based user interface
WO2006023631A3 (fr) Adaptation d'un systeme de transcription de documents
WO2004003688A8 (fr) Procede pour comparer un fichier texte transcrit avec un fichier cree prealablement
ATE314718T1 (de) Srecherangepasste spracherkennung
SG128406A1 (en) Character recognizing and translating system and voice recognizing and translating system
WO2001097213A8 (fr) Utilisation d'estimations de confiance du niveau d'emission de parole
WO2002097590A3 (fr) Systeme de gestion des informations a commande vocale et independant du langage
DE69822179D1 (de) Verfahren zum lernen von mustern für die sprach- oder die sprechererkennung
EP0867857A3 (fr) EnrÔlement dans la reconnaissance de la parole
EP1349145A3 (fr) Système et procédé permettant la gestion des informations utilisant un interface de dialogue parlé
EP2306451A3 (fr) Système et procédés permettant d'ameliorer l'exactitude de la reconnaissance vocale
WO1999059135A3 (fr) Dispositif et procede de reconnaissance d'un vocabulaire predetermine dans une parole au moyen d'un ordinateur
ATE407411T1 (de) Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
CN111192585A (zh) 一种音乐播放控制系统、控制方法及智能家电

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

WWE Wipo information: entry into national phase

Ref document number: 09889870

Country of ref document: US

ENP Entry into the national phase

Ref document number: 200118231

Country of ref document: GB

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 35882/00

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: IN/PCT/2001/780/KOL

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 2362462

Country of ref document: CA

Ref document number: 2362462

Country of ref document: CA

Kind code of ref document: A

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase