CA3036998A1 - Systemes et procedes pour la reconnaissance et la comprehension adaptatives d'entites de noms propres - Google Patents
Systemes et procedes pour la reconnaissance et la comprehension adaptatives d'entites de noms propres Download PDFInfo
- Publication number
- CA3036998A1 CA3036998A1 CA3036998A CA3036998A CA3036998A1 CA 3036998 A1 CA3036998 A1 CA 3036998A1 CA 3036998 A CA3036998 A CA 3036998A CA 3036998 A CA3036998 A CA 3036998A CA 3036998 A1 CA3036998 A1 CA 3036998A1
- Authority
- CA
- Canada
- Prior art keywords
- grammar
- words
- span
- acoustic
- recognizer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 171
- 230000003044 adaptive effect Effects 0.000 title description 16
- 230000006978 adaptation Effects 0.000 claims description 170
- 238000013518 transcription Methods 0.000 claims description 135
- 230000035897 transcription Effects 0.000 claims description 135
- 238000012545 processing Methods 0.000 claims description 57
- 230000008569 process Effects 0.000 claims description 44
- 230000037361 pathway Effects 0.000 claims description 15
- 230000000694 effects Effects 0.000 claims description 12
- 230000001419 dependent effect Effects 0.000 claims description 11
- 230000005236 sound signal Effects 0.000 description 57
- 238000002360 preparation method Methods 0.000 description 18
- 238000005516 engineering process Methods 0.000 description 15
- 230000006870 function Effects 0.000 description 15
- 238000010586 diagram Methods 0.000 description 13
- 230000004927 fusion Effects 0.000 description 13
- 230000008901 benefit Effects 0.000 description 11
- 238000012937 correction Methods 0.000 description 11
- 238000013461 design Methods 0.000 description 11
- 230000007246 mechanism Effects 0.000 description 10
- 238000012986 modification Methods 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 230000009471 action Effects 0.000 description 7
- 230000015556 catabolic process Effects 0.000 description 7
- 238000002372 labelling Methods 0.000 description 7
- 238000013459 approach Methods 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 230000010485 coping Effects 0.000 description 5
- 238000009877 rendering Methods 0.000 description 5
- 230000008859 change Effects 0.000 description 4
- 241000272814 Anser sp. Species 0.000 description 3
- 235000005156 Brassica carinata Nutrition 0.000 description 3
- 244000257790 Brassica carinata Species 0.000 description 3
- 241000227653 Lycopersicon Species 0.000 description 3
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 238000004891 communication Methods 0.000 description 3
- 230000002950 deficient Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 230000007704 transition Effects 0.000 description 3
- 230000000454 anti-cipatory effect Effects 0.000 description 2
- 230000003190 augmentative effect Effects 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 235000007466 Corylus avellana Nutrition 0.000 description 1
- 240000003211 Corylus maxima Species 0.000 description 1
- OTMSDBZUPAUEDD-UHFFFAOYSA-N Ethane Chemical compound CC OTMSDBZUPAUEDD-UHFFFAOYSA-N 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000001010 compromised effect Effects 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 235000021438 curry Nutrition 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 238000007499 fusion processing Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 231100000989 no adverse effect Toxicity 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000013138 pruning Methods 0.000 description 1
- 238000004549 pulsed laser deposition Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 210000003462 vein Anatomy 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C21/00—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
- G01C21/26—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
- G01C21/34—Route searching; Route guidance
- G01C21/36—Input/output arrangements for on-board computers
- G01C21/3605—Destination input or retrieval
- G01C21/3608—Destination input or retrieval using speech input, e.g. using speech recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Remote Sensing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Radar, Positioning & Navigation (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Automation & Control Theory (AREA)
- General Engineering & Computer Science (AREA)
- Machine Translation (AREA)
Abstract
Divers modes de réalisation concernent des systèmes et des procédés de reconnaissance automatique de la parole (ASR) et de compréhension du langage naturel (NLU), qui permettent d'obtenir une reconnaissance et une compréhension très précises d'énoncés prononcés librement, pouvant contenir des noms propres et des entités similaires. Les entités de noms propres peuvent se composer, en totalité ou en partie, de mots qui ne sont pas présents dans les vocabulaires de ces systèmes tels qu'ils sont constitués normalement. La reconnaissance des autres mots dans les énoncés en question, par exemple des mots qui ne font pas partie des entités de noms propres, peut se produire avec une précision régulière et élevée. Divers modes de réalisation produisent en sortie, non seulement un texte courant transcrit avec précision pour la totalité de l'énoncé, mais aussi une représentation symbolique de la signification de l'entrée, dont des représentations symboliques appropriées d'entités de noms propres, aptes à permettre à un système informatique de répondre de manière appropriée à la demande parlée, sans autre analyse de l'entrée utilisateur.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/269,924 US9818401B2 (en) | 2013-05-30 | 2016-09-19 | Systems and methods for adaptive proper name entity recognition and understanding |
US15/269,924 | 2016-09-19 | ||
PCT/US2017/052251 WO2018053502A1 (fr) | 2016-09-19 | 2017-09-19 | Systèmes et procédés pour la reconnaissance et la compréhension adaptatives d'entités de noms propres |
Publications (1)
Publication Number | Publication Date |
---|---|
CA3036998A1 true CA3036998A1 (fr) | 2018-03-22 |
Family
ID=61620171
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA3036998A Pending CA3036998A1 (fr) | 2016-09-19 | 2017-09-19 | Systemes et procedes pour la reconnaissance et la comprehension adaptatives d'entites de noms propres |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP3516649A4 (fr) |
AU (2) | AU2017326987B2 (fr) |
CA (1) | CA3036998A1 (fr) |
WO (1) | WO2018053502A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113688628A (zh) * | 2021-07-28 | 2021-11-23 | 上海携宁计算机科技股份有限公司 | 文本识别方法、电子设备和计算机可读存储介质 |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11886473B2 (en) | 2018-04-20 | 2024-01-30 | Meta Platforms, Inc. | Intent identification for agent matching by assistant systems |
US10963273B2 (en) | 2018-04-20 | 2021-03-30 | Facebook, Inc. | Generating personalized content summaries for users |
CN109257547B (zh) * | 2018-09-21 | 2021-04-06 | 南京邮电大学 | 中文在线音视频的字幕生成方法 |
CN111159366A (zh) * | 2019-12-05 | 2020-05-15 | 重庆兆光科技股份有限公司 | 一种基于正交主题表示的问答优化方法 |
CN111415655B (zh) * | 2020-02-12 | 2024-04-12 | 北京声智科技有限公司 | 语言模型构建方法、装置及存储介质 |
CN114757176B (zh) * | 2022-05-24 | 2023-05-02 | 上海弘玑信息技术有限公司 | 一种获取目标意图识别模型的方法以及意图识别方法 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7120582B1 (en) * | 1999-09-07 | 2006-10-10 | Dragon Systems, Inc. | Expanding an effective vocabulary of a speech recognition system |
WO2002019147A1 (fr) * | 2000-08-28 | 2002-03-07 | Emotion, Inc. | Procede et dispositif utiles pour la gestion, l'extraction et le partage de contenus numeriques |
WO2008106655A1 (fr) * | 2007-03-01 | 2008-09-04 | Apapx, Inc. | Système et procédé d'apprentissage dynamique |
US8583436B2 (en) * | 2007-12-21 | 2013-11-12 | Nec Corporation | Word category estimation apparatus, word category estimation method, speech recognition apparatus, speech recognition method, program, and recording medium |
US8108214B2 (en) * | 2008-11-19 | 2012-01-31 | Robert Bosch Gmbh | System and method for recognizing proper names in dialog systems |
US9818401B2 (en) * | 2013-05-30 | 2017-11-14 | Promptu Systems Corporation | Systems and methods for adaptive proper name entity recognition and understanding |
US9449599B2 (en) | 2013-05-30 | 2016-09-20 | Promptu Systems Corporation | Systems and methods for adaptive proper name entity recognition and understanding |
-
2017
- 2017-09-19 EP EP17851782.7A patent/EP3516649A4/fr active Pending
- 2017-09-19 CA CA3036998A patent/CA3036998A1/fr active Pending
- 2017-09-19 AU AU2017326987A patent/AU2017326987B2/en active Active
- 2017-09-19 WO PCT/US2017/052251 patent/WO2018053502A1/fr unknown
-
2022
- 2022-11-02 AU AU2022263497A patent/AU2022263497A1/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113688628A (zh) * | 2021-07-28 | 2021-11-23 | 上海携宁计算机科技股份有限公司 | 文本识别方法、电子设备和计算机可读存储介质 |
CN113688628B (zh) * | 2021-07-28 | 2023-09-22 | 上海携宁计算机科技股份有限公司 | 文本识别方法、电子设备和计算机可读存储介质 |
Also Published As
Publication number | Publication date |
---|---|
EP3516649A1 (fr) | 2019-07-31 |
AU2017326987B2 (en) | 2022-08-04 |
AU2022263497A1 (en) | 2022-12-22 |
WO2018053502A1 (fr) | 2018-03-22 |
AU2017326987A1 (en) | 2019-04-11 |
EP3516649A4 (fr) | 2020-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9818401B2 (en) | Systems and methods for adaptive proper name entity recognition and understanding | |
US11783830B2 (en) | Systems and methods for adaptive proper name entity recognition and understanding | |
US9449599B2 (en) | Systems and methods for adaptive proper name entity recognition and understanding | |
AU2017326987B2 (en) | Systems and methods for adaptive proper name entity recognition and understanding | |
US10957312B2 (en) | Scalable dynamic class language modeling | |
US20230317074A1 (en) | Contextual voice user interface | |
US8346537B2 (en) | Input apparatus, input method and input program | |
US11830485B2 (en) | Multiple speech processing system with synthesized speech styles | |
US8380505B2 (en) | System for recognizing speech for searching a database | |
JP4705023B2 (ja) | 音声認識装置、音声認識方法、及びプログラム | |
US11270687B2 (en) | Phoneme-based contextualization for cross-lingual speech recognition in end-to-end models | |
AU2023258338A1 (en) | Systems and methods for adaptive proper name entity recognition and understanding | |
JP5189874B2 (ja) | 多言語の非ネイティブ音声の認識 | |
EP3005152B1 (fr) | Systèmes et procédés de reconnaissance et compréhension d'entités de noms propres adaptatives | |
JP2008243080A (ja) | 音声を翻訳する装置、方法およびプログラム | |
KR101424496B1 (ko) | 음향 모델 학습을 위한 장치 및 이를 위한 방법이 기록된 컴퓨터 판독 가능한 기록매체 | |
KR101483947B1 (ko) | 핵심어에서의 음소 오류 결과를 고려한 음향 모델 변별 학습을 위한 장치 및 이를 위한 방법이 기록된 컴퓨터 판독 가능한 기록매체 | |
JP6275569B2 (ja) | 対話装置、方法およびプログラム | |
JP3581044B2 (ja) | 音声対話処理方法、音声対話処理システムおよびプログラムを記憶した記憶媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request |
Effective date: 20220831 |
|
EEER | Examination request |
Effective date: 20220831 |
|
EEER | Examination request |
Effective date: 20220831 |
|
EEER | Examination request |
Effective date: 20220831 |
|
EEER | Examination request |
Effective date: 20220831 |