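JP2000504848A - Methods and apparatus for non-acoustic speech characterization and recognition - Google Patents
Methods and apparatus for non-acoustic speech characterization and recognition
Info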
- Publication number
- JP2000504848A
- Authority
- JP
- Japan
- Prior art keywords
- sound
- speech
- organ
- information
- wave
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/05—Detecting, measuring or recording for diagnosis by means of electric currents or magnetic fields; Measuring using microwaves or radio waves
- A61B5/0507—Detecting, measuring or recording for diagnosis by means of electric currents or magnetic fields; Measuring using microwaves or radio waves using microwaves or terahertz waves
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/254—Fusion techniques of classification results, e.g. of results related to same input data
- G06F18/256—Fusion techniques of classification results, e.g. of results related to same input data of results relating to different input data, e.g. multimodal recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2291/00—Indexing codes associated with group G01N29/00
- G01N2291/02—Indexing codes associated with the analysed material
- G01N2291/024—Mixtures
- G01N2291/02491—Materials with nonlinear acoustic properties
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2291/00—Indexing codes associated with group G01N29/00
- G01N2291/02—Indexing codes associated with the analysed material
- G01N2291/028—Material parameters
- G01N2291/02872—Pressure
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- Surgery (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Radiology & Medical Imaging (AREA)
- Biophysics (AREA)
- Pathology (AREA)
- Biomedical Technology (AREA)
- Heart & Thoracic Surgery (AREA)
- Medical Informatics (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Animal Behavior & Ethology (AREA)
- General Health & Medical Sciences (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Image Processing (AREA)
- Machine Translation (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
- Telephonic Communication Services (AREA)
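Claims (48)
- 1. A method of characterizing the speech of a speaker, comprising: directing electromagnetic (EM) radiation toward the speaker's speech organs; detecting EM radiation scattered from the speech organs to measure the state of the speech organs and thereby obtain EM speech information; detecting acoustic speech output from the speaker to obtain acoustic speech information; and combining the EM speech information with the acoustic speech information using a speech characterization algorithm.
- 2. The method of claim 1, wherein the speech is selected from normally sounded speech, whispered speech, and non-sounded speech.
- 3. The method of claim 1, wherein the speaker's acoustic speech output is detected using at least one acoustic microphone.
- 4. The method of claim 3, further comprising measuring acoustic pressure or sound intensity over a plurality of sampling times to obtain amplitude versus time, frequency, zero-crossing times, energy per time interval, and LPC or cepstral coefficients of the acoustic speech.
- 5. The method of claim 1, wherein the speaker's acoustic speech output is detected using at least one EM wave microphone for detecting acoustic vibrations.
- 6. The method of claim 1, wherein the EM radiation is directed to and detected from the speech organs using an EM wave transmitting and receiving system.
- 7. The method of claim 6, wherein the EM wave generating, transmitting, and detecting system is an RF, microwave, millimeter-wave, infrared, or visible-wave EM sensor.
- 8. The method of claim 7, wherein the EM sensor is operated in a time-of-flight, non-coherent mode.
- 9. The method of claim 8, wherein the EM sensor is range gated.
- 10. The method of claim 7, wherein the EM sensor is operated in a coherent mode.
- 11. The method of claim 10, wherein the EM sensor is operated in a homodyne, heterodyne, or other interferometric coherent detection mode.
- 12. The method of claim 7, wherein the EM sensor is operated in a field disturbance mode using a time-filtered output, with or without a range gate.
- 13. The method of claim 1, comprising controlling the timing of the generation, transmission, and detection of the EM radiation and of the substantially simultaneous reception of the acoustic speech output.
- 14. The method of claim 1, further comprising forming a feature vector that describes features of the acoustic speech output and of the speech organ states measured by the EM sensor during a defined time frame of speech.
- 15. The method of claim 14, further comprising storing, in the feature vector, the start time, duration, and end time of the defined time frame of each feature vector.
- 16. The method of claim 14, further comprising associating the information contained in the feature vector with information from other instruments or devices for timing synchronization.
- 17. The method of claim 14, further comprising storing the feature vectors in an electronic library.
- 18. The method of claim 14, further comprising forming feature vectors for one or more speakers, averaging the feature vectors of the one or more speakers, and storing the averaged feature vectors in a library.
- 19. The method of claim 14, further comprising normalizing and quantizing the speaker's feature vectors against the feature vectors of a reference speaker or group of speakers.
- 20. The method of claim 14, further comprising forming feature vectors relating to at least one of the position and velocity of at least one of the velum, jaw, tongue, glottal tissue, and lips.
- 21. The method of claim 14, further comprising forming a single or multiple speech frame feature vector that defines a speech unit such as a syllable, phoneme, PLU, diphone, triphone, acoustic unit, word, or word sequence.
- 22. The method of claim 14, further comprising applying a statistical technique or a pattern-matching technique to the feature vectors to identify a speech unit such as a syllable, phoneme, PLU, diphone, triphone, acoustic unit, word, or word sequence.
- 23. The method of claim 14, further comprising forming the feature vector by first forming separate acoustic and EM feature vectors and then combining the separate acoustic and EM feature vectors.
- 24. The method of claim 14, further comprising identifying acoustic changes and EM signal changes to define a new feature vector defined by changes from a reference feature vector.
- 25. The method of claim 14, further comprising identifying acoustic changes and EM signal changes, compared with those of the last time frame, to define a new speech time frame.
- 26. The method of claim 14, comprising forming the feature vectors automatically.
- 27. The method of claim 14, comprising forming feature vectors that describe the defined state, and changes from a defined state, of at least one of the position and velocity of at least one speech organ over a plurality of speech time frames.
- 28. The method of claim 14, further comprising forming feature vectors relating to velocity and acceleration over a plurality of time frames.
- 29. The method of claim 14, further comprising identifying a speaker from the pattern of feature vectors formed by that speaker over a sequence of consecutive speech time frames.
- 30. The method of claim 17, comprising performing time alignment of a particular speaker's feature vectors and comparing that speaker's time-aligned feature vectors with the feature vectors in the library.
- 31. The method of claim 1, further comprising obtaining organ velocity or acceleration information from the detected EM radiation.
- 32. The method of claim 1, comprising measuring other speech information besides the EM speech information and the acoustic speech information, and combining the other speech information with the EM speech information and the acoustic speech information.
- 33. The method of claim 1, further comprising determining, from the EM speech information and the acoustic speech information, a set of mechanical parameters of the speech system for speech system modeling.
- 34. The method of claim 1, wherein the algorithm determines the onset of speech, the end of speech, speech periods, pauses, speech rate, and extraneous noise.
- 35. The method of claim 1, wherein the algorithm determines the presence of voiced or unvoiced speech.
- 36. The method of claim 22, wherein the statistical technique is a hidden Markov model technique or a neural network technique.
- 37. The method of claim 22, wherein the pattern-matching technique is a speech-template matching technique.
- 38. The method of claim 22, wherein the algorithm uses a method of joint or exclusive identification, comparing feature vectors identified using conventional acoustic techniques against feature vectors identified using non-acoustic techniques, to obtain a higher overall identification probability.
- 39. The method of claim 1, further comprising measuring organ contact, in which one organ touches another organ and markedly changes the EM wave reflection conditions owing to changes in resonator or boundary-condition effects.
- 40. The method of claim 1, further comprising generating and transmitting a series of known wavelengths to detect the spacing between organ interfaces using coherent reflection and transmission from tissue and tissue interfaces.
- 41. Apparatus for characterizing the speech of a speaker, comprising: at least one electromagnetic (EM) wave generating, transmitting, and detecting unit for directing EM waves toward the speaker's speech organs and detecting EM waves scattered from the speaker's speech organs to obtain EM speech information; at least one microphone for detecting acoustic speech output from the speaker to obtain acoustic speech information; and means for combining the EM speech information with the acoustic speech information using a speech characterization algorithm.
- 42. The apparatus of claim 41, wherein each EM wave generating, transmitting, and receiving unit is an RF, microwave, millimeter-wave, infrared, or visible-wave radar.
- 43. The apparatus of claim 41, wherein each microphone is an acoustic microphone or an EM microphone.
- 44. The apparatus of claim 41, further comprising a structure for mounting the at least one EM wave generating, transmitting, and detecting unit and the at least one microphone so that they can detect the state of the speaker's speech organs.
- 45. The apparatus of claim 41, further comprising means for controlling the timing of the generation, transmission, and detection of the EM waves and of the substantially simultaneous reception of the acoustic speech output.
- 46. The apparatus of claim 42, wherein the EM unit is a time-of-flight, non-coherent radar; a field disturbance sensor with time-filtered output, with or without a range gate; or a coherent radar.
- 47. The apparatus of claim 42, wherein the EM unit is a range-gated radar.
- 48. The apparatus of claim 42, wherein the EM unit is a homodyne, heterodyne, or other interferometric coherent detection EM sensor.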
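The feature-vector steps recited in claims 4, 14, 15, and 23 can be illustrated with a minimal Python sketch. It is not the patented algorithm: the function names, the choice of EM features (mean sensor level and mean rate of change, used here as a stand-in for organ position and velocity), and the fixed frame length are all assumptions made for illustration. The sketch splits two time-aligned sample streams into frames, forms separate acoustic and EM feature vectors for each frame, concatenates them, and stores the frame timing alongside.

```python
def acoustic_features(frame):
    """Energy per frame and zero-crossing count (two of the claim 4 quantities)."""
    energy = sum(s * s for s in frame) / len(frame)
    zero_crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0)
    )
    return [energy, float(zero_crossings)]


def em_features(frame, dt):
    """Mean level and mean rate of change of the EM sensor signal.

    Assumption: the sensor output is treated as a proxy for organ position,
    so its rate of change stands in for organ velocity.
    """
    mean_level = sum(frame) / len(frame)
    mean_rate = (frame[-1] - frame[0]) / (dt * (len(frame) - 1))
    return [mean_level, mean_rate]


def combined_feature_vectors(acoustic, em, frame_len, sample_rate):
    """Form one combined feature vector per time frame.

    Both inputs are assumed to be sampled synchronously at sample_rate.
    frame_len must be at least 2 samples.
    """
    dt = 1.0 / sample_rate
    n = min(len(acoustic), len(em))
    vectors = []
    for start in range(0, n - frame_len + 1, frame_len):
        a = acoustic[start:start + frame_len]
        e = em[start:start + frame_len]
        vectors.append({
            # Frame timing stored with the vector (cf. claim 15).
            "start_s": start * dt,
            "duration_s": frame_len * dt,
            # Separate vectors formed first, then combined (cf. claim 23).
            "features": acoustic_features(a) + em_features(e, dt),
        })
    return vectors
```

Calling `combined_feature_vectors(mic_samples, radar_samples, 80, 8000)` with hypothetical sample lists would yield one dictionary per 10 ms frame. A real system would likely replace the fixed frame length with frames defined by detected signal changes, as claim 25 describes.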
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/597,596 US6006175A (en) | 1996-02-06 | 1996-02-06 | Methods and apparatus for non-acoustic speech characterization and recognition |
PCT/US1997/001489 WO1997029481A1 (en) | 1996-02-06 | 1997-01-28 | Methods and apparatus for non-acoustic speech characterization and recognition |
Publications (1)
Publication Number | Publication Date |
---|---|
JP2000504848A (ja) | 2000-04-18 |
Family
ID=24392161
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP9528567A (Ceased), published as JP2000504848A (ja) | Methods and apparatus for non-acoustic speech characterization and recognition | 1996-02-06 | 1997-01-28 |
Country Status (6)
Country | Link |
---|---|
US (1) | US6006175A (ja) |
EP (1) | EP0883877B1 (ja) |
JP (1) | JP2000504848A (ja) |
AT (1) | ATE286295T1 (ja) |
DE (1) | DE69732096D1 (ja) |
WO (1) | WO1997029481A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008062782A1 (fr) * | 2006-11-20 | 2008-05-29 | Nec Corporation | Speech estimation system, speech estimation method, and speech estimation program |
Families Citing this family (188)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6377919B1 (en) * | 1996-02-06 | 2002-04-23 | The Regents Of The University Of California | System and method for characterizing voiced excitations of speech and acoustic signals, removing acoustic noise from speech, and synthesizing speech |
US6542857B1 (en) * | 1996-02-06 | 2003-04-01 | The Regents Of The University Of California | System and method for characterizing synthesizing and/or canceling out acoustic signals from inanimate sound sources |
FR2762464B1 (fr) * | 1997-04-16 | 1999-06-25 | France Telecom | Method and device for coding an audio-frequency signal by "forward" and "backward" LPC analysis |
US6718302B1 (en) * | 1997-10-20 | 2004-04-06 | Sony Corporation | Method for utilizing validity constraints in a speech endpoint detector |
US6304846B1 (en) * | 1997-10-22 | 2001-10-16 | Texas Instruments Incorporated | Singing voice synthesis |
US6285979B1 (en) * | 1998-03-27 | 2001-09-04 | Avr Communications Ltd. | Phoneme analyzer |
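JPH11296192A (ja) * | 1998-04-10 | 1999-10-29 | Pioneer Electron Corp | Method for correcting speech features in speech recognition, speech recognition method, speech recognition apparatus, and recording medium storing a speech recognition program |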
US6421453B1 (en) * | 1998-05-15 | 2002-07-16 | International Business Machines Corporation | Apparatus and methods for user recognition employing behavioral passwords |
DE69943018D1 (de) * | 1998-10-09 | 2011-01-20 | Sony Corp | Learning apparatus and method, recognition apparatus and method, and recording medium |
JP2000200098A (ja) * | 1999-01-07 | 2000-07-18 | Sony Corp | Learning apparatus and learning method, and recognition apparatus and recognition method |
WO2001018781A1 (en) * | 1999-03-24 | 2001-03-15 | Lautzenhiser John L | Head-voice control of computer or other output apparatus |
US6487531B1 (en) | 1999-07-06 | 2002-11-26 | Carol A. Tosaya | Signal injection coupling into the human vocal tract for robust audible and inaudible voice recognition |
US6453284B1 (en) * | 1999-07-26 | 2002-09-17 | Texas Tech University Health Sciences Center | Multiple voice tracking system and method |
US6795807B1 (en) * | 1999-08-17 | 2004-09-21 | David R. Baraff | Method and means for creating prosody in speech regeneration for laryngectomees |
DE19941227A1 (de) * | 1999-08-30 | 2001-03-08 | Philips Corp Intellectual Pty | Method and arrangement for speech recognition |
US6675027B1 (en) * | 1999-11-22 | 2004-01-06 | Microsoft Corp | Personal mobile computing device having antenna microphone for improved speech recognition |
US6816085B1 (en) | 2000-01-14 | 2004-11-09 | Michael N. Haynes | Method for managing a parking lot |
JP3520022B2 (ja) * | 2000-01-14 | 2004-04-19 | 株式会社国際電気通信基礎技術研究所 | Foreign language learning apparatus, foreign language learning method, and medium |
US7123166B1 (en) | 2000-11-17 | 2006-10-17 | Haynes Michael N | Method for managing a parking lot |
JP2001265375A (ja) * | 2000-03-17 | 2001-09-28 | Oki Electric Ind Co Ltd | Rule-based speech synthesis apparatus |
US6711699B1 (en) * | 2000-05-04 | 2004-03-23 | International Business Machines Corporation | Real time backup system for information based on a user's actions and gestures for computer users |
US6501100B1 (en) * | 2000-05-15 | 2002-12-31 | General Electric Company | White light emitting phosphor blend for LED devices |
US6687689B1 (en) | 2000-06-16 | 2004-02-03 | Nusuara Technologies Sdn. Bhd. | System and methods for document retrieval using natural language-based queries |
US20030179888A1 (en) * | 2002-03-05 | 2003-09-25 | Burnett Gregory C. | Voice activity detection (VAD) devices and methods for use with noise suppression systems |
US8019091B2 (en) | 2000-07-19 | 2011-09-13 | Aliphcom, Inc. | Voice activity detector (VAD) -based multiple-microphone acoustic noise suppression |
US8280072B2 (en) | 2003-03-27 | 2012-10-02 | Aliphcom, Inc. | Microphone array with rear venting |
US7246058B2 (en) * | 2001-05-30 | 2007-07-17 | Aliph, Inc. | Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors |
US8467543B2 (en) * | 2002-03-27 | 2013-06-18 | Aliphcom | Microphone and voice activity detection (VAD) configurations for use with communication systems |
US20070233479A1 (en) * | 2002-05-30 | 2007-10-04 | Burnett Gregory C | Detecting voiced and unvoiced speech using both acoustic and nonacoustic sensors |
US6510410B1 (en) * | 2000-07-28 | 2003-01-21 | International Business Machines Corporation | Method and apparatus for recognizing tone languages using pitch information |
EP1189206B1 (en) * | 2000-09-19 | 2006-05-31 | Thomson Licensing | Voice control of electronic devices |
US6999926B2 (en) * | 2000-11-16 | 2006-02-14 | International Business Machines Corporation | Unsupervised incremental adaptation using maximum likelihood spectral transformation |
US20020099541A1 (en) * | 2000-11-21 | 2002-07-25 | Burnett Gregory C. | Method and apparatus for voiced speech excitation function determination and non-acoustic assisted feature extraction |
US7016833B2 (en) * | 2000-11-21 | 2006-03-21 | The Regents Of The University Of California | Speaker verification system using acoustic data and non-acoustic data |
US7136630B2 (en) * | 2000-12-22 | 2006-11-14 | Broadcom Corporation | Methods of recording voice signals in a mobile set |
US7143044B2 (en) * | 2000-12-29 | 2006-11-28 | International Business Machines Corporation | Translator for infants and toddlers |
AU2002253865A1 (en) * | 2001-02-14 | 2002-08-28 | The United States Of America, As Represented By The Administrator Of The National Aeronautics And Spa | Empirical mode decomposition for analyzing acoustical signals |
US6856952B2 (en) * | 2001-02-28 | 2005-02-15 | Intel Corporation | Detecting a characteristic of a resonating cavity responsible for speech |
US7076429B2 (en) * | 2001-04-27 | 2006-07-11 | International Business Machines Corporation | Method and apparatus for presenting images representative of an utterance with corresponding decoded speech |
US6928409B2 (en) * | 2001-05-31 | 2005-08-09 | Freescale Semiconductor, Inc. | Speech recognition using polynomial expansion and hidden markov models |
US6584437B2 (en) | 2001-06-11 | 2003-06-24 | Nokia Mobile Phones Ltd. | Method and apparatus for coding successive pitch periods in speech signal |
US6898568B2 (en) * | 2001-07-13 | 2005-05-24 | Innomedia Pte Ltd | Speaker verification utilizing compressed audio formants |
EP1280137B1 (en) * | 2001-07-24 | 2004-12-29 | Sony International (Europe) GmbH | Method for speaker identification |
US7162415B2 (en) * | 2001-11-06 | 2007-01-09 | The Regents Of The University Of California | Ultra-narrow bandwidth voice coding |
US7165028B2 (en) * | 2001-12-12 | 2007-01-16 | Texas Instruments Incorporated | Method of speech recognition resistant to convolutive distortion and additive distortion |
US7200635B2 (en) * | 2002-01-09 | 2007-04-03 | International Business Machines Corporation | Smart messenger |
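JP2003316387A (ja) * | 2002-02-19 | 2003-11-07 | Ntt Docomo Inc | Learning device, mobile communication terminal, information recognition system, and learning method |
JP3908965B2 (ja) * | 2002-02-28 | 2007-04-25 | 株式会社エヌ・ティ・ティ・ドコモ | Speech recognition apparatus and speech recognition method |
JP2003255993A (ja) * | 2002-03-04 | 2003-09-10 | Ntt Docomo Inc | Speech recognition system, speech recognition method, speech recognition program, speech synthesis system, speech synthesis method, speech synthesis program |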
US20030220787A1 (en) * | 2002-04-19 | 2003-11-27 | Henrik Svensson | Method of and apparatus for pitch period estimation |
US7209882B1 (en) | 2002-05-10 | 2007-04-24 | At&T Corp. | System and method for triphone-based unit selection for visual speech synthesis |
US9066186B2 (en) | 2003-01-30 | 2015-06-23 | Aliphcom | Light-based detection for acoustic applications |
TW200425763A (en) * | 2003-01-30 | 2004-11-16 | Aliphcom Inc | Acoustic vibration sensor |
US9099094B2 (en) | 2003-03-27 | 2015-08-04 | Aliphcom | Microphone array with rear venting |
US20050033571A1 (en) * | 2003-08-07 | 2005-02-10 | Microsoft Corporation | Head mounted multi-sensory audio input system |
US7383181B2 (en) * | 2003-07-29 | 2008-06-03 | Microsoft Corporation | Multi-sensory speech detection system |
CA2473195C (en) * | 2003-07-29 | 2014-02-04 | Microsoft Corporation | Head mounted multi-sensory audio input system |
US7916848B2 (en) * | 2003-10-01 | 2011-03-29 | Microsoft Corporation | Methods and systems for participant sourcing indication in multi-party conferencing and for audio source discrimination |
US7447630B2 (en) * | 2003-11-26 | 2008-11-04 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
US7684987B2 (en) * | 2004-01-21 | 2010-03-23 | Microsoft Corporation | Segmental tonal modeling for tonal languages |
EP2113227B1 (en) | 2004-02-04 | 2015-07-29 | LDR Medical | Intervertebral disc prosthesis |
US7499686B2 (en) * | 2004-02-24 | 2009-03-03 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement on a mobile device |
US7983835B2 (en) | 2004-11-03 | 2011-07-19 | Lagassey Paul J | Modular intelligent transportation system |
KR100636317B1 (ko) * | 2004-09-06 | 2006-10-18 | 삼성전자주식회사 | Distributed speech recognition system and method thereof |
US7574008B2 (en) * | 2004-09-17 | 2009-08-11 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
JP4943335B2 (ja) * | 2004-09-23 | 2012-05-30 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | Robust speaker-independent speech recognition system |
US7283850B2 (en) * | 2004-10-12 | 2007-10-16 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement on a mobile device |
US7809569B2 (en) * | 2004-12-22 | 2010-10-05 | Enterprise Integration Group, Inc. | Turn-taking confidence |
GB2422238A (en) * | 2005-01-17 | 2006-07-19 | Univ Hull | Generation of data from speech or voiceless mouthed speech |
JP4332129B2 (ja) * | 2005-04-20 | 2009-09-16 | 富士通株式会社 | Document classification program, document classification method, and document classification device |
US7346504B2 (en) * | 2005-06-20 | 2008-03-18 | Microsoft Corporation | Multi-sensory speech enhancement using a clean speech prior |
FR2891135B1 (fr) | 2005-09-23 | 2008-09-12 | Ldr Medical Sarl | Intervertebral disc prosthesis |
DE102005053109A1 (de) | 2005-11-04 | 2007-05-10 | Koehler, Ullrich, Prof. Dr. | Body sound detection |
WO2007057879A1 (en) * | 2005-11-17 | 2007-05-24 | Shaul Simhi | Personalized voice activity detection |
US20070276658A1 (en) * | 2006-05-23 | 2007-11-29 | Barry Grayson Douglass | Apparatus and Method for Detecting Speech Using Acoustic Signals Outside the Audible Frequency Range |
US8251924B2 (en) * | 2006-07-07 | 2012-08-28 | Ambient Corporation | Neural translator |
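JP4946293B2 (ja) * | 2006-09-13 | 2012-06-06 | 富士通株式会社 | Speech enhancement device, speech enhancement program, and speech enhancement method |
JP5151102B2 (ja) * | 2006-09-14 | 2013-02-27 | ヤマハ株式会社 | Voice authentication device, voice authentication method, and program |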
US20080147579A1 (en) * | 2006-12-14 | 2008-06-19 | Microsoft Corporation | Discriminative training using boosted lasso |
US7805308B2 (en) * | 2007-01-19 | 2010-09-28 | Microsoft Corporation | Hidden trajectory modeling with differential cepstra for speech recognition |
US20080195395A1 (en) * | 2007-02-08 | 2008-08-14 | Jonghae Kim | System and method for telephonic voice and speech authentication |
US8326636B2 (en) | 2008-01-16 | 2012-12-04 | Canyon Ip Holdings Llc | Using a physical phenomenon detector to control operation of a speech recognition engine |
WO2008157421A1 (en) | 2007-06-13 | 2008-12-24 | Aliphcom, Inc. | Dual omnidirectional microphone array |
US8352274B2 (en) * | 2007-09-11 | 2013-01-08 | Panasonic Corporation | Sound determination device, sound detection device, and sound determination method for determining frequency signals of a to-be-extracted sound included in a mixed sound |
JP5375612B2 (ja) * | 2007-09-25 | 2013-12-25 | 日本電気株式会社 | Frequency-axis warping coefficient estimation device, system, method, and program |
EP2045140B1 (en) * | 2007-10-01 | 2010-01-27 | Harman/Becker Automotive Systems GmbH | Adjustment of vehicular elements by speech control |
US8326610B2 (en) * | 2007-10-24 | 2012-12-04 | Red Shift Company, Llc | Producing phonitos based on feature vectors |
WO2009055715A1 (en) * | 2007-10-24 | 2009-04-30 | Red Shift Company, Llc | Producing time uniform feature vectors of speech |
TWI356399B (en) * | 2007-12-14 | 2012-01-11 | Ind Tech Res Inst | Speech recognition system and method with cepstral |
JP5229234B2 (ja) * | 2007-12-18 | 2013-07-03 | 富士通株式会社 | Non-speech section detection method and non-speech section detection device |
US8817964B2 (en) * | 2008-02-11 | 2014-08-26 | International Business Machines Corporation | Telephonic voice authentication and display |
US8280732B2 (en) * | 2008-03-27 | 2012-10-02 | Wolfgang Richter | System and method for multidimensional gesture analysis |
US9349367B2 (en) * | 2008-04-24 | 2016-05-24 | Nuance Communications, Inc. | Records disambiguation in a multimodal application operating on a multimodal device |
US9129595B2 (en) * | 2008-07-01 | 2015-09-08 | University Of The Witwatersrand | Artificial larynx |
CN101727904B (zh) * | 2008-10-31 | 2013-04-24 | 国际商业机器公司 | Speech translation method and apparatus |
KR101829865B1 (ko) | 2008-11-10 | 2018-02-20 | 구글 엘엘씨 | Multisensory speech detection |
US20100131268A1 (en) * | 2008-11-26 | 2010-05-27 | Alcatel-Lucent Usa Inc. | Voice-estimation interface and communication system |
US8271422B2 (en) * | 2008-11-29 | 2012-09-18 | At&T Intellectual Property I, Lp | Systems and methods for detecting and coordinating changes in lexical items |
JP2010190955A (ja) * | 2009-02-16 | 2010-09-02 | Toshiba Corp | Speech synthesis device, method, and program |
US20100241423A1 (en) * | 2009-03-18 | 2010-09-23 | Stanley Wayne Jackson | System and method for frequency to phase balancing for timbre-accurate low bit rate audio encoding |
US8064290B2 (en) * | 2009-04-28 | 2011-11-22 | Luidia, Inc. | Digital transcription system utilizing small aperture acoustical sensors |
WO2011025462A1 (en) * | 2009-08-25 | 2011-03-03 | Nanyang Technological University | A method and system for reconstructing speech from an input signal comprising whispers |
KR20110028095A (ko) * | 2009-09-11 | 2011-03-17 | 삼성전자주식회사 | Speech recognition system and method using real-time speaker adaptation |
US8457965B2 (en) * | 2009-10-06 | 2013-06-04 | Rothenberg Enterprises | Method for the correction of measured values of vowel nasalance |
US20110224541A1 (en) * | 2009-12-08 | 2011-09-15 | The General Hospital Corporation | Methods and arrangements for analysis, diagnosis, and treatment monitoring of vocal folds by optical coherence tomography |
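JP5834449B2 (ja) * | 2010-04-22 | 2015-12-24 | 富士通株式会社 | Utterance state detection device, utterance state detection program, and utterance state detection method |
CN102237081B (zh) * | 2010-04-30 | 2013-04-24 | 国际商业机器公司 | Speech prosody evaluation method and system |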
US11989659B2 (en) | 2010-05-13 | 2024-05-21 | Salesforce, Inc. | Method and apparatus for triggering the automatic generation of narratives |
US9208147B1 (en) | 2011-01-07 | 2015-12-08 | Narrative Science Inc. | Method and apparatus for triggering the automatic generation of narratives |
US8924214B2 (en) | 2010-06-07 | 2014-12-30 | The United States Of America, As Represented By The Secretary Of The Navy | Radar microphone speech recognition |
WO2012003602A1 (zh) * | 2010-07-09 | 2012-01-12 | 西安交通大学 | Electrolarynx speech reconstruction method and system |
US8532987B2 (en) | 2010-08-24 | 2013-09-10 | Lawrence Livermore National Security, Llc | Speech masking and cancelling and voice obscuration |
US20120136660A1 (en) * | 2010-11-30 | 2012-05-31 | Alcatel-Lucent Usa Inc. | Voice-estimation based on real-time probing of the vocal tract |
US10185477B1 (en) | 2013-03-15 | 2019-01-22 | Narrative Science Inc. | Method and system for configuring automatic generation of narratives from data |
US9720899B1 (en) | 2011-01-07 | 2017-08-01 | Narrative Science, Inc. | Automatic generation of narratives from data using communication goals and narrative analytics |
US9022032B2 (en) | 2011-03-21 | 2015-05-05 | Lawrence Livermore National Security, LLC | System for controlling apnea |
US8559813B2 (en) | 2011-03-31 | 2013-10-15 | Alcatel Lucent | Passband reflectometer |
US20120259554A1 (en) * | 2011-04-08 | 2012-10-11 | Sony Computer Entertainment Inc. | Tongue tracking interface apparatus and method for controlling a computer program |
US8666738B2 (en) | 2011-05-24 | 2014-03-04 | Alcatel Lucent | Biometric-sensor assembly, such as for acoustic reflectometry of the vocal tract |
US9171548B2 (en) * | 2011-08-19 | 2015-10-27 | The Boeing Company | Methods and systems for speaker identity verification |
KR101247652B1 (ko) * | 2011-08-30 | 2013-04-01 | 광주과학기술원 | Noise removal apparatus and method |
US8787571B2 (en) * | 2011-10-19 | 2014-07-22 | General Electric Company | Wired communications systems with improved capacity and security |
WO2013091677A1 (en) * | 2011-12-20 | 2013-06-27 | Squarehead Technology As | Speech recognition method and system |
US9679575B2 (en) | 2011-12-22 | 2017-06-13 | Intel Corporation | Reproduce a voice for a speaker based on vocal tract sensing using ultra wide band radar |
CN103456301B (zh) * | 2012-05-28 | 2019-02-12 | 中兴通讯股份有限公司 | Scene recognition method and device based on ambient sound, and mobile terminal |
US9263044B1 (en) * | 2012-06-27 | 2016-02-16 | Amazon Technologies, Inc. | Noise reduction based on mouth area movement recognition |
CN102880656B (zh) * | 2012-08-30 | 2015-03-25 | 苏州大学 | Language center decoding method and system, and lock having the system |
US8700396B1 (en) * | 2012-09-11 | 2014-04-15 | Google Inc. | Generating speech data collection prompts |
US9438985B2 (en) | 2012-09-28 | 2016-09-06 | Apple Inc. | System and method of detecting a user's voice activity using an accelerometer |
US9313572B2 (en) | 2012-09-28 | 2016-04-12 | Apple Inc. | System and method of detecting a user's voice activity using an accelerometer |
US20140095161A1 (en) * | 2012-09-28 | 2014-04-03 | At&T Intellectual Property I, L.P. | System and method for channel equalization using characteristics of an unknown signal |
BR112015007625B1 (pt) * | 2012-10-09 | 2021-12-21 | Mediatek Inc | Apparatus, method of generating an audio interference measure, and computer-readable storage medium |
EP2947658A4 (en) * | 2013-01-15 | 2016-09-14 | Sony Corp | Memory control device, read control device, and recording medium |
US11393461B2 (en) | 2013-03-12 | 2022-07-19 | Cerence Operating Company | Methods and apparatus for detecting a voice command |
US9363596B2 (en) | 2013-03-15 | 2016-06-07 | Apple Inc. | System and method of mixing accelerometer and microphone signals to improve voice quality in a mobile device |
US9640185B2 (en) * | 2013-12-12 | 2017-05-02 | Motorola Solutions, Inc. | Method and apparatus for enhancing the modulation index of speech sounds passed through a digital vocoder |
US10741182B2 (en) * | 2014-02-18 | 2020-08-11 | Lenovo (Singapore) Pte. Ltd. | Voice input correction using non-audio based input |
US9959477B2 (en) * | 2014-03-03 | 2018-05-01 | The Board Of Trustees Of The Leland Stanford Junior University | Mapping of blood vessels for biometric authentication |
US11922344B2 (en) | 2014-10-22 | 2024-03-05 | Narrative Science Llc | Automatic generation of narratives from data using communication goals and narrative analytics |
US11238090B1 (en) | 2015-11-02 | 2022-02-01 | Narrative Science Inc. | Applied artificial intelligence technology for using narrative analytics to automatically generate narratives from visualization data |
KR102396983B1 (ko) * | 2015-01-02 | 2022-05-12 | 삼성전자주식회사 | Grammar correction method and apparatus |
WO2017017572A1 (en) | 2015-07-26 | 2017-02-02 | Vocalzoom Systems Ltd. | Laser microphone utilizing speckles noise reduction |
US10332506B2 (en) * | 2015-09-02 | 2019-06-25 | Oath Inc. | Computerized system and method for formatted transcription of multimedia content |
US11232268B1 (en) | 2015-11-02 | 2022-01-25 | Narrative Science Inc. | Applied artificial intelligence technology for using narrative analytics to automatically generate narratives from line charts |
US11170038B1 (en) | 2015-11-02 | 2021-11-09 | Narrative Science Inc. | Applied artificial intelligence technology for using narrative analytics to automatically generate narratives from multiple visualizations |
US11222184B1 (en) | 2015-11-02 | 2022-01-11 | Narrative Science Inc. | Applied artificial intelligence technology for using narrative analytics to automatically generate narratives from bar charts |
EP3414759B1 (en) | 2016-02-10 | 2020-07-01 | Cerence Operating Company | Techniques for spatially selective wake-up word recognition and related systems and methods |
US10542929B2 (en) * | 2016-02-23 | 2020-01-28 | Dustin Ryan Kimmel | Determining conditions based on intraoral sensing |
WO2017197156A1 (en) * | 2016-05-11 | 2017-11-16 | Ossic Corporation | Systems and methods of calibrating earphones |
US11600269B2 (en) * | 2016-06-15 | 2023-03-07 | Cerence Operating Company | Techniques for wake-up word recognition and related systems and methods |
US11144838B1 (en) | 2016-08-31 | 2021-10-12 | Narrative Science Inc. | Applied artificial intelligence technology for evaluating drivers of data presented in visualizations |
CN106252885B (zh) * | 2016-09-19 | 2018-07-20 | 深圳市华讯方舟太赫兹科技有限公司 | Electronically scanned array antenna device for millimeter-wave imaging systems |
US11545146B2 (en) | 2016-11-10 | 2023-01-03 | Cerence Operating Company | Techniques for language independent wake-up word detection |
US11568148B1 (en) | 2017-02-17 | 2023-01-31 | Narrative Science Inc. | Applied artificial intelligence technology for narrative generation based on explanation communication goals |
US11068661B1 (en) | 2017-02-17 | 2021-07-20 | Narrative Science Inc. | Applied artificial intelligence technology for narrative generation based on smart attributes |
US10755053B1 (en) * | 2017-02-17 | 2020-08-25 | Narrative Science Inc. | Applied artificial intelligence technology for story outline formation using composable communication goals to support natural language generation (NLG) |
US11954445B2 (en) | 2017-02-17 | 2024-04-09 | Narrative Science Llc | Applied artificial intelligence technology for narrative generation based on explanation communication goals |
US10943069B1 (en) | 2017-02-17 | 2021-03-09 | Narrative Science Inc. | Applied artificial intelligence technology for narrative generation based on a conditional outcome framework |
KR102017244B1 (ko) * | 2017-02-27 | 2019-10-21 | 한국전자통신연구원 | Method and apparatus for improving natural language recognition performance |
US10665252B2 (en) * | 2017-05-22 | 2020-05-26 | Ajit Arun Zadgaonkar | System and method for estimating properties and physiological conditions of organs by analysing speech samples |
US10339929B2 (en) | 2017-06-27 | 2019-07-02 | Google Llc | Speech recognition using acoustic features in conjunction with distance information |
WO2019051082A1 (en) * | 2017-09-06 | 2019-03-14 | Georgia Tech Research Corporation | Systems, methods and devices for gesture recognition |
US10529355B2 (en) | 2017-12-19 | 2020-01-07 | International Business Machines Corporation | Production of speech based on whispered speech and silent speech |
CN107910011B (zh) * | 2017-12-28 | 2021-05-04 | 科大讯飞股份有限公司 | Speech noise reduction method, apparatus, server, and storage medium |
US11042709B1 (en) | 2018-01-02 | 2021-06-22 | Narrative Science Inc. | Context saliency-based deictic parser for natural language processing |
US10963649B1 (en) | 2018-01-17 | 2021-03-30 | Narrative Science Inc. | Applied artificial intelligence technology for narrative generation using an invocable analysis service and configuration-driven analytics |
NL2021041B1 (nl) * | 2018-01-31 | 2019-08-07 | Iebm B V | Speech recognition with image signal |
WO2019150234A1 (en) | 2018-01-31 | 2019-08-08 | Iebm B.V. | Speech recognition with image signal |
US10885929B2 (en) * | 2018-02-05 | 2021-01-05 | TS Voice Technology, LLC | Computer-aided conversion system and method for generating intelligible speech |
US10938994B2 (en) * | 2018-06-25 | 2021-03-02 | Cypress Semiconductor Corporation | Beamformer and acoustic echo canceller (AEC) system |
US11334726B1 (en) | 2018-06-28 | 2022-05-17 | Narrative Science Inc. | Applied artificial intelligence technology for using natural language processing to train a natural language generation system with respect to date and number textual features |
CN112739996A (zh) | 2018-07-24 | 2021-04-30 | 弗兰克公司 | Systems and methods for analyzing and displaying acoustic data |
US10971132B2 (en) | 2018-08-28 | 2021-04-06 | Acer Incorporated | Multimedia processing method and electronic system |
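TWI683226B (zh) | 2018-08-28 | 2020-01-21 | 宏碁股份有限公司 | Multimedia processing circuit and electronic system |
CN109584894A (zh) * | 2018-12-20 | 2019-04-05 | 西京学院 | Speech enhancement method based on fusion of radar speech and microphone speech |
TWI730585B (zh) * | 2019-01-16 | 2021-06-11 | 美商Ts聲音科技有限公司 | Test system and method for computer-aided conversion into intelligible speech |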
US10990767B1 (en) | 2019-01-28 | 2021-04-27 | Narrative Science Inc. | Applied artificial intelligence technology for adaptive natural language understanding |
JP7331395B2 (ja) * | 2019-03-20 | 2023-08-23 | 富士フイルムビジネスイノベーション株式会社 | Process extraction device and program |
CN110223686A (zh) * | 2019-05-31 | 2019-09-10 | 联想(北京)有限公司 | Speech recognition method, speech recognition device, and electronic device |
US11544458B2 (en) * | 2020-01-17 | 2023-01-03 | Apple Inc. | Automatic grammar detection and correction |
US20210287674A1 (en) * | 2020-03-16 | 2021-09-16 | Knowles Electronics, Llc | Voice recognition for imposter rejection in wearable devices |
DE102020110901B8 (de) | 2020-04-22 | 2023-10-19 | Altavo Gmbh | Method for generating an artificial voice |
US20210407493A1 (en) * | 2020-06-30 | 2021-12-30 | Plantronics, Inc. | Audio Anomaly Detection in a Speech Signal |
KR102426792B1 (ko) * | 2020-09-16 | 2022-07-29 | 한양대학교 산학협력단 | Silent speech recognition method and apparatus |
US20220192523A1 (en) * | 2020-12-18 | 2022-06-23 | Movano Inc. | Method for monitoring a physiological parameter in a person that involves coherently combining data generated from an rf-based sensor system |
DE102022115034A1 (de) | 2022-06-15 | 2023-12-21 | Altavo Gmbh | Multi-modal sensor arrangement for close-to-body application |
WO2024064468A1 (en) * | 2022-09-20 | 2024-03-28 | Qualcomm Incorporated | Voice user interface assisted with radio frequency sensing |
CN116819482B (zh) * | 2023-08-28 | 2023-11-10 | 四川省石棉县恒达粉体材料有限责任公司 | Calcite detection method based on radar data |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH04140799A (ja) * | 1990-09-29 | 1992-05-14 | Emerson & Stahn Assoc Inc | Method and apparatus for determining articulatory parameters from speech data |
JPH04504767A (ja) * | 1990-01-31 | 1992-08-20 | アメリカ合衆国 | Time-series association learning |
JPH0643897A (ja) * | 1992-05-26 | 1994-02-18 | Ricoh Co Ltd | Conversation recognition system |
JPH06214711A (ja) * | 1992-09-25 | 1994-08-05 | Sextant Avionique | Management system for a dialogue system |
JPH0824227A (ja) * | 1994-07-19 | 1996-01-30 | Hitachi Medical Corp | Medical image diagnostic apparatus |
US5729694A (en) * | 1996-02-06 | 1998-03-17 | The Regents Of The University Of California | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5361070B1 (en) * | 1993-04-12 | 2000-05-16 | Univ California | Ultra-wideband radar motion sensor |
US5473726A (en) * | 1993-07-06 | 1995-12-05 | The United States Of America As Represented By The Secretary Of The Air Force | Audio and amplitude modulated photo data collection for speech recognition |
US5573012A (en) * | 1994-08-09 | 1996-11-12 | The Regents Of The University Of California | Body monitoring and imaging apparatus and method |
US5549658A (en) * | 1994-10-24 | 1996-08-27 | Advanced Bionics Corporation | Four-Channel cochlear system with a passive, non-hermetically sealed implant |
1996
- 1996-02-06 US application US08/597,596, publication US6006175A, status: Expired - Lifetime (not active)
1997
- 1997-01-28 DE application DE69732096T, publication DE69732096D1, status: Expired - Fee Related (not active)
- 1997-01-28 AT application AT97906883T, publication ATE286295T1, status: IP Right Cessation (not active)
- 1997-01-28 WO application PCT/US1997/001489, publication WO1997029481A1, status: IP Right Grant (active)
- 1997-01-28 JP application JP9528567A, publication JP2000504848A, status: Ceased (not active)
- 1997-01-28 EP application EP97906883A, publication EP0883877B1, status: Expired - Lifetime (not active)
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008062782A1 (fr) * | 2006-11-20 | 2008-05-29 | Nec Corporation | Speech estimation system, speech estimation method, and speech estimation program |
Also Published As
Publication number | Publication date |
---|---|
DE69732096D1 (de) | 2005-02-03 |
US6006175A (en) | 1999-12-21 |
ATE286295T1 (de) | 2005-01-15 |
EP0883877A4 (en) | 1999-08-11 |
EP0883877A1 (en) | 1998-12-16 |
EP0883877B1 (en) | 2004-12-29 |
WO1997029481A1 (en) | 1997-08-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP2000504848A (ja) | | Method and apparatus for non-acoustic speech characterization and recognition |
Hansen et al. | | Speech under stress: Analysis, modeling and recognition |
Wrench | | A Multi-Channel/Multi-Speaker Articulatory Database for Continuous Speech Recognition Research. |
Cohen et al. | | Vocal tract normalization in speech recognition: Compensating for systematic speaker variability |
Perrot et al. | | Voice disguise and automatic detection: review and perspectives |
JP2000504849A (ja) | | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves |
JPH09500223A (ja) | | Multilingual speech recognition system |
US7480616B2 (en) | | Information recognition device and information recognition method |
Zlokarnik | | Adding articulatory features to acoustic features for automatic speech recognition |
US11763799B2 (en) | | Electronic apparatus and controlling method thereof |
Sak et al. | | A corpus-based concatenative speech synthesis system for Turkish |
Saito | | Speech science and technology |
Cao et al. | | Magtrack: A wearable tongue motion tracking system for silent speech interfaces |
US10885929B2 (en) | | Computer-aided conversion system and method for generating intelligible speech |
Chen et al. | | Automatic pronunciation assessment for mandarin chinese: Approaches and system overview |
Stone | | A silent-speech interface using electro-optical stomatography |
Raitio | | Voice source modelling techniques for statistical parametric speech synthesis |
Chen | | Acoustic-phonetic constraints in continuous speech recognition: a case study using the digit vocabulary. |
Malik et al. | | Efficacy of Current Dysarthric Speech Recognition Techniques |
Niemann et al. | | Statistical Modeling of Segmental and Suprasegmental Information |
Rahim et al. | | Parameter estimation for spectral matching in articulatory synthesis |
Bhabad | | Speech Recognition & Rectification for Articulatory Handicapped People |
Blackburn et al. | | Enhanced speech recognition using an articulatory production model trained on X-ray data |
Macon et al. | | Speech synthesis based on an overlap‐add sinusoidal model |
Hagmüller | | Recognition of regional variants of German using prosodic features |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2004-01-27 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
2006-09-05 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
2006-12-05 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
2007-01-29 | A602 | Written permission of extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A602 |
2007-04-20 | A313 | Final decision of rejection without a dissenting response from the applicant | Free format text: JAPANESE INTERMEDIATE CODE: A313 |
2008-08-12 | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02 |