DK179558B1 - Detecting a trigger of a digital assistant - Google Patents
- Publication number
- DK179558B1 (application DKPA201770421A)
- Authority
- DK
- Denmark
- Prior art keywords
- electronic device
- user
- module
- audio
- digital assistant
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 claims abstract description 166
- 238000000034 method Methods 0.000 claims abstract description 117
- 230000000977 initiatory effect Effects 0.000 claims abstract description 26
- 238000005070 sampling Methods 0.000 claims abstract description 15
- 238000003860 storage Methods 0.000 claims description 26
- 238000012545 processing Methods 0.000 abstract description 79
- 230000008569 process Effects 0.000 abstract description 61
- 230000015654 memory Effects 0.000 abstract description 59
- 238000004891 communication Methods 0.000 description 57
- 230000033001 locomotion Effects 0.000 description 53
- 230000004044 response Effects 0.000 description 38
- 238000003058 natural language processing Methods 0.000 description 31
- 238000005111 flow chemistry technique Methods 0.000 description 29
- 230000003287 optical effect Effects 0.000 description 21
- 230000015572 biosynthetic process Effects 0.000 description 20
- 230000002093 peripheral effect Effects 0.000 description 20
- 238000003786 synthesis reaction Methods 0.000 description 20
- 230000006870 function Effects 0.000 description 14
- 230000007246 mechanism Effects 0.000 description 14
- 238000007726 management method Methods 0.000 description 13
- 230000000007 visual effect Effects 0.000 description 12
- 241000699666 Mus <mouse, genus> Species 0.000 description 11
- 238000010586 diagram Methods 0.000 description 11
- 238000001514 detection method Methods 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 230000009471 action Effects 0.000 description 9
- 238000009499 grossing Methods 0.000 description 7
- 230000003993 interaction Effects 0.000 description 7
- 230000002452 interceptive effect Effects 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 6
- 230000001413 cellular effect Effects 0.000 description 5
- 238000006073 displacement reaction Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000001133 acceleration Effects 0.000 description 4
- 230000003213 activating effect Effects 0.000 description 4
- 238000005259 measurement Methods 0.000 description 4
- 230000003252 repetitive effect Effects 0.000 description 4
- 230000021317 sensory perception Effects 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 230000001960 triggered effect Effects 0.000 description 4
- 241000227653 Lycopersicon Species 0.000 description 3
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 3
- 239000008186 active pharmaceutical agent Substances 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000010801 machine learning Methods 0.000 description 3
- 230000004913 activation Effects 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 230000000881 depressing effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000010365 information processing Effects 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 238000003062 neural network model Methods 0.000 description 2
- 230000035807 sensation Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 230000001755 vocal effect Effects 0.000 description 2
- MQJKPEGWNLWLTK-UHFFFAOYSA-N Dapsone Chemical compound C1=CC(N)=CC=C1S(=O)(=O)C1=CC=C(N)C=C1 MQJKPEGWNLWLTK-UHFFFAOYSA-N 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 241001422033 Thestylus Species 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000001149 cognitive effect Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000009849 deactivation Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 229920001746 electroactive polymer Polymers 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000004424 eye movement Effects 0.000 description 1
- 235000013410 fast food Nutrition 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 229910044991 metal oxide Inorganic materials 0.000 description 1
- 150000004706 metal oxides Chemical class 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 235000013550 pizza Nutrition 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 230000000153 supplemental effect Effects 0.000 description 1
- 238000010897 surface acoustic wave method Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Artificial Intelligence (AREA)
- User Interface Of Digital Computer (AREA)
- Telephone Function (AREA)
Claims (12)
- 1. A method (900) for operating a digital assistant, the method comprising: sampling (902), using a first microphone of a first electronic device, a first audio signal; sampling (904), using a second microphone of a second electronic device different from the first electronic device, a second audio signal; determining (906), at a third electronic device different from the first electronic device and the second electronic device, whether any of the first audio signal and the second audio signal corresponds to a spoken trigger; providing directional information associated with an audio source, based on the first audio signal and the second audio signal; in accordance with a determination that the first audio signal or the second audio signal corresponds to a spoken trigger: initiating (908), with a fourth electronic device, a session of the digital assistant, wherein initiating the session of the digital assistant comprises providing, with the digital assistant, an audio output based on the directional information; in accordance with a determination that the first audio signal and the second audio signal do not correspond to the spoken trigger: forgoing (910), by the fourth electronic device, initiating a session of the digital assistant.
- 2. The method of claim 1, wherein the fourth electronic device is the first electronic device.
- 3. The method of claim 1, wherein the third electronic device is a remote server device.
- 4. The method of any of claims 1-3, further comprising: providing, with the third electronic device, information corresponding to the first audio signal and the second audio signal, wherein the information comprises location information of the first microphone.
- 5. The method of any of claims 1-3, further comprising: providing, with the third electronic device, information corresponding to the first audio signal and the second audio signal, wherein the information comprises directional information of the first audio signal.
- 6. The method of any of claims 1-3, further comprising: providing, with the third electronic device, information corresponding to the first audio signal and the second audio signal, wherein the information comprises a device type associated with the first electronic device.
- 7. The method of any of claims 1-6, wherein the second electronic device is a device of a first type and the third electronic device is a device of a second type different from the first type.
- 8. The method of any of claims 1-7, wherein determining whether any of the first audio signal and the second audio signal corresponds to a spoken trigger comprises: determining whether a combination of the first audio signal and the second audio signal corresponds to a spoken trigger.
- 9. The method of any of claims 1-8, wherein the first electronic device is a computer, a set-top box, a speaker, a smartwatch, a phone, or any combination thereof.
- 10. The method of any of claims 1-8, wherein the second electronic device is a computer, a set-top box, a speaker, a smartwatch, a phone, or any combination thereof.
- 11. One or more non-transitory computer-readable storage media storing one or more programs, the one or more programs comprising instructions which, when executed by one or more processors of one or more electronic devices, cause the one or more electronic devices to perform the methods of any of claims 1-10.
- 12. A system comprising: means for performing the methods of any of claims 1-10.
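The decision flow of claim 1 (steps 906, 908, and 910) can be illustrated with a minimal sketch. This is not the patented implementation: the claim assigns the determination to a third device and the session to a fourth, whereas the sketch collapses both into one function for readability. The names `decide_session`, `SessionDecision`, `rms`, and the `is_trigger` callback are hypothetical, and comparing signal levels is only an assumed stand-in for the "directional information" that the claim leaves unspecified.

```python
from dataclasses import dataclass
from math import sqrt
from typing import Callable, Optional, Sequence

Signal = Sequence[float]


@dataclass
class SessionDecision:
    initiate: bool                # True -> step 908, False -> step 910
    louder_device: Optional[str]  # assumed stand-in for directional information


def rms(signal: Signal) -> float:
    """Root-mean-square level of a signal; 0.0 for an empty signal."""
    if not signal:
        return 0.0
    return sqrt(sum(x * x for x in signal) / len(signal))


def decide_session(
    first: Signal,
    second: Signal,
    is_trigger: Callable[[Signal], bool],
) -> SessionDecision:
    """Decide whether to initiate a digital assistant session.

    `is_trigger` is a hypothetical spoken-trigger detector supplied by the
    caller; the claims do not say how detection is performed.
    """
    # Step 906: does either sampled signal correspond to the spoken trigger?
    if not (is_trigger(first) or is_trigger(second)):
        # Step 910: forgo initiating a session.
        return SessionDecision(initiate=False, louder_device=None)

    # Assumption: the device that heard the louder signal serves as a crude
    # directional cue. A real system would likely use something richer.
    louder = "first" if rms(first) >= rms(second) else "second"

    # Step 908: initiate the session; the audio output may use the cue.
    return SessionDecision(initiate=True, louder_device=louder)


if __name__ == "__main__":
    # Toy detector: treats any sufficiently loud signal as the trigger.
    toy_detector = lambda s: rms(s) > 0.5
    print(decide_session([0.9, -0.9], [0.1, 0.1], toy_detector))
```

Passing the detector as a callback keeps the sketch independent of any particular trigger-recognition technique, which the claims leave open.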
Priority Applications (17)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/920,091 US20180336892A1 (en) | 2017-05-16 | 2018-03-13 | Detecting a trigger of a digital assistant |
AU2018260948A AU2018260948B2 (en) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
CN201880002529.3A CN109328381B (zh) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
KR1020217013243A KR102363177B1 (ko) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
CN201910787515.3A CN110473538B (zh) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
EP18724082.5A EP3443556B1 (en) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
EP19182046.3A EP3570277B1 (en) | 2017-05-16 | 2018-04-25 | Operation of a digital assistant |
EP20209827.3A EP3806091B1 (en) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
KR1020187034152A KR102249298B1 (ko) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
CN201910574413.3A CN110288994B (zh) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
PCT/US2018/029474 WO2018212953A1 (en) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
KR1020197031306A KR102180832B1 (ko) | 2017-05-16 | 2018-04-25 | Detecting a trigger of a digital assistant |
US16/181,138 US20190074009A1 (en) | 2017-05-16 | 2018-11-05 | Detecting a trigger of a digital assistant |
AU2019202136A AU2019202136B2 (en) | 2017-05-16 | 2019-03-28 | Detecting a trigger of a digital assistant |
DKPA201970527A DK201970527A1 (da) | 2017-05-16 | 2019-08-22 | Detecting a trigger of a digital assistant |
US17/111,132 US11532306B2 (en) | 2017-05-16 | 2020-12-03 | Detecting a trigger of a digital assistant |
US18/080,550 US20230111509A1 (en) | 2017-05-16 | 2022-12-13 | Detecting a trigger of a digital assistant |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201762507042P | 2017-05-16 | 2017-05-16 | |
US62/507,042 | 2017-05-16 |
Publications (2)
Publication Number | Publication Date |
---|---|
DK201770421A1 (da) | 2018-12-07 |
DK179558B1 (da) | 2019-02-13 |
Family
ID=69137536
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DKPA201770420A DK179930B1 (da) | 2017-05-16 | 2017-05-31 | Detecting a trigger of a digital assistant |
DKPA201770421A DK179558B1 (da) | 2017-05-16 | 2017-05-31 | Detecting a trigger of a digital assistant |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DKPA201770420A DK179930B1 (da) | 2017-05-16 | 2017-05-31 | Detecting a trigger of a digital assistant |
Country Status (1)
Country | Link |
---|---|
DK (2) | DK179930B1 (da) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
EP3709194A1 (en) | 2019-03-15 | 2020-09-16 | Spotify AB | Ensemble-based data comparison |
US11094319B2 (en) | 2019-08-30 | 2021-08-17 | Spotify Ab | Systems and methods for generating a cleaned version of ambient sound |
US11328722B2 (en) * | 2020-02-11 | 2022-05-10 | Spotify Ab | Systems and methods for generating a singular voice audio stream |
US11308959B2 (en) | 2020-02-11 | 2022-04-19 | Spotify Ab | Dynamic adjustment of wake word acceptance tolerance thresholds in voice-controlled devices |
2017
- 2017-05-31 DK DKPA201770420A patent/DK179930B1/da not_active IP Right Cessation
- 2017-05-31 DK DKPA201770421A patent/DK179558B1/da not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
DK201770421A1 (da) | 2018-12-07 |
DK179930B1 (da) | 2019-10-11 |
DK201770420A1 (da) | 2018-12-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11532306B2 (en) | Detecting a trigger of a digital assistant | |
US11862151B2 (en) | Low-latency intelligent automated assistant | |
DK180756B1 (da) | Bruger-specifikke akustiske modeller | |
EP3701520B1 (en) | Multi-turn canned dialog | |
US10755703B2 (en) | Offline personal assistant | |
US20180330714A1 (en) | Machine learned systems | |
EP4075426B1 (en) | Low-latency intelligent automated assistant | |
EP3593350B1 (en) | User interface for correcting recognition errors | |
DK179558B1 (da) | Detecting a trigger of a digital assistant | |
EP3424046B1 (en) | User-specific acoustic models |
Legal Events
Code | Title | Description |
---|---|---|
PAT | Application published | Effective date: 20181117 |
PME | Patent granted | Effective date: 20190213 |
PBP | Patent lapsed | Effective date: 20230531 |