WO2022158633A1 - 음성인식 및 음성합성을 이용한 무선통신장치 - Google Patents
음성인식 및 음성합성을 이용한 무선통신장치 Download PDFInfo
- Publication number
- WO2022158633A1 WO2022158633A1 PCT/KR2021/001397 KR2021001397W WO2022158633A1 WO 2022158633 A1 WO2022158633 A1 WO 2022158633A1 KR 2021001397 W KR2021001397 W KR 2021001397W WO 2022158633 A1 WO2022158633 A1 WO 2022158633A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- voice
- information
- wireless communication
- communication device
- syllable
- Prior art date
Links
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 13
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 13
- 230000005540 biological transmission Effects 0.000 claims abstract description 28
- 238000000034 method Methods 0.000 claims description 27
- 230000000630 rising effect Effects 0.000 claims description 5
- 230000007935 neutral effect Effects 0.000 description 7
- 230000008901 benefit Effects 0.000 description 3
- 230000006835 compression Effects 0.000 description 3
- 238000007906 compression Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000010420 art technique Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/38—Transceivers, i.e. devices in which transmitter and receiver form a structural unit and in which at least one part is used for functions of transmitting and receiving
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C19/00—Gyroscopes; Turn-sensitive devices using vibrating masses; Turn-sensitive devices without moving masses; Measuring angular rate using gyroscopic effects
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0018—Speech coding using phonetic or linguistical decoding of the source; Reconstruction using text-to-speech synthesis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/027—Syllables being the recognition units
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/38—Transceivers, i.e. devices in which transmitter and receiver form a structural unit and in which at least one part is used for functions of transmitting and receiving
- H04B1/3827—Portable transceivers
- H04B1/385—Transceivers carried on the body, e.g. in helmets
- H04B2001/3872—Transceivers carried on the body, e.g. in helmets with extendable microphones or earphones
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Remote Sensing (AREA)
- Radar, Positioning & Navigation (AREA)
- General Physics & Mathematics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Mobile Radio Communication Systems (AREA)
- Telephonic Communication Services (AREA)
- Telephone Function (AREA)
- Transceivers (AREA)
Abstract
Description
Claims (11)
- 마이크를 통해 입력되는 음성신호를 음성인식을 이용하여 음절정보의 스트림으로 변환하는 음성인식부;상기 음절정보의 스트림을 부호화하여 디지털 송신 데이터를 생성하는 부호화부;상기 디지털 송신 데이터를 변조하여 송신 신호를 안테나를 통해 송신하는 송신부;상기 안테나를 통해 수신되는 수신 신호를 복조하여 디지털 수신 데이터를 출력하는 수신부;상기 디지털 수신 데이터를 복호화하여 음절정보의 스트림으로 변환하는 복호화부; 및상기 음절정보의 스트림을 음성합성을 이용하여 음성신호로 변환해 스피커를 통해 출력하는 음성합성부를 포함하는 것을 특징으로 하는 무선통신장치.
- 제1항에 있어서,상기 음절정보는, 초성, 중성, 종성의 조합을 포함하는 것을 특징으로 하는 무선통신장치.
- 제2항에 있어서,상기 음절정보는 운율 정보를 더 포함하는 것을 특징으로 하는 무선통신장치.
- 제3항에 있어서,상기 운율 정보는 보통음, 상승음, 하강음, 장음, 강세음을 포함하는 것을 특징으로 하는 무선통신장치.
- 제2항에 있어서,상기 음절정보는 음색 정보를 더 포함하는 것을 특징으로 하는 무선통신장치.
- 제5항에 있어서,상기 음색 정보는 남자, 여자, 노인, 어린이 별로 소정 개수의 레벨을 포함하는 것을 특징으로 하는 무선통신장치.
- 제2항에 있어서,상기 음절정보를 구성하는 초성, 중성, 종성은 3차원 좌표계의 세 축에 각각 대응하고, 상기 음절정보는 상기 3차원 좌표계에서의 상기 초성, 중성, 종성 각각의 좌표값에 따라 디지털 데이터에 매핑되는 것을 특징으로 하는 무선통신장치.
- 제7항에 있어서,상기 음절정보는 운율 정보를 더 포함하고,상기 음절정보는 상기 3차원 좌표계에서의 상기 초성, 중성, 종성 각각의 좌표값 및 상기 운율 정보에 따라 상기 디지털 데이터에 매핑되는 것을 특징으로 하는 무선통신장치.
- 제1항에 있어서,상기 무선통신장치는 인공위성을 통한 음성통화를 위한 무선통신장치이고,상기 송신부 및 상기 수신부는 상기 송신 신호 및 상기 수신 신호를 인공위성과 송수신할 수 있도록 변조 및 복조하는 것을 특징으로 하는 무선통신장치.
- 제9항에 있어서,자이로 센서;상기 안테나에 연결된 3축 기어; 및상기 자이로 센서의 센싱 값에 따라 상기 안테나가 상방을 향하도록 상기 3축 기어를 제어하는 안테나 자세 제어부를 더 포함하는 것을 특징으로 하는 무선통신장치.
- 제1항에 있어서,상기 부호화부에서 출력되는 상기 디지털 송신 데이터 및 상기 수신부에서 출력되는 상기 디지털 수신 데이터를 저장하는 녹음부를 더 구비하는 것을 특징으로 하는 무선통신장치.
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2023544784A JP2024506527A (ja) | 2021-01-25 | 2021-02-03 | 音声認識及び音声合成を利用した無線通信装置 |
CN202180091762.5A CN116848581A (zh) | 2021-01-25 | 2021-02-03 | 使用语音识别和语音合成的无线通信设备 |
US17/439,197 US11942072B2 (en) | 2021-01-25 | 2021-02-03 | Wireless communication device using voice recognition and voice synthesis |
EP21921404.6A EP4283612A1 (en) | 2021-01-25 | 2021-02-03 | Wireless communication device using voice recognition and voice synthesis |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020210010472A KR102548618B1 (ko) | 2021-01-25 | 2021-01-25 | 음성인식 및 음성합성을 이용한 무선통신장치 |
KR10-2021-0010472 | 2021-01-25 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022158633A1 true WO2022158633A1 (ko) | 2022-07-28 |
Family
ID=82549119
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2021/001397 WO2022158633A1 (ko) | 2021-01-25 | 2021-02-03 | 음성인식 및 음성합성을 이용한 무선통신장치 |
Country Status (6)
Country | Link |
---|---|
US (1) | US11942072B2 (ko) |
EP (1) | EP4283612A1 (ko) |
JP (1) | JP2024506527A (ko) |
KR (1) | KR102548618B1 (ko) |
CN (1) | CN116848581A (ko) |
WO (1) | WO2022158633A1 (ko) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH10260692A (ja) * | 1997-03-18 | 1998-09-29 | Toshiba Corp | 音声の認識合成符号化/復号化方法及び音声符号化/復号化システム |
KR20060124063A (ko) * | 2005-05-30 | 2006-12-05 | 충남대학교산학협력단 | 3축 위성 안테나 |
KR100819928B1 (ko) * | 2007-04-26 | 2008-04-08 | (주)부성큐 | 휴대 단말기의 음성 인식장치 및 그 방법 |
KR101102520B1 (ko) * | 2011-02-22 | 2012-01-03 | 이윤재 | 한글 자모의 메트릭스 결합 관계를 기반으로 하는 시청각 한글학습 시스템 및 그 운영 방법 |
KR20180049422A (ko) * | 2016-11-01 | 2018-05-11 | 한국전자통신연구원 | 화자 인증 시스템 및 그 방법 |
KR20190024148A (ko) * | 2017-08-31 | 2019-03-08 | 경북대학교 산학협력단 | 음성 인식 장치 및 음성 인식 방법 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100270237B1 (ko) * | 1997-10-15 | 2000-10-16 | 윌리암 손 | 무선네트워크상에서음성대화식인터넷접속휴대통신장치및방법 |
US9666204B2 (en) * | 2014-04-30 | 2017-05-30 | Qualcomm Incorporated | Voice profile management and speech signal generation |
-
2021
- 2021-01-25 KR KR1020210010472A patent/KR102548618B1/ko active IP Right Grant
- 2021-02-03 JP JP2023544784A patent/JP2024506527A/ja active Pending
- 2021-02-03 US US17/439,197 patent/US11942072B2/en active Active
- 2021-02-03 WO PCT/KR2021/001397 patent/WO2022158633A1/ko active Application Filing
- 2021-02-03 EP EP21921404.6A patent/EP4283612A1/en active Pending
- 2021-02-03 CN CN202180091762.5A patent/CN116848581A/zh active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH10260692A (ja) * | 1997-03-18 | 1998-09-29 | Toshiba Corp | 音声の認識合成符号化/復号化方法及び音声符号化/復号化システム |
KR20060124063A (ko) * | 2005-05-30 | 2006-12-05 | 충남대학교산학협력단 | 3축 위성 안테나 |
KR100819928B1 (ko) * | 2007-04-26 | 2008-04-08 | (주)부성큐 | 휴대 단말기의 음성 인식장치 및 그 방법 |
KR101102520B1 (ko) * | 2011-02-22 | 2012-01-03 | 이윤재 | 한글 자모의 메트릭스 결합 관계를 기반으로 하는 시청각 한글학습 시스템 및 그 운영 방법 |
KR20180049422A (ko) * | 2016-11-01 | 2018-05-11 | 한국전자통신연구원 | 화자 인증 시스템 및 그 방법 |
KR20190024148A (ko) * | 2017-08-31 | 2019-03-08 | 경북대학교 산학협력단 | 음성 인식 장치 및 음성 인식 방법 |
Also Published As
Publication number | Publication date |
---|---|
KR102548618B1 (ko) | 2023-06-27 |
KR20220107631A (ko) | 2022-08-02 |
JP2024506527A (ja) | 2024-02-14 |
US20230090052A1 (en) | 2023-03-23 |
CN116848581A (zh) | 2023-10-03 |
EP4283612A1 (en) | 2023-11-29 |
US11942072B2 (en) | 2024-03-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US4707858A (en) | Utilizing word-to-digital conversion | |
CN101923858B (zh) | 一种实时同步互译语音终端 | |
FI962572A0 (fi) | Hajautettu äänentunnistusjärjestelmä | |
US6980834B2 (en) | Method and apparatus for performing text to speech synthesis | |
WO2022158633A1 (ko) | 음성인식 및 음성합성을 이용한 무선통신장치 | |
JPH09292971A (ja) | 翻訳装置 | |
CN101500028A (zh) | 采用读写模式的通信终端以及实现读写模式通信的方法 | |
JPH03132797A (ja) | 音声認識装置 | |
EP1298647A1 (en) | A communication device and a method for transmitting and receiving of natural speech, comprising a speech recognition module coupled to an encoder | |
JPH06181463A (ja) | デジタル通信装置 | |
JPS6171730A (ja) | 音声デ−タ転送方式 | |
JPH07175495A (ja) | 音声認識方式 | |
WO2002001551A9 (en) | Input device for voice recognition and articulation using keystroke data. | |
JP2630307B2 (ja) | 通話路試験装置 | |
JPH04258037A (ja) | 音声符号化装置 | |
JPH01316874A (ja) | 対話翻訳方式 | |
WO2001042875A2 (en) | Language translation voice telephony | |
KR950010434A (ko) | 음성 및 데이타 통신 장치 | |
JPS58151726A (ja) | 衛星回線による音声伝送方式 | |
JPS61274534A (ja) | 音声伝達システム | |
JPS6073699A (ja) | 音声伝送装置 | |
JPH0220148A (ja) | 音声データパケット伝送装置 | |
JPH01208931A (ja) | 補助伝送路伝送方式 | |
NO920906D0 (no) | Fremgangsmaate og innretning for kodering og dekodering avet analogt lavfrekvens-signal i et pcm-format, fortrinnsvis for ovrfoering av taleinformasjon ved et hoeyttalende telefonanlegg | |
Mullen | Unlimited vocabulary speech synthesis with low data rates |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21921404 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 202180091762.5 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2023544784 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2021921404 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2021921404 Country of ref document: EP Effective date: 20230825 |