KR20220088633A - 음성을 텍스트로 변환하는 방법, 시스템, 장치 및 매체 - Google Patents
음성을 텍스트로 변환하는 방법, 시스템, 장치 및 매체 Download PDFInfo
- Publication number
- KR20220088633A KR20220088633A KR1020217034957A KR20217034957A KR20220088633A KR 20220088633 A KR20220088633 A KR 20220088633A KR 1020217034957 A KR1020217034957 A KR 1020217034957A KR 20217034957 A KR20217034957 A KR 20217034957A KR 20220088633 A KR20220088633 A KR 20220088633A
- Authority
- KR
- South Korea
- Prior art keywords
- client
- language category
- voice
- chat message
- user account
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 82
- 238000006243 chemical reaction Methods 0.000 claims abstract description 77
- 230000004044 response Effects 0.000 claims abstract description 40
- 230000000694 effects Effects 0.000 claims abstract description 32
- 230000015654 memory Effects 0.000 claims description 23
- 238000004590 computer program Methods 0.000 claims description 9
- 238000013519 translation Methods 0.000 claims description 7
- 238000010586 diagram Methods 0.000 description 16
- 238000004891 communication Methods 0.000 description 15
- 238000013461 design Methods 0.000 description 14
- 230000002093 peripheral effect Effects 0.000 description 6
- 238000003825 pressing Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 5
- 238000004088 simulation Methods 0.000 description 5
- 230000004083 survival effect Effects 0.000 description 5
- 230000003993 interaction Effects 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000035876 healing Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000009184 walking Effects 0.000 description 3
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 2
- 238000005266 casting Methods 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 230000009193 crawling Effects 0.000 description 2
- 230000009187 flying Effects 0.000 description 2
- 230000009191 jumping Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000010079 rubber tapping Methods 0.000 description 2
- 230000009183 running Effects 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000004575 stone Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000002618 waking effect Effects 0.000 description 1
- PICXIOQBANWBIZ-UHFFFAOYSA-N zinc;1-oxidopyridine-2-thione Chemical class [Zn+2].[O-]N1C=CC=CC1=S.[O-]N1C=CC=CC1=S PICXIOQBANWBIZ-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F13/00—Video games, i.e. games using an electronically generated display having two or more dimensions
- A63F13/20—Input arrangements for video game devices
- A63F13/21—Input arrangements for video game devices characterised by their sensors, purposes or types
- A63F13/215—Input arrangements for video game devices characterised by their sensors, purposes or types comprising means for detecting acoustic signals, e.g. using a microphone
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F13/00—Video games, i.e. games using an electronically generated display having two or more dimensions
- A63F13/85—Providing additional services to players
- A63F13/87—Communicating with other players during game play, e.g. by e-mail or chat
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F13/00—Video games, i.e. games using an electronically generated display having two or more dimensions
- A63F13/90—Constructional details or arrangements of video game devices not provided for in groups A63F13/20 or A63F13/25, e.g. housing, wiring, connections or cabinets
- A63F13/92—Video game devices specially adapted to be hand-held while playing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/263—Language identification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L51/00—User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
- H04L51/04—Real-time or near real-time messaging, e.g. instant messaging [IM]
- H04L51/046—Interoperability with other network applications or services
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L51/00—User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
- H04L51/06—Message adaptation to terminal or network requirements
- H04L51/063—Content adaptation, e.g. replacement of unsuitable content
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Information Transfer Between Computers (AREA)
- User Interface Of Digital Computer (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011504638.0A CN112494958B (zh) | 2020-12-18 | 2020-12-18 | 语音转换文字的方法、系统、设备及介质 |
CN202011504638.0 | 2020-12-18 | ||
PCT/CN2021/115897 WO2022127197A1 (zh) | 2020-12-18 | 2021-09-01 | 语音转换文字的方法、系统、设备及介质 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20220088633A true KR20220088633A (ko) | 2022-06-28 |
Family
ID=82022437
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020217034957A KR20220088633A (ko) | 2020-12-18 | 2021-09-01 | 음성을 텍스트로 변환하는 방법, 시스템, 장치 및 매체 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20220199087A1 (ja) |
JP (1) | JP2023510057A (ja) |
KR (1) | KR20220088633A (ja) |
Family Cites Families (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020069048A1 (en) * | 2000-04-07 | 2002-06-06 | Sadhwani Deepak Kishinchand | Communication system |
JP2003205176A (ja) * | 2002-01-15 | 2003-07-22 | Arika:Kk | 通信ネットワークを介した麻雀ゲーム実行方式 |
US10987597B2 (en) * | 2002-12-10 | 2021-04-27 | Sony Interactive Entertainment LLC | System and method for managing audio and video channels for video game players and spectators |
US7305438B2 (en) * | 2003-12-09 | 2007-12-04 | International Business Machines Corporation | Method and system for voice on demand private message chat |
US7398215B2 (en) * | 2003-12-24 | 2008-07-08 | Inter-Tel, Inc. | Prompt language translation for a telecommunications system |
US8700396B1 (en) * | 2012-09-11 | 2014-04-15 | Google Inc. | Generating speech data collection prompts |
US20140164476A1 (en) * | 2012-12-06 | 2014-06-12 | At&T Intellectual Property I, Lp | Apparatus and method for providing a virtual assistant |
US9231898B2 (en) * | 2013-02-08 | 2016-01-05 | Machine Zone, Inc. | Systems and methods for multi-user multi-lingual communications |
US9298703B2 (en) * | 2013-02-08 | 2016-03-29 | Machine Zone, Inc. | Systems and methods for incentivizing user feedback for translation processing |
US8996355B2 (en) * | 2013-02-08 | 2015-03-31 | Machine Zone, Inc. | Systems and methods for reviewing histories of text messages from multi-user multi-lingual communications |
JP2014167517A (ja) * | 2013-02-28 | 2014-09-11 | Nippon Telegraph & Telephone East Corp | 会話提供システム、ゲーム提供システム、会話提供方法、ゲーム提供方法及びプログラム |
US9262405B1 (en) * | 2013-02-28 | 2016-02-16 | Google Inc. | Systems and methods of serving a content item to a user in a specific language |
US20150088485A1 (en) * | 2013-09-24 | 2015-03-26 | Moayad Alhabobi | Computerized system for inter-language communication |
JP6148163B2 (ja) * | 2013-11-29 | 2017-06-14 | 本田技研工業株式会社 | 会話支援装置、会話支援装置の制御方法、及び会話支援装置のプログラム |
KR102214178B1 (ko) * | 2013-12-13 | 2021-02-10 | 한국전자통신연구원 | 자동 통역 장치 및 방법 |
WO2017099483A1 (en) * | 2015-12-09 | 2017-06-15 | Samsung Electronics Co., Ltd. | Device and method for providing user-customized content |
KR101861006B1 (ko) * | 2016-08-18 | 2018-05-28 | 주식회사 하이퍼커넥트 | 통역 장치 및 방법 |
US10430042B2 (en) * | 2016-09-30 | 2019-10-01 | Sony Interactive Entertainment Inc. | Interaction context-based virtual reality |
US20200125643A1 (en) * | 2017-03-24 | 2020-04-23 | Jose Rito Gutierrez | Mobile translation application and method |
US10586369B1 (en) * | 2018-01-31 | 2020-03-10 | Amazon Technologies, Inc. | Using dialog and contextual data of a virtual reality environment to create metadata to drive avatar animation |
US11361211B2 (en) * | 2018-06-20 | 2022-06-14 | Accenture Global Solutions Limited | Artificial intelligence (AI) based chatbot creation and communication system |
CN109327613B (zh) * | 2018-10-15 | 2020-09-29 | 华为技术有限公司 | 一种基于语音通话翻译能力的协商方法及电子设备 |
JP7179093B2 (ja) * | 2019-01-24 | 2022-11-28 | 株式会社ソニー・インタラクティブエンタテインメント | 情報処理システム及び情報処理装置 |
US11328131B2 (en) * | 2019-03-12 | 2022-05-10 | Jordan Abbott ORLICK | Real-time chat and voice translator |
US10599786B1 (en) * | 2019-03-19 | 2020-03-24 | Servicenow, Inc. | Dynamic translation |
JP7188302B2 (ja) * | 2019-07-08 | 2022-12-13 | トヨタ自動車株式会社 | サーバ装置、車載装置、情報処理方法、及び情報処理プログラム |
CN111309207A (zh) * | 2020-02-06 | 2020-06-19 | 北京一起教育信息咨询有限责任公司 | 一种译文显示方法、装置、电子设备及存储介质 |
US11358054B2 (en) * | 2020-02-18 | 2022-06-14 | Electronic Arts Inc. | Systems and methods for transcribing user interface elements of a game application into haptic feedback |
US11023688B1 (en) * | 2020-05-27 | 2021-06-01 | Roblox Corporation | Generation of text tags from game communication transcripts |
CN111672099B (zh) * | 2020-05-28 | 2023-03-24 | 腾讯科技(深圳)有限公司 | 虚拟场景中的信息展示方法、装置、设备及存储介质 |
US11321856B1 (en) * | 2020-12-18 | 2022-05-03 | Roblox Corporation | Detection of inauthentic virtual objects |
-
2021
- 2021-09-01 JP JP2021564719A patent/JP2023510057A/ja active Pending
- 2021-09-01 KR KR1020217034957A patent/KR20220088633A/ko unknown
- 2021-10-13 US US17/500,011 patent/US20220199087A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
US20220199087A1 (en) | 2022-06-23 |
JP2023510057A (ja) | 2023-03-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11439917B2 (en) | Voice help system using artificial intelligence | |
CN112691377B (zh) | 虚拟角色的控制方法、装置、电子设备及存储介质 | |
CN111589126B (zh) | 虚拟对象的控制方法、装置、设备及存储介质 | |
US8784214B2 (en) | Method and system for establishing location-based leaderboard | |
JP2022527662A (ja) | 仮想オブジェクトの制御方法、装置、機器及びコンピュータプログラム | |
Oppermann et al. | Playing on AREEF: evaluation of an underwater augmented reality game for kids | |
US11931653B2 (en) | Virtual object control method and apparatus, terminal, and storage medium | |
US20220379214A1 (en) | Method and apparatus for a control interface in a virtual environment | |
Sreedharan et al. | 3D input for 3D worlds | |
WO2022127197A1 (zh) | 语音转换文字的方法、系统、设备及介质 | |
TWI831074B (zh) | 虛擬場景中的信息處理方法、裝置、設備、媒體及程式產品 | |
CN112691366B (zh) | 虚拟道具的显示方法、装置、设备及介质 | |
US20220288497A1 (en) | Method and apparatus for displaying pre-ordered prop, device, medium, and product | |
US20230072463A1 (en) | Contact information presentation | |
KR20220088633A (ko) | 음성을 텍스트로 변환하는 방법, 시스템, 장치 및 매체 | |
CN114053693B (zh) | 虚拟场景中的对象控制方法、装置及终端设备 | |
CN113018862B (zh) | 虚拟对象的控制方法、装置、电子设备及存储介质 | |
KR20230130109A (ko) | 가상 시나리오 디스플레이 방법, 장치, 단말 및 저장매체 | |
KR20190127308A (ko) | 게임 동작 예측 장치 및 방법 | |
JP7170454B2 (ja) | システム、端末装置及びサーバ | |
WO2024021847A1 (zh) | 虚拟对象的标记方法、装置、终端及存储介质 | |
WO2024021781A9 (zh) | 虚拟对象的交互方法、装置、计算机设备及存储介质 | |
KR102170825B1 (ko) | 게임 제어 장치 및 방법 | |
CN117654061A (zh) | 对象控制方法、装置、电子设备、存储介质及程序产品 | |
KR102463571B1 (ko) | 게임 제어 장치 및 방법 |