JP2021028715A - Terminal and operating method thereof - Google Patents
Terminal and operating method thereof
- Publication number
- JP2021028715A (application JP2020134046A)
- Authority
- JP
- Japan
- Prior art keywords
- voice
- terminal
- host
- user
- specific text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4788—Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/21—Server components or server architectures
- H04N21/218—Source of audio or video content, e.g. local disk arrays
- H04N21/2187—Live feed
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/414—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
- H04N21/41407—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/431—Generation of visual interfaces for content selection or interaction; Content or additional data rendering
- H04N21/4312—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
- H04N21/4316—Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations for displaying supplemental content in a region of the screen, e.g. an advertisement in a separate window
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/466—Learning process for intelligent management, e.g. learning user preferences for recommending movies
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/8106—Monomedia components thereof involving special audio data, e.g. different tracks for different languages
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/60—Network streaming of media packets
- H04L65/61—Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio
- H04L65/611—Network streaming of media packets for supporting one-way streaming services, e.g. Internet radio for multicast or broadcast
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Business, Economics & Management (AREA)
- Marketing (AREA)
- Telephonic Communication Services (AREA)
- User Interface Of Digital Computer (AREA)
- Digital Computer Display Output (AREA)
Claims (15)
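1. A method of operating a terminal that provides a service enabling real-time broadcasting through a broadcast channel, the method comprising:
   - starting, through the broadcast channel, the real-time broadcast in which a user of the terminal is a host;
   - when the real-time broadcast starts, dividing a display of the terminal into two areas and allocating one of the two areas to the host;
   - recognizing a voice of the host during the real-time broadcast;
   - receiving, from a terminal of a specific guest among at least one guest who has entered the broadcast channel, one item selected from among at least one item, and a specific text;
   - generating a voice message in which the specific text is converted into the voice of the host or a voice of the specific guest; and
   - outputting the voice message.
2. The method of claim 1, further comprising preparing an algorithm for generating the voice message in which the specific text is converted into the voice of the host.
3. The method of claim 2, wherein generating the voice message in which the specific text is converted into the voice of the host comprises generating the voice message by applying the voice of the host and the specific text to the algorithm.
4. The method of claim 2, wherein preparing the algorithm for generating the voice message in which the specific text is converted into the voice of the host comprises preparing a learning model trained on correlations between a plurality of voices, a plurality of texts, and a plurality of voice messages in which each of the plurality of texts is converted into the plurality of voices.
5. The method of claim 1, further comprising:
   - extracting voice features from the voice of the host;
   - generating a comparison voice based on the extracted voice features;
   - comparing the voice of the host with the comparison voice; and
   - storing the voice features according to a result of the comparison.
6. The method of claim 5, wherein comparing the voice of the host with the comparison voice comprises calculating an error in sampling values between the voice of the host and the comparison voice, and wherein storing the voice features according to the result of the comparison comprises storing the voice features when the error is less than or equal to a reference value.
7. The method of claim 5, wherein generating the voice message in which the specific text is converted into the voice of the host comprises generating the voice message based on the specific text and the voice features.
8. The method of claim 1, wherein the at least one item has monetary value within the service.
9. The method of claim 1, further comprising:
   - a first guest, among the at least one guest who has entered the broadcast channel, directly participating in the real-time broadcast; and
   - allocating, to the first guest, the other of the two areas of the display, excluding the area allocated to the host.
10. A computer-readable recording medium on which a program for performing the method of any one of claims 1 to 9 is recorded.
11. A terminal comprising:
   - a display that is divided into two areas when a real-time broadcast in which a user of the terminal is a host starts through a broadcast channel, one of the two areas being allocated to the host;
   - an input/output interface that receives a voice of the host;
   - a communication interface that receives, from a terminal of a specific guest among at least one guest who has entered the broadcast channel, one item selected from among at least one item, and a specific text; and
   - a processor that generates a voice message in which the specific text is converted into the voice of the host or a voice of the specific guest.
12. The terminal of claim 11, wherein the processor prepares a learning model trained on correlations between a plurality of voices, a plurality of texts, and a plurality of voice messages in which each of the plurality of texts is converted into the plurality of voices, and generates the voice message by applying the voice of the host and the specific text to the learning model.
13. The terminal of claim 12, further comprising a memory that stores the learning model.
14. The terminal of claim 11, wherein the processor extracts voice features from the voice of the host, generates a comparison voice based on the extracted voice features, compares the voice of the host with the comparison voice, and, according to a result of the comparison, generates the voice message based on the specific text and the voice features.
15. The terminal of claim 12, wherein, when a first guest among the at least one guest who has entered the broadcast channel directly participates in the real-time broadcast, the other of the two areas of the display, excluding the area allocated to the host, is allocated to the first guest.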
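Claims 5 and 6 describe a check on the extracted voice features: a comparison voice is generated from the features, compared with the host's voice, and the features are stored only when the sampling-value error is at or below a reference value. The following is a minimal Python sketch of that gating step only. The mean-squared-error metric, the toy `extract_features` and `synthesize` helpers, and the example reference value of 0.1 are illustrative assumptions; the claims do not specify an error metric, a feature set, or a threshold.

```python
from typing import Callable, Dict, List, Optional, Sequence

Samples = Sequence[float]
Features = Dict[str, float]


def sampling_error(host: Samples, comparison: Samples) -> float:
    """Mean squared difference between paired sample values (assumed metric)."""
    if not host or len(host) != len(comparison):
        raise ValueError("signals must be non-empty and of equal length")
    return sum((h - c) ** 2 for h, c in zip(host, comparison)) / len(host)


def extract_features(voice: Samples) -> Features:
    """Toy stand-in for feature extraction: keeps only the mean amplitude."""
    return {"mean": sum(voice) / len(voice)}


def synthesize(features: Features, length: int) -> List[float]:
    """Toy stand-in for comparison-voice generation: a constant signal."""
    return [features["mean"]] * length


def store_if_close(
    host_voice: Samples,
    reference_value: float,
    extract: Callable[[Samples], Features] = extract_features,
    generate: Callable[[Features, int], List[float]] = synthesize,
) -> Optional[Features]:
    """Return the features to store, or None when the error is too large."""
    features = extract(host_voice)
    comparison = generate(features, len(host_voice))
    # Claim 6: store only when the error is at or below the reference value.
    if sampling_error(host_voice, comparison) <= reference_value:
        return features
    return None


if __name__ == "__main__":
    steady = [0.5, 0.5, 0.5, 0.5]   # the toy features reproduce this exactly
    varying = [0.0, 1.0, 0.0, 1.0]  # the toy features reproduce this poorly
    print(store_if_close(steady, reference_value=0.1))
    print(store_if_close(varying, reference_value=0.1))
```

In a real system the two helpers would be replaced by an actual feature extractor and speech synthesizer; the sketch only shows how the comparison result decides whether the features are kept for later voice-message generation (claim 7).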
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022103809A JP2022137114A (ja) | 2019-08-09 | 2022-06-28 | Terminal and operating method thereof |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR10-2019-0097398 | 2019-08-09 | | |
KR1020190097398A KR102430020B1 (ko) | 2019-08-09 | 2019-08-09 | Terminal and operating method thereof |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022103809A Division JP2022137114A (ja) | 2019-08-09 | 2022-06-28 | Terminal and operating method thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
JP2021028715A (ja) | 2021-02-25 |
Family
ID=71950558
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2020134046A Pending JP2021028715A (ja) | 2019-08-09 | 2020-08-06 | Terminal and operating method thereof |
JP2022103809A Pending JP2022137114A (ja) | 2019-08-09 | 2022-06-28 | Terminal and operating method thereof |
Family Applications After (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022103809A Pending JP2022137114A (ja) | 2019-08-09 | 2022-06-28 | Terminal and operating method thereof |
Country Status (4)
Country | Link |
---|---|
US (2) | US11615777B2 (ja) |
EP (1) | EP3772732A1 (ja) |
JP (2) | JP2021028715A (ja) |
KR (1) | KR102430020B1 (ja) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109766473B (zh) * | 2018-11-30 | 2019-12-24 | 北京达佳互联信息技术有限公司 | Information interaction method and apparatus, electronic device, and storage medium |
US20230403435A1 (en) * | 2022-06-08 | 2023-12-14 | Hytto Pte, Ltd | Method and system for processing information across broadcast platforms |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003202885A (ja) * | 2001-12-28 | 2003-07-18 | Canon Electronics Inc | Information processing apparatus and method |
WO2018074516A1 (ja) * | 2016-10-21 | 2018-04-26 | 株式会社Myth | Information processing system |
KR20190060838A (ko) * | 2016-10-21 | 2019-06-03 | 슈가 가부시키가이샤 | Information processing system |
Family Cites Families (56)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000250826A (ja) * | 1999-03-01 | 2000-09-14 | Fujitsu Ltd | State change notification method and state change notification system |
US6804675B1 (en) * | 1999-05-11 | 2004-10-12 | Maquis Techtrix, Llc | Online content provider system and method |
US6571234B1 (en) * | 1999-05-11 | 2003-05-27 | Prophet Financial Systems, Inc. | System and method for managing online message board |
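KR20000036463A (ko) * | 2000-03-15 | 2000-07-05 | 한남용 | Virtual reality conversation system and method using the Internet |
KR20010091677A (ko) | 2000-03-17 | 2001-10-23 | 최승현 | Configuration and operating method of a selective online conversation system using speech synthesis |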
US7277855B1 (en) * | 2000-06-30 | 2007-10-02 | At&T Corp. | Personalized text-to-speech services |
US6970820B2 (en) * | 2001-02-26 | 2005-11-29 | Matsushita Electric Industrial Co., Ltd. | Voice personalization of speech synthesizer |
US6804647B1 (en) * | 2001-03-13 | 2004-10-12 | Nuance Communications | Method and system for on-line unsupervised adaptation in speaker verification |
US7483832B2 (en) * | 2001-12-10 | 2009-01-27 | At&T Intellectual Property I, L.P. | Method and system for customizing voice translation of text to speech |
US7685237B1 (en) * | 2002-05-31 | 2010-03-23 | Aol Inc. | Multiple personalities in chat communications |
US7305438B2 (en) * | 2003-12-09 | 2007-12-04 | International Business Machines Corporation | Method and system for voice on demand private message chat |
US20060210034A1 (en) * | 2005-03-17 | 2006-09-21 | Beadle Bruce A | Enabling a user to store a messaging session entry for delivery when an intended recipient is next available |
US20060235932A1 (en) * | 2005-04-18 | 2006-10-19 | International Business Machines Corporation | Chat server mute capability |
US20070005754A1 (en) * | 2005-06-30 | 2007-01-04 | Microsoft Corporation | Systems and methods for triaging attention for providing awareness of communications session activity |
KR100787890B1 (ko) * | 2006-03-06 | 2007-12-27 | 주식회사 모빌리언스 | Wireless payment system in a mobile environment using gift requests for Internet items, and wireless payment method thereof |
US7996222B2 (en) * | 2006-09-29 | 2011-08-09 | Nokia Corporation | Prosody conversion |
US20080147385A1 (en) * | 2006-12-15 | 2008-06-19 | Nokia Corporation | Memory-efficient method for high-quality codebook based voice conversion |
JP2008185805A (ja) * | 2007-01-30 | 2008-08-14 | Internatl Business Mach Corp <Ibm> | Technique for generating high-quality synthesized speech |
US7826872B2 (en) * | 2007-02-28 | 2010-11-02 | Sony Ericsson Mobile Communications Ab | Audio nickname tag associated with PTT user |
US8886537B2 (en) * | 2007-03-20 | 2014-11-11 | Nuance Communications, Inc. | Method and system for text-to-speech synthesis with personalized voice |
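CN101359473A (zh) * | 2007-07-30 | 2009-02-04 | 国际商业机器公司 | Method and apparatus for automatically performing voice conversion |
KR100920174B1 (ko) * | 2007-09-14 | 2009-10-06 | 주식회사 케이티 | Apparatus, system, and method for providing a TTS service based on the user's own voice |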
US8224648B2 (en) * | 2007-12-28 | 2012-07-17 | Nokia Corporation | Hybrid approach in voice conversion |
US20090177473A1 (en) * | 2008-01-07 | 2009-07-09 | Aaron Andrew S | Applying vocal characteristics from a target speaker to a source speaker for synthetic speech |
US8401849B2 (en) * | 2008-12-18 | 2013-03-19 | Lessac Technologies, Inc. | Methods employing phase state analysis for use in speech synthesis and recognition |
US8731371B2 (en) * | 2009-08-12 | 2014-05-20 | Sony Corporation | Information processing system and information processing device |
US20120226500A1 (en) * | 2011-03-02 | 2012-09-06 | Sony Corporation | System and method for content rendering including synthetic narration |
EP2737480A4 (en) * | 2011-07-25 | 2015-03-18 | Incorporated Thotra | SYSTEM AND METHOD FOR ACOUSTIC TRANSFORMATION |
US9495450B2 (en) * | 2012-06-12 | 2016-11-15 | Nuance Communications, Inc. | Audio animation methods and apparatus utilizing a probability criterion for frame transitions |
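KR20140120560A (ko) * | 2013-04-03 | 2014-10-14 | 삼성전자주식회사 | Method of controlling an interpretation apparatus, method of controlling an interpretation server, method of controlling an interpretation system, and user terminal |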
WO2014161091A1 (en) * | 2013-04-04 | 2014-10-09 | Rand James S | Unified communications system and method |
GB201315142D0 (en) * | 2013-08-23 | 2013-10-09 | Ucl Business Plc | Audio-Visual Dialogue System and Method |
US10008216B2 (en) * | 2014-04-15 | 2018-06-26 | Speech Morphing Systems, Inc. | Method and apparatus for exemplary morphing computer system background |
US20150379654A1 (en) * | 2014-06-26 | 2015-12-31 | Xerox Corporation | Methods and systems for digitally capturing and managing attendance |
US9613620B2 (en) * | 2014-07-03 | 2017-04-04 | Google Inc. | Methods and systems for voice conversion |
US9324318B1 (en) * | 2014-10-14 | 2016-04-26 | Nookster, Inc. | Creation and application of audio avatars from human voices |
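CN104918124B (zh) * | 2015-05-11 | 2017-12-08 | 腾讯科技(北京)有限公司 | Live broadcast interaction system, information sending method, and information receiving method and apparatus |
KR101632435B1 (ko) * | 2015-10-20 | 2016-06-21 | 이요훈 | SNS system using a wired/wireless IP-based GUI, and calling method using the same |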
US20170171509A1 (en) * | 2015-12-14 | 2017-06-15 | Le Holdings (Beijing) Co., Ltd. | Method and electronic apparatus for realizing two-person simultaneous live video |
US10311855B2 (en) * | 2016-03-29 | 2019-06-04 | Speech Morphing Systems, Inc. | Method and apparatus for designating a soundalike voice to a target voice from a database of voices |
US10218939B2 (en) * | 2016-04-14 | 2019-02-26 | Popio Ip Holdings, Llc | Methods and systems for employing virtual support representatives in connection with mutli-pane video communications |
US10176819B2 (en) * | 2016-07-11 | 2019-01-08 | The Chinese University Of Hong Kong | Phonetic posteriorgrams for many-to-one voice conversion |
US20180063556A1 (en) * | 2016-08-29 | 2018-03-01 | YouNow, Inc. | Systems and methods for providing guest broadcasting on a live stream video platform |
US20180090126A1 (en) * | 2016-09-26 | 2018-03-29 | Lenovo (Singapore) Pte. Ltd. | Vocal output of textual communications in senders voice |
US10777201B2 (en) * | 2016-11-04 | 2020-09-15 | Microsoft Technology Licensing, Llc | Voice enabled bot platform |
KR20180059322A (ko) | 2016-11-25 | 2018-06-04 | 주식회사 투스라이프 | Apparatus and method for setting effects based on donation amount |
US10403287B2 (en) * | 2017-01-19 | 2019-09-03 | International Business Machines Corporation | Managing users within a group that share a single teleconferencing device |
KR102136413B1 (ko) * | 2017-04-06 | 2020-07-21 | 주식회사 스무디 | Method, system, and non-transitory computer-readable recording medium for providing a multi-party communication service |
US20180316964A1 (en) * | 2017-04-28 | 2018-11-01 | K, Online Inc | Simultaneous live video amongst multiple users for discovery and sharing of information |
US10664524B2 (en) * | 2017-09-13 | 2020-05-26 | Facebook, Inc. | Highlighting portions of a live video broadcast |
EP3739572A4 (en) | 2018-01-11 | 2021-09-08 | Neosapience, Inc. | METHOD AND DEVICE FOR TEXT-TO-SPEECH SYNTHESIS USING MACHINE LEARNING AND COMPUTER-READABLE STORAGE MEDIUM |
US11238843B2 (en) * | 2018-02-09 | 2022-02-01 | Baidu Usa Llc | Systems and methods for neural voice cloning with a few samples |
US20200013422A1 (en) * | 2018-07-03 | 2020-01-09 | Ralph W. Matkin | System, Method, and Apparatus for Morphing of an Audio Track |
US10953332B2 (en) * | 2018-12-20 | 2021-03-23 | Roblox Corporation | Online gaming platform voice communication system |
US10902841B2 (en) * | 2019-02-15 | 2021-01-26 | International Business Machines Corporation | Personalized custom synthetic speech |
US10930263B1 (en) * | 2019-03-28 | 2021-02-23 | Amazon Technologies, Inc. | Automatic voice dubbing for media content localization |
2019
- 2019-08-09 KR: application KR1020190097398A, patent KR102430020B1 (ko), active, IP Right Grant
2020
- 2020-08-05 EP: application EP20189677.6A, patent EP3772732A1 (en), active, Pending
- 2020-08-06 US: application US16/987,111, patent US11615777B2 (en), active, Active
- 2020-08-06 JP: application JP2020134046A, patent JP2021028715A (ja), active, Pending
2022
- 2022-06-28 JP: application JP2022103809A, patent JP2022137114A (ja), active, Pending
2023
- 2023-03-14 US: application US18/183,860, patent US20230215418A1 (en), active, Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003202885A (ja) * | 2001-12-28 | 2003-07-18 | Canon Electronics Inc | Information processing apparatus and method |
WO2018074516A1 (ja) * | 2016-10-21 | 2018-04-26 | 株式会社Myth | Information processing system |
KR20190060838A (ko) * | 2016-10-21 | 2019-06-03 | 슈가 가부시키가이샤 | Information processing system |
CN109964212A (zh) * | 2016-10-21 | 2019-07-02 | 舒格有限公司 | Information processing system |
US20190334842A1 (en) * | 2016-10-21 | 2019-10-31 | Sugar Inc. | Information processing system |
Also Published As
Publication number | Publication date |
---|---|
JP2022137114A (ja) | 2022-09-21 |
US20210043187A1 (en) | 2021-02-11 |
EP3772732A1 (en) | 2021-02-10 |
US20230215418A1 (en) | 2023-07-06 |
KR20210017708A (ko) | 2021-02-17 |
US11615777B2 (en) | 2023-03-28 |
KR102430020B1 (ko) | 2022-08-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111699528B (zh) | | Electronic device and method of performing functions of the electronic device |
US10546578B2 (en) | | Method and device for transmitting and receiving audio data |
CN108304846B (zh) | | Image recognition method and apparatus, and storage medium |
KR102283972B1 (ko) | | Communication device, server, and operating method |
US11308955B2 (en) | | Method and apparatus for recognizing a voice |
US10089974B2 (en) | | Speech recognition and text-to-speech learning system |
US11430438B2 (en) | | Electronic device providing response corresponding to user conversation style and emotion and method of operating same |
US20230215418A1 (en) | | Terminal and Operating Method Thereof |
JP2019102063A (ja) | | Page control method and apparatus |
CN105393302A (zh) | | Multi-level speech recognition |
US11416703B2 (en) | | Network optimization method and apparatus, image processing method and apparatus, and storage medium |
JP6732977B2 (ja) | | Server and operating method thereof |
US11606397B2 (en) | | Server and operating method thereof |
WO2019101099A1 (zh) | | Video program identification method, device, terminal, system, and storage medium |
CN105100672A (zh) | | Display apparatus and method of performing a video call thereof |
CN114333804A (zh) | | Audio classification and recognition method and apparatus, electronic device, and storage medium |
US20200410605A1 (en) | | Mobile, server and operating method thereof |
CN109189822A (zh) | | Data processing method and apparatus |
KR102315211B1 (ko) | | Terminal and operating method thereof |
CN110865853A (zh) | | Intelligent operation method and apparatus for cloud services, and electronic device |
CN112771608A (zh) | | Voice information processing method and apparatus, storage medium, and electronic device |
US20240104420A1 (en) | | Accurate and efficient inference in multi-device environments |
US20240015262A1 (en) | | Facilitating avatar modifications for learning and other videotelephony sessions in advanced networks |
CN116150597A (zh) | | Multi-network-based well test interpretation method and apparatus, electronic device, and storage medium |
CN116978359A (zh) | | Phoneme recognition method and apparatus, electronic device, and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2020-08-06 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
2021-08-31 | A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007 |
2021-09-10 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
2021-12-10 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
2021-12-24 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
2022-03-28 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
2022-10-21 | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02 |