WO2020045712A1 - Dispositif, procédé et support d'enregistrement lisible par ordinateur de fourniture de service de messagerie instantanée asynchrone - Google Patents

Dispositif, procédé et support d'enregistrement lisible par ordinateur de fourniture de service de messagerie instantanée asynchrone Download PDF

Info

Publication number
WO2020045712A1
WO2020045712A1 PCT/KR2018/010172 KR2018010172W WO2020045712A1 WO 2020045712 A1 WO2020045712 A1 WO 2020045712A1 KR 2018010172 W KR2018010172 W KR 2018010172W WO 2020045712 A1 WO2020045712 A1 WO 2020045712A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
messages
user
message
character
Prior art date
Application number
PCT/KR2018/010172
Other languages
English (en)
Korean (ko)
Inventor
장준수
윤용기
장재웅
김세미
신희욱
김영상
임중신
정정화
Original Assignee
주식회사 닫닫닫
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 닫닫닫 filed Critical 주식회사 닫닫닫
Publication of WO2020045712A1 publication Critical patent/WO2020045712A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/50Business processes related to the communications industry
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/40Business processes related to the transportation industry
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/04Real-time or near real-time messaging, e.g. instant messaging [IM]
    • H04L51/046Interoperability with other network applications or services
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L51/00User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail
    • H04L51/07User-to-user messaging in packet-switching networks, transmitted according to store-and-forward or real-time protocols, e.g. e-mail characterised by the inclusion of specific contents
    • H04L51/10Multimedia information

Definitions

  • the present disclosure relates to an apparatus, a method and a computer readable storage medium for providing an asynchronous instant message service.
  • a user using an instant message service can transfer messages between two or more users relatively quickly and simply.
  • mobile devices such as smart phones are widely used, the use of instant messaging services has exploded.
  • the instant message service enables the transmission of a relatively short voice message.
  • the voice message is easier to input than the text message, and may transmit various features that the user who inputs the voice message wants to deliver.
  • the data size of the voice message is generally larger than that of the text message, and the user must perform an operation of playing each voice message (eg, clicking, touching, etc.) the voice message, and listening to the voice message being played.
  • voice messages may have spatial constraints, such as temporal constraints or memory space or physical space, compared to text messages that can be quickly identified by eye.
  • Prior Art Document 1 when a user inputs a voice message, performs a voice recognition from the voice message to generate a text message, and extracts the user's emotion from the voice message
  • Prior Art Document 1 A text representation method of changing and outputting a font of a text message generated from a voice message is disclosed.
  • the prior art document 1 extracts information on various emotions from a voice message and generates a text message using the information, but only a part of information that a user wants to deliver through the voice message can be obtained. The rest of the information may be lost.
  • the present disclosure is directed to solving the above problems, and provides an apparatus, method, and computer readable storage medium that are convenient for playing a voice message in an instant message service and are efficient in data management.
  • the present disclosure provides an apparatus, method, and computer readable storage medium capable of providing an improved instant message service utilizing characters in an instant message service.
  • a method of providing an instant message service performed under the control of a computing device of a first user includes receiving one or more voice messages including a voice message sent by a computing device of a second user and one or more voice recognized text messages corresponding to each of the one or more voice messages; Receiving a request for playing a voice message from a first user; And responsive to the voice message reproducing request, reproducing one or more voice messages sequentially received.
  • the method may include storing each of the one or more voice recognized text messages corresponding to the one or more voice messages.
  • the method may further include displaying, in response to the playing of the one or more voice messages, the one or more voice recognized text messages.
  • the exemplary method may further include receiving character information for the character of the second user.
  • sequentially playing the received one or more voice messages may include playing back the voice message along with displaying the character of the second user based on the character information.
  • An instant message service providing apparatus may include a communication module, a user interface module, a voice playback module, a display module, and a memory module.
  • the communication module may be configured to receive one or more voice messages including voice messages sent by the sender's external computing device and one or more voice recognized text messages corresponding to the one or more voice messages, respectively.
  • the user interface module may be configured to receive input for an instant message service.
  • the user interface module may receive a voice message playing request from a user.
  • the voice playback module may be configured to sequentially reproduce one or more voice messages received by the communication module in response to the voice message playback request received by the user interface module.
  • the display module may be configured to sequentially display one or more voice recognized text messages in response to the playback of the one or more voice messages by the voice playback module.
  • the memory module may be configured to store one or more voice messages and one or more voice recognized text messages.
  • the user interface module can be configured to receive an input voice message from the user.
  • the instant message service providing apparatus may further include a voice recognition module and a text recognition module.
  • the voice recognition module may be configured to perform voice recognition on the input voice message to obtain a voice recognized input text message.
  • the character recognition module may be configured to detect an operation character that enables selecting an operation of a character of a user displayed by the display module, from the voice recognized input text message obtained by the speech recognition module.
  • the instant message service providing apparatus may further include a camera module and an expression determining module.
  • the camera module may be configured to obtain face information of the user.
  • the facial expression determination module may be configured to determine the facial expression of the character of the user based on the face information.
  • a computer readable storage medium having stored thereon a computer program for providing an instant message service.
  • the computer program stored in the computer readable storage medium when executed, causes the first user's computing device to correspond to the one or more voice messages and the one or more voice messages, respectively, including the voice message sent by the second user's computing device.
  • Receiving one or more voice recognized text messages ; Storing at least one voice message and at least one voice recognized text message; Receiving a request for playing a voice message from a first user; Responsive to the voice message reproducing request, reproducing one or more voice messages sequentially received; And displaying the one or more voice recognized text messages in response to the reproduction of the one or more voice messages.
  • FIG. 1 is an exemplary environmental diagram illustrating an environment in which an instant message service is provided in accordance with at least some embodiments of the present disclosure
  • FIG. 2 illustrates an example of using an instant message service in accordance with the present disclosure
  • FIG. 3 illustrates an example of displaying and playing a message on a user's mobile device when using an instant message service according to FIG. 2;
  • FIGS. 2 and 3 shows an example showing a log of an instant message, in the example according to FIGS. 2 and 3;
  • FIG. 5 is a block diagram schematically illustrating an apparatus for providing an instant message service according to at least some embodiments of the present disclosure
  • FIG. 6 is a flow diagram illustrating an example process for a method of providing instant message services, in accordance with at least some embodiments of the present disclosure
  • FIG. 7 is a flowchart illustrating another example process for a method of providing instant message services, in accordance with at least some embodiments of the present disclosure.
  • FIG 8 illustrates an example computer program product that may be used to provide an instant message service, in accordance with at least some embodiments of the present disclosure.
  • FIG. 9 is a block diagram schematically illustrating a server for providing an instant message service according to at least some embodiments of the present disclosure.
  • the present disclosure generally relates to an apparatus, a method and a computer readable storage medium for providing an instant message service.
  • instant message service may refer to a service in which a message received by a recipient is displayed and / or played back if the sender sends a message, such as a text message, voice message, image, or the like, to one or more recipients.
  • character means an object that is represented by a computer graphic and has a face, and can be expressed in various forms such as, for example, a person, an animal, a virtual animal, a robot, and the like, and according to the present disclosure, a character is instant It is an object displayed on the message service and can be operated by the control of the user or the user's device.
  • module may refer to an apparatus, a server, a program unit, or a suitable combination thereof.
  • memory module refers not only to hardware for storing data in memory, but also to devices, servers, program units or their suitable controls for deleting data stored in such hardware according to predetermined conditions. May refer to a combination.
  • FIG. 1 is an exemplary environmental diagram illustrating an environment 100 in which an instant message service is provided, in accordance with at least some embodiments of the present disclosure.
  • Exemplary environment 100 includes network environment 110, one or more user devices 120-1, 120-2, 120-3, 120-4,. (130-1, 130-2, 130-3, ...; hereinafter referred to as 130).
  • the network environment 110 represents various environments for connecting the user device 120 and the user device 130 by wired or wireless communication.
  • the network environment 110 may include a server 115 for providing an instant message service.
  • the user device 120 may send an instant message to or receive an instant message from the user device 130 through the server 115 providing an instant message service.
  • user device 120 may send a notification for an instant message to user device 130 via server 115, where user device 130 receives the notification from the server and sends the user an instant message. Can be received directly from the device 120.
  • server 115 may serve as a relay server, and after user device 130 receives the notification, user device 130 may, for example, be peer-to-peer.
  • An instant message can be received by connecting directly to the user device 120 using a peer technique.
  • network environment 110 may further include a communication environment, such as a wired environment, a wireless environment, a base station, or the like, between user devices 120, 130.
  • the server 115 stores the instant message sent by the user device 120 and then, when the user device 130 is connected to the server 115, receives the instant message received from the user device 120. Can be configured to transmit.
  • server 115 may assist network environment 110 to establish a peer-to-peer connection between user devices 120, 130.
  • user devices 120 and 130 include communicable devices, such as smartphones, tablet computers, desktop computers, laptop computers, mobile phones, personal digital assistants (PDAs), special purpose devices, or any of the above functions.
  • Small form factor portable (mobile) electronic devices such as a fusion device.
  • user devices 120 and 130 may perform one-to-one or many-to-many instant message communications as well as one-to-one instant message communications, and server 115 provides such instant messaging services. can do.
  • a user (first user) of user device 120 may enter an instant message to send to user (second user) of user device 130 using user device 120.
  • the instant message may be a voice message entered by the first user.
  • the first user may input a voice message to the user device 120 for a predetermined time.
  • the user device 120 may perform voice recognition on the voice message to generate a voice recognized text message, and various voice recognition techniques well known in the art according to the present disclosure may be used.
  • the user device 120 may transmit a voice message and a voice recognized text message corresponding to the voice message to the user device 130 by an input of the first user. Voice and text messages may be sent from the user device 120 to the user device 130 through the server 115 or directly from the user device 120 to the user device 130.
  • the user device 120 may combine two or more voice messages input by the first user and transmit them in one voice message. In some other examples, the user device 120 may combine one or more voice messages input by the first user and one or more text messages input by the first user and send them into one instant message.
  • a character for the first user and a character for the second user may be displayed.
  • the user device 120 may transmit character information about the character for the first user while transmitting the instant message to the user device 130.
  • the character information may include information on at least one of the type of the character of the first user, the expression of the character, or the operation of the character.
  • the user device 120 may obtain face information of the first user using a device such as an image / video camera, a depth camera, and the like, and based on the acquired face information, a character for the first user. Can determine the facial expression.
  • the facial expression of the character for the first user may be determined while the first user is entering an instant message.
  • the facial expression of the character may be selected by the first user.
  • the first user may select an action of the character from a list of predetermined actions.
  • the user device 120 may recognize the characters associated with the character's movement from the voice recognized text message or typed text message. The first user may select an action of the character by selecting the recognized character.
  • the first user may check a voice recognized text message, a facial expression and motion of the character, and determine the transmission of the voice message.
  • the server 115 may receive the instant message received from the user device 120. For example, voice messages, typed text messages, etc.).
  • the server 115 may store the voice message corresponding to the voice recognized text message.
  • the server 115 may receive character information from the user device 120 along with the instant message and store it in correspondence with the received instant message.
  • server 115 may send a notification to user device 130, and the user device ( 130 may receive a notification message for an instant message received from the user device 120 from the server 115.
  • the user device 130 may access the server 115.
  • the user device 130 may receive from the user device 120 one or more voice messages including voice messages input by the first user and one or more voice recognized text messages respectively corresponding to the one or more voice messages. Can be received.
  • the notification may include an indication of one or more voice messages to the user device 130 including the voice message received from the user device 120. have.
  • user device 130 may store therein one or more voice messages and corresponding one or more voice recognized text messages. In some examples, user device 130 may store the received one or more voice messages and the voice recognized text message in correspondence.
  • user device 130 may receive a voice message playback request from a user (second user) to play the received one or more voice messages.
  • the user device 130 may sequentially play one or more received voice messages in response to the voice message play request.
  • the voice message can be played back asynchronously with the reception of the voice message.
  • playback of the voice message may be controlled to pause, stop, or play the previous voice message or the next voice message at the request of the user.
  • the user device 130 may display one or more voice recognized text messages in response to playback of the one or more voice messages.
  • the user device 130 may filter the one or more voice recognized text messages based on a predetermined censoring condition.
  • predetermined censorship conditions may include, but are not limited to, abusive language, slang, and the like.
  • the user device 130 may play one or more voice messages based on the result of the filtering. For example, if a slang is included in the voice recognized text message, the user device 130 may mute and reproduce at least a part of the corresponding voice message.
  • This filtering process is not limited to what the user device 130 performs, and in some embodiments, the server 115 may perform the filtering process before sending the one or more voice messages and the corresponding one or more voice recognized text messages. have.
  • user device 130 may delete one or more voice messages based on a predetermined condition. In some examples, the user device 130 may delete the corresponding voice message when playback ends. In some other examples, user device 130 may delete the old voice message based on a predetermined storage capacity condition.
  • the user device 130 may receive character information for the character of the first user from the server 115 in addition to receiving one or more voice messages. In some examples, when the user device 130 sequentially plays one or more voice messages, the user device 130 may display the character of the first user based on the received character information. In some examples, the user device 130 may display a character whose expression, motion, or the like of the character of the first user is controlled based on the character information.
  • FIG. 2 illustrates an example of using an instant message service according to the present disclosure
  • FIG. 3 illustrates an example of displaying and playing a message on a user's mobile device when using the instant message service according to FIG. 2.
  • the first user 210, the second user 220, and the third user 230 are respectively connected to the user device 212, the user device 222, and the user device 232.
  • the users 210, 220, and 230 may share an instant message transmitted and received at the request of at least one of the users 210, 220, and 230.
  • the first user 210 transmits an instant message the second user 220 and the third user 230 may receive the instant message.
  • the first user 210 and the second user 220 may transmit a voice message.
  • first user 210 may select character 216 and second user 220 may select character 226.
  • the first user 210 may input a voice message 214 that is "not cold?"
  • the user device 212 may detect the facial expression of the first user 210 and may determine the facial expression of the character 216.
  • the user device 212 can obtain the speech recognized text message 214-2 by performing speech recognition on the voice message 214.
  • the first user 210 selects one of a list of predetermined actions or removes a character (eg, “cold” is recognized) that the user device 212 recognizes from the voice recognized text message 214-2. 1
  • the user 210 can determine the operation of the character 216 by selecting.
  • the voice message 214 may then be sent towards the user devices 222, 232 of the second and third users 220, 230.
  • the second user 220 may input the voice message 224 "I'm hot!"
  • the user device 222 may detect an expression of the second user 220 and may determine an expression of the character 226.
  • the user device 222 may obtain the voice recognized text message 224-2 from the voice message 224.
  • the second user 220 selects one of a list of predetermined actions, or selects a character recognized by the user device 222 from the voice recognized text message 224-2 (eg, “hot” is recognized).
  • the operation of the character 226 may be determined by the selection of the second user 220.
  • the voice message 224 may then be sent towards the user devices 212, 232 of the first and third users 210, 230.
  • the third user 230 may access a server (not shown) that provides an instant message service using the user device 232, and the user device 232 may have a voice message 214 and a voice message 224. ) Can be received.
  • the user device 232 may receive the voice recognized text messages 214-2, 224-2.
  • the voice recognized text messages 214-2 and 224-2 may be displayed in response to playback of the voice messages 214 and 224.
  • the third user 230 may input a voice message playback request using the user interface 240 displayed on the user device 232.
  • a voice message reproduction request is input, as shown in Fig. 3A, the voice message 214 is reproduced with the display of the character 216.
  • voice recognized text message 214-2 may be displayed in response to playback of voice message 214.
  • the character 216 may show facial expressions and actions determined by the user device 212 of the first user 210.
  • the voice message 224 is played along with the display of the character 226.
  • the voice recognized text message 224-2 may be displayed in response to the reproduction of the voice message 224.
  • the character 226 may show facial expressions and actions determined by the user device 222 of the second user 220.
  • the third user 230 may be configured to receive instant messages 214-2 and 224-2 received from the first user 210 and the second user 220.
  • An example displayed sequentially is shown.
  • instant messages transmitted by the user device 232 of the third user 330 may also be sequentially displayed based on the transmission time and / or the reception time.
  • the instant message service providing apparatus 500 may include a communication module 510, a user interface module 520, a voice playback module 530, a display module 540, a memory module 550, and a voice.
  • Recognition module 570 may be included.
  • the instant message service providing apparatus 500 may further include a message filter 560, a text recognition module 580, a camera module 590-1, and an facial expression determination module 590-2.
  • Components included in the instant message service providing apparatus 500 may be individually implemented or two or more of the components may be combined to form one component.
  • the instant message service providing apparatus 500 transmits a voice message and a voice recognized text message, while the text message received from an external computing device such as the instant message service providing apparatus 500. Can be configured to display and play the voice message at the request of the user.
  • the instant message service providing device 500 may be a variety of computing devices, such as, for example, smartphones, tablet computers, desktop computers, laptop computers, mobile phones, personal digital assistants (PDAs), special purpose devices, or any of the above functions. Small form factor portable (mobile) electronic devices, such as including fusion devices.
  • the communication module 510 may be configured to connect the instant message service providing apparatus 500 to a server.
  • the communication module 510 may be configured to receive one or more voice messages and one or more voice recognized text messages corresponding to one or more voice messages, respectively, from an external computing device.
  • communication module 510 may receive one or more voice messages and corresponding one or more voice recognized text messages from an external computing device via a server.
  • communication module 510 may receive a notification from a server. The notification may indicate that one or more voice messages have been sent.
  • the communication module 510 receives one or more voice messages and corresponding one or more voice recognized text messages directly from an external computing device, such as in a manner such as a peer to peer connection, based on this notification. can do.
  • the user interface module 520 may be configured to receive a voice message playing request for the received one or more voice messages from the user.
  • the voice playing module 530 may be configured to sequentially play one or more voice messages received by the communication module 510 in response to a voice message playing request received by the user interface module 520.
  • the user interface module 520 may receive various requests from the user for pausing, stopping, playing the previous voice message, or playing the next voice message, as required. ) May play the voice message according to the request received by the user interface module 520.
  • the display module 540 may be configured to display one or more voice recognized text messages corresponding to one or more voice messages, respectively, in response to playback of the one or more voice messages.
  • the memory module 550 may be configured to store one or more voice messages received by the communication module 510 and corresponding one or more voice recognized text messages. In some examples, the memory module 550 may store one or more received voice messages and voice recognized text messages in correspondence. In some embodiments, the memory module 550 may delete one or more voice messages based on a predetermined condition. In some examples, the memory module 550 may delete the corresponding voice message stored in the memory module 550 when the playback of the voice message ends. In some other examples, the memory module 550 may delete the old voice message based on the predetermined storage capacity condition.
  • the memory module 550 may delete the oldest voice message if the total capacity of the stored voice message exceeds a predetermined value. In another example, the memory module 550 may delete the voice message when the voice message, for which playback ends, exceeds a predetermined value.
  • the message filter 560 may filter the one or more voice recognized text messages based on a predetermined censoring condition before the voice playback module 530 plays the received one or more voice messages.
  • the censoring method by the message filter 560 may use a well-known censoring method for a text message.
  • the predetermined censorship condition may include, but is not limited to, abusive language, slang, and the like.
  • the voice reproduction module 530 may reproduce one or more voice messages based on the filtering result of the message filter 560. For example, when a slang is included in a voice recognized text message, the voice reproducing module 530 may mute the corresponding voice message.
  • the communication module 510 may receive character information about the character of the sender from the server while receiving one or more voice messages.
  • the display module 540 may display the sender's character based on the received character information.
  • the display module 540 may display a character whose expression, motion, or the like is controlled based on the character information.
  • the user interface module 520 may be configured to receive a voice message (hereinafter, “input voice message”) input by a user of the instant message service providing apparatus 500.
  • user interface module 520 may receive an input voice message for a predetermined time.
  • the user interface module 520 may generate one input voice message by combining two or more input voice messages received for a predetermined time at a user's request.
  • the voice recognition module 570 may generate a voice recognized text message by performing voice recognition on the input voice message received by the user interface module 520, and various voices well known in the art according to the present disclosure. Recognition techniques can be used. It is also possible for a user to type a text message through the user interface module 520.
  • the communication module 510 may transmit an input voice message and a corresponding voice recognized text message to an external computing device. The voice recognized text message corresponding to the input voice message may be transmitted to the external computing device or directly to the external computing device through the server.
  • the display module 540 may display a character of the user of the instant message service providing apparatus 500.
  • the communication module 510 may transmit character information about the character of the user.
  • the character information may include information about at least one of a type of a character of a user, an expression of a character, or an operation of the character.
  • text recognition module 580 may recognize text associated with the movement of the character from a speech recognized text message or typed text message generated by speech recognition module 570. The user can select an action of the character by selecting the recognized character. In some other examples, the user may select an action of the character from the list of predetermined actions of the character via the user interface module 520.
  • the camera module 590-1 may acquire face information of the user using a device such as an image / video camera, a depth camera, or the like.
  • the facial expression determination module 590-2 may determine the facial expression of the character of the user based on the face information obtained by the camera module 590-1.
  • the user interface module 520 receives an instant message, such as a voice message or text message
  • the camera module 590-1 obtains face information of the user
  • the facial expression determination module 590-2 The facial expression of the character may be determined based on the acquired face information.
  • the user may determine the transmission of an instant message, such as a voice message, after confirming a text message, a facial expression, an action, or the like appearing on the display module 540.
  • an instant message such as a voice message
  • FIG. 6 and 7 are flowcharts illustrating example processes 600 and 700 for a method of providing instant message services, in accordance with at least some embodiments of the present disclosure.
  • 6 illustrates a process 600 for receiving an instant message
  • FIG. 7 illustrates a process 700 for sending an instant message.
  • the processes 600 and 700 may be performed under the control of a computing device such as the user device 120, 130 of FIG. 1, or the instant message service providing device 500 of FIG. 5.
  • the process 600 shown in FIG. 6 may include one or more operations, functions or actions as illustrated by blocks 610, 620, 630, 640 and / or 650.
  • FIGS. 6 and 7 may include one or more operations, functions, or actions as illustrated by blocks 710, 720, 730, and / or 740.
  • the various blocks are not intended to be limited to the described embodiments.
  • those skilled in the art will appreciate that, for the present processes disclosed herein, the functions performed in the processes and methods may be implemented in a different order.
  • the schematic operations illustrated in FIGS. 6 and 7 are provided by way of example only, and some of the operations may be optional, may be combined in fewer operations, or extended to additional operations without departing from the spirit of the disclosed embodiment. Can be.
  • the process 600 shown in FIG. 6 begins at block 610 connecting to a server.
  • the computing device may connect to the server.
  • Process 600 may continue to block 620 to receive one or more voice messages and one or more voice recognized text messages at block 610.
  • the computing device may receive one or more voice messages and one or more voice recognized text messages corresponding to each of the one or more voice messages from the sender's external computing device.
  • the server may send a notification for one or more voice messages received from the sender, and the computing device may receive such a notification from the server.
  • the computing device may then receive one or more voice messages and corresponding one or more voice recognized text messages that appear in the notification directly from an external computing device or through a server.
  • Process 600 may continue at block 620 with block 630 storing the received one or more voice messages and one or more voice recognized text messages.
  • the computing device may store therein one or more voice messages and corresponding one or more voice recognized text messages therein. In some examples, the computing device may store the received one or more voice messages and the one or more voice recognized text messages in correspondence. Process 600 may continue to block 640 to receive a voice message playback request from a user at block 630.
  • the computing device may receive a voice message playback request from the user to play the received one or more voice messages.
  • Process 600 may continue to block 650 to sequentially play one or more voice messages at block 640.
  • the computing device may sequentially play one or more voice messages received in response to the voice message playback request.
  • playback of the voice message may be controlled to pause, stop, or play the previous voice message or the next voice message at the request of the user.
  • the computing device may delete the voice message for which playback has ended based on a predetermined condition.
  • the computing device may play one or more voice messages sequentially while displaying the sender's character based on the character information.
  • the computing device may filter the one or more voice recognized text messages based on the predetermined censoring condition before performing block 650. Thereafter, at block 650, the computing device may play one or more voice messages based on the result of the filtering. For example, when a slang is included in the voice recognized text message, the computing device may mute and reproduce at least a portion of the corresponding voice message.
  • Process 700 shown in FIG. 7 may begin at block 710 for receiving an input voice message from a user.
  • a user can enter a voice message to be sent using the computing device.
  • a user may enter a voice message into the computing device, for example, for a predetermined time, and the computing device may receive this input voice message.
  • Process 700 may continue to block 720 to obtain a voice recognized input text message at block 710.
  • the computing device may perform voice recognition on the input voice message to generate a voice recognized input text message.
  • Speech recognition for input voice messages can use a variety of known techniques.
  • the computing device may display a voice recognized input text message and the user may confirm the displayed message.
  • Process 700 may continue to block 730 to obtain character information at block 720.
  • the computing device may obtain character information for the character of the user to be displayed on the instant message service.
  • the computing device may obtain facial information of the user using an accessory device connected to the computing device, such as an image / video camera, a depth camera, and the like, and based on the acquired facial information, the facial expression of the character to the user Can be determined.
  • the facial expression of the character may be selected by the user.
  • the computing device may obtain gesture information of the character selected from the list of gestures predefined by the user.
  • the computing device may recognize a character associated with the movement of the character from the voice recognized input text message, and when the user selects one of the recognized characters, the computing device may retrieve the movement information of the character corresponding to the character. Can be obtained.
  • the character information may include not only the type of character but also the expression of the character and the operation of the character.
  • Process 700 may continue from block 730 to block 740 where the computing device may transmit a voice message, a voice recognized input text message and character information to an external computing device.
  • signal bearing media 802 of one or more computer program products 700 may include computer readable media 806, recordable media 808, and / or communication media 810.
  • the instructions 804 included in the signal bearing medium 802 may be executed by a computing device such as the user device 120, 130 shown in FIG. 1 and / or the instant message service providing device shown in FIG. 5.
  • the instruction 804 may, when executed, provide an instant message service for the first user in accordance with the present disclosure.
  • the instructions 804 may include one or more instructions for receiving one or more voice messages including a voice message sent by the computing device of the second user and one or more voice recognized text messages respectively corresponding to the one or more voice messages; One or more instructions for storing one or more voice messages and one or more voice recognized text messages; One or more instructions for receiving a voice message playing request from a first user; One or more instructions for reproducing the one or more voice messages sequentially received in response to the voice message playback request; And one or more instructions for displaying the one or more voice recognized text messages in response to playing of the one or more voice messages.
  • the instant message service providing server 900 may include a communication module 910, a character module 920, a voice memory 930, and a text memory 940.
  • the communication module 910 may receive instant message and character information, such as a voice message and / or text message, from the sender.
  • the communication module 910 may transmit a notification and / or an instant message for the instant message to the recipient of the instant message.
  • the communication module 910 may transmit a voice message, a voice recognized text message, and character information of the sender.
  • the character module 920 may store character information received from the sender, for example, information on a type, facial expression, motion, and the like of the sender's character.
  • the voice memory 930 may store a voice message received from the sender. In some examples, the voice message stored in the voice memory 930 may be deleted according to a predetermined condition.
  • the text memory 940 can store voice recognized text messages and typed text messages. In some examples, the text memory 940 may store the voice recognized text message corresponding to the voice message stored in the voice memory 930, and the character module 920 may store the character information in the voice message stored in the voice memory 930. And / or corresponding to the voice recognized text message or typed text message stored in the text memory 940.
  • the claimed subject matter is not limited in scope to the specific embodiments described herein.
  • some implementations may be in hardware, such as may be used to operate on a device or combination of devices, while other implementations may be in software and / or firmware, for example.
  • the claimed subject matter is not limited in scope in this respect, but some embodiments may include one or more articles, such as signal bearing media, storage media.
  • Such storage media such as CD-ROMs, computer disks, flash memory, etc., may be executed by computing devices such as, for example, computing systems, computing platforms, or other systems, for example, to claimed subject matter, such as one of the embodiments described above.
  • instructions may be stored that may cause the processor to execute.
  • the computing device may comprise one or more processing units or processors, one or more input / output devices such as displays, keyboards and / or mice, and static random access memory, dynamic random access memory, flash memory and / or hard drives. It may include one or more of the same memory.
  • the implementer may primarily choose hardware and / or firmware means; if flexibility is paramount, the implementer may choose a software implementation primarily; Or, as another alternative, the implementer may choose any combination of hardware, software and / or firmware.
  • aspects of the embodiments of the present disclosure may include one or more computer programs running on one or more computers (eg, one or more programs running on one or more computer systems), one running on one or more processors.
  • Software, and / or firmware that may be implemented in integrated circuits, in whole or in part, as one or more programs (eg, one or more programs running on one or more microprocessors), firmware, or substantially any combination thereof It will be appreciated that the writing of code for and / or the design of circuitry is within the skill of one of ordinary skill in the art in light of this disclosure.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Tourism & Hospitality (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Information Transfer Between Computers (AREA)
  • User Interface Of Digital Computer (AREA)
  • Operations Research (AREA)

Abstract

L'invention concerne un procédé de fourniture d'un service de messagerie instantanée. Un procédé décrit à titre d'exemple peut être réalisé sous le contrôle d'un dispositif informatique d'un premier utilisateur. Le procédé peut comporter les étapes consistant à: recevoir un ou plusieurs messages vocaux, qui comprennent un message vocal émis au moyen d'un dispositif informatique d'un second utilisateur, et un ou plusieurs messages textuels issus d'une reconnaissance vocale correspondant respectivement audit ou auxdits messages vocaux; recevoir une demande de lecture de messages vocaux de la part du premier utilisateur; et, en réponse à la demande de lecture de messages vocaux, lire un ou plusieurs messages vocaux qui ont été reçus séquentiellement.
PCT/KR2018/010172 2018-08-27 2018-08-31 Dispositif, procédé et support d'enregistrement lisible par ordinateur de fourniture de service de messagerie instantanée asynchrone WO2020045712A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2018-0100210 2018-08-27
KR1020180100210A KR20200023814A (ko) 2018-08-27 2018-08-27 비동기적 인스턴트 메시지 서비스를 제공하기 위한 장치, 방법 및 컴퓨터 판독가능 저장 매체

Publications (1)

Publication Number Publication Date
WO2020045712A1 true WO2020045712A1 (fr) 2020-03-05

Family

ID=69645007

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2018/010172 WO2020045712A1 (fr) 2018-08-27 2018-08-31 Dispositif, procédé et support d'enregistrement lisible par ordinateur de fourniture de service de messagerie instantanée asynchrone

Country Status (2)

Country Link
KR (1) KR20200023814A (fr)
WO (1) WO2020045712A1 (fr)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102368300B1 (ko) * 2020-09-08 2022-03-02 박일호 음성 및 표정에 기반한 캐릭터의 동작 및 감정 표현 시스템
CN117014397A (zh) * 2022-09-02 2023-11-07 腾讯科技(深圳)有限公司 基于语音消息的交互方法、装置、计算机设备和存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010035529A (ko) * 2001-02-27 2001-05-07 이병관 음성 캐릭터 메시지 전송방법, 음성캐릭터 메시징 서비스시스템
JP2010507353A (ja) * 2006-10-18 2010-03-04 ソニー オンライン エンタテインメント エルエルシー オーバラップするメディアメッセージを調整するシステム及び方法
KR20100129122A (ko) * 2009-05-28 2010-12-08 삼성전자주식회사 텍스트 기반 데이터를 애니메이션으로 재생하는 애니메이션 시스템
KR20120107293A (ko) * 2011-03-21 2012-10-02 김주연 발신자 또는 수신자의 선택에 의한 캐릭터, 음성 기반의 메시지 전송시스템 및 전송방법
KR20140107736A (ko) * 2013-02-26 2014-09-05 에스케이플래닛 주식회사 음성 메시지 제공 방법, 이를 위한 장치 및 시스템

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010035529A (ko) * 2001-02-27 2001-05-07 이병관 음성 캐릭터 메시지 전송방법, 음성캐릭터 메시징 서비스시스템
JP2010507353A (ja) * 2006-10-18 2010-03-04 ソニー オンライン エンタテインメント エルエルシー オーバラップするメディアメッセージを調整するシステム及び方法
KR20100129122A (ko) * 2009-05-28 2010-12-08 삼성전자주식회사 텍스트 기반 데이터를 애니메이션으로 재생하는 애니메이션 시스템
KR20120107293A (ko) * 2011-03-21 2012-10-02 김주연 발신자 또는 수신자의 선택에 의한 캐릭터, 음성 기반의 메시지 전송시스템 및 전송방법
KR20140107736A (ko) * 2013-02-26 2014-09-05 에스케이플래닛 주식회사 음성 메시지 제공 방법, 이를 위한 장치 및 시스템

Also Published As

Publication number Publication date
KR20200023814A (ko) 2020-03-06

Similar Documents

Publication Publication Date Title
US11405678B2 (en) Live streaming interactive method, apparatus, electronic device, server and storage medium
CN109921976B (zh) 一种基于群组的通信控制方法、装置及存储介质
JP6910300B2 (ja) チャット履歴記録を表示するための方法およびチャット履歴記録を表示するための装置
US11061641B2 (en) Screen sharing system, and information processing apparatus
WO2016129811A1 (fr) Procédé et système de présentation d'un menu riche dans un service de messagerie instantanée et support d'enregistrement
WO2015133777A1 (fr) Procédé et dispositif de fourniture d'un service de réseau social
CN107317689B (zh) 一种消息处理方法及电子设备、计算机存储介质
WO2020045712A1 (fr) Dispositif, procédé et support d'enregistrement lisible par ordinateur de fourniture de service de messagerie instantanée asynchrone
JP2023516449A (ja) 情報処理方法、装置及び記憶媒体
CN110989889A (zh) 信息展示方法、信息展示装置和电子设备
WO2018182223A1 (fr) Systèmes et procédés de distribution de notifications
CN112328094A (zh) 信息输入方法、云端输入法系统和客户端
WO2014058153A1 (fr) Système de service d'informations de carnets d'adresses, et procédé et dispositif pour service d'informations de carnets d'adresses dans celui-ci
WO2018182063A1 (fr) Dispositif, procédé et programme d'ordinateur fournissant un appel vidéo
WO2019221385A1 (fr) Procédé permettant de faire fonctionner une application de messagerie
WO2015102125A1 (fr) Système et procédé de conversation de texto
WO2019031621A1 (fr) Procédé et système permettant de reconnaître une émotion pendant un appel téléphonique et d'utiliser une émotion reconnue
WO2022092439A1 (fr) Procédé de fourniture d'image d'élocution et son dispositif informatique d'exécution
WO2015183043A1 (fr) Procédé, dispositif et serveur pour grouper des messages de conversation
WO2014171613A1 (fr) Procédé de prestation de service de messagerie, support d'enregistrement enregistré avec un programme afférent et terminal correspondant
WO2015037871A1 (fr) Système, serveur et terminal permettant de fournir un service de lecture vocale au moyen d'une reconnaissance de textes
WO2012057561A2 (fr) Système et procédé pour fournir un service de messagerie instantanée, et terminal de communication et procédé de communication associés
WO2020067597A1 (fr) Dispositif, procédé et support d'enregistrement lisible par ordinateur pour la fourniture d'un service de messagerie instantanée asynchrone
CN113542257B (zh) 视频处理方法、视频处理装置、电子设备和存储介质
CN112968826B (zh) 语音交互方法、装置和电子设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18932160

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18932160

Country of ref document: EP

Kind code of ref document: A1