Detailed Description
Fig. 1 is a schematic block diagram of a communication system according to an embodiment of the present invention. Fig. 2 is a schematic diagram illustrating an operating environment of a communication system according to an embodiment of the invention. Referring to fig. 1 and fig. 2, a communication system 100 according to an embodiment of the present invention includes a server 110 and at least one speaker device 120. At least one speaker 120 is connected to the server 110 through the internet.
In an embodiment, at least one speaker device 120 includes a first speaker device 121 and a second speaker device 122, the first speaker device 121 has a first identification ID1 and the second speaker device 122 has a second identification ID2, but the invention is not limited thereto. In other embodiments, at least one speaker device 120 may include only one speaker device or more than two speaker devices, and each speaker device has an identification mark.
Fig. 3 is a schematic block diagram of a speaker device according to an embodiment of the present invention. Referring to fig. 3, the speaker device (e.g., the first speaker device 121) includes a communication component CM, a speaker component SPK, an input component MPH, and an identification mark (e.g., the first identification mark ID 1). The communication element CM is a wireless network module such as a Wi-Fi module, for example, and is used to connect the speaker to the server 110 through the internet. The speaker component SPK is a speaker such as a loudspeaker, and is used for playing voice information. The input element MPH includes a voice input element such as a microphone, for example, for receiving voice information. In some embodiments, the input element MPH may further include a physical or virtual key, etc., through which a user may select or input a signal. The identification mark is, for example, a bar code or a two-dimensional code, and each speaker corresponds to a specific identification mark. The communication element CM is coupled to the speaker element SPK and the input element MPH, and is configured to send the voice message received by the input element MPH or play the voice message received by the communication element CM through the speaker element SPK. The identification mark is displayed on the appearance (e.g., on the housing) of the speaker device, for example.
Each speaker device may be implemented as, for example, an intelligent home appliance having a voice function, a child toy, or the like. In one embodiment, the speaker device (e.g., the first speaker device 121 or the second speaker device 122) is, for example, a bluetooth speaker including a microphone and a network function. In another embodiment, the speaker device (e.g., the first speaker device 121 or the second speaker device 122) is, for example, a teddy bear with intercom function and network function. In other words, the embodiments of the present invention do not limit the specific implementation manner of each speaker device.
Fig. 4 is a schematic block diagram of a server according to an embodiment of the present invention. Referring to fig. 4, the server 110 includes a database 111, a processor 113, and a communication device 115, wherein the database 111 and the communication device 115 are coupled to the processor 113.
The communication element 115 is, for example, a network module for connecting to the internet in a wired or wireless manner. The processor 113 is, for example, a Central Processing Unit (CPU), a system-on-chip (SOC), an application processor (application processor), a media processor (media processor), a microprocessor (microprocessor), a digital signal processor (digital signal processor), or other similar components with computing functions, and is used to manage the overall operation of the server 110. In one embodiment, the processor 113 further includes a natural speech recognition module capable of analyzing the received speech information. Based on the analysis result, the processor 113 can determine whether the voice message includes a specific word or not, or convert the voice message into text message. Those skilled in the art can obtain sufficient teaching about the speech recognition module and technology from the related documents, and thus the description thereof is omitted here.
The database 111 stores at least one speaker device 120 and an identification mark corresponding to each speaker device. In one embodiment, at least the first speaker 121 corresponding to the first identification ID1 and the second speaker 122 corresponding to the second identification ID2 are recorded in the database 111.
In an embodiment, the first electronic device ED1 and the second electronic device ED2 are similar devices such as a personal computer, a desktop computer, a notebook computer, a smart phone, a tablet computer, and the like. At least one speaker 120 is located in the same or different homes, respectively. The first electronic device ED1 and the second electronic device ED2 may be connected to one of the at least one speaker device 120 through the identification mark to communicate with the connected speaker device.
Fig. 5 is a flowchart illustrating a communication method according to an embodiment of the invention. In one embodiment, a parent (e.g., dad) holding the first electronic device ED1 communicates with a user (e.g., a child) of the first speaker device 121, for example, through the first identification ID 1. However, the present invention is not limited to this, and the first electronic device ED1 may communicate with a different speaker (e.g., the second speaker 122) in the communication system 100 by a different identifier (e.g., the second identifier ID 2).
Furthermore, in another embodiment, in addition to the first electronic device ED1, other electronic devices (e.g., the second electronic device ED2) may be included to communicate with the first speaker device 121 through the first identification ID 2. In other words, the present invention does not limit the number of electronic devices that communicate with the first speaker device 121 through the first identification ID2 at the same time. The steps of the communication method of the embodiment of the present invention are described in detail below.
Referring to fig. 5, in step S510, the server 110 receives at least one login information including the first identification ID1 from at least one electronic device 120.
In one embodiment, the first identifier ID1 is, for example, a barcode. After scanning the first ID1, the first electronic device ED1 logs in the server 110 with the first ID1 to establish a connection between the first electronic device ED1 and the first speaker 121. In detail, the first electronic device ED1 transmits first login information to the server 110, wherein the first login information includes a first identifier ID1 and first identity information, and the first identity information includes an identity (e.g., dad) of a login user, which may correspond to the first electronic device ED 1. The server 110 receives the first login information to establish a connection between the first speaker device 121 and the first electronic device ED 1.
In another embodiment, in addition to the first electronic device ED1, after the second electronic device ED2 scans the first identifier ID1, it can log in the server 110 with the first identifier ID1 to establish a connection between the second electronic device ED2 and the first speaker device 121. In detail, the second electronic device ED2 transmits second login information to the server 110, wherein the second login information includes the first identifier ID1 and second identity information, and the second identity information includes an identity of a login user (e.g., mom), which may correspond to the second electronic device ED 2. The server 110 also receives the second login information to establish a connection between the first speaker 121 and the second electronic device ED 2. Therefore, the first electronic device ED1 and the second electronic device ED2 log in the server 110 with the first login information and the second login information, respectively, to establish a connection with the first speaker 121.
In step S530, the speaker device corresponding to the first identification ID1 receives the first voice message and transmits the first voice message to the server 110.
In one embodiment, when the first speaker 121 corresponding to the first identification ID1 receives the first voice message, the first voice message is transmitted to the server 110.
Subsequently, in step S550, the server 110 transmits the first message to the at least one electronic device according to the first voice message and the received login message. In one embodiment, the first information transmitted by the server 110 is first voice information. In another embodiment, the server 110 may also convert the first voice message into the first text message by using a natural voice recognition technique after receiving the first voice message, and then transmit the first text message to the electronic device. In other words, in this embodiment, the first message is the first text message after the first voice message is converted.
In one embodiment, the server 110 analyzes the received first voice message to determine whether the first voice message indicates to be transmitted to a specific subscriber. For example, if the analysis result of the server 110 indicates that the first voice information indicates that it corresponds to a subscriber (e.g., dad) of the first identity information, the server 110 will transmit the first information to the first electronic device ED 1; if the analysis result of the server 110 indicates that the first voice information is a login user (e.g., mom) corresponding to the second identity information, the server 110 will transmit the first information to the second electronic device ED 2. The embodiment of the present invention is not limited to the specific implementation manner of analyzing the voice information and determining the identity information corresponding to the voice information, and those skilled in the art can make sufficient suggestions from the knowledge related to voice recognition to accomplish this.
In another embodiment, the server 110 transmits the first information to all electronic devices connected to the first speaker 121 according to all the received login information. When only the first electronic device ED1 logs in the server 110 with the first login information to establish a connection with the first speaker device 121, the server 110 will transmit the first information to the first electronic device ED1, and when the first electronic device ED1 and the second electronic device ED2 respectively log in the server 110 with the first login information and the second login information to establish a connection with the first speaker device 121, the server 110 can transmit the first information to the first electronic device ED1 and the second electronic device ED 2.
In particular, in some embodiments, the server 110 may also have a portion of the electronic device preset, for example, to be a delivery object of the first information. For example, the first speaker device 121 may issue a prompt message at a specific time or day (e.g., christmas) to prompt a user (e.g., a child) of the first speaker device 121 to make a wish, and the user's wish through the first speaker device 121 will only be transmitted to the first electronic device ED1 and the second electronic device ED2 corresponding to the first identity information (e.g., dad) and the second identity information (e.g., mom). At this time, even if another electronic device logs in the server 110 and is connected to the first speaker device 121, the user's desire is not transmitted to the electronic devices corresponding to other identification information except the first electronic device ED1 and the second electronic device ED 2.
It should be noted that the sending time of the prompt message may be set by the server 110, and the prompt message is sent to the first speaker 121 when the sending time is up. On the other hand, if an additional timer is provided in the first speaker device 121, the setting for sending the prompt message may be automatically completed in the first speaker device 121, and the present invention is not limited thereto.
The first voice message received by the first speaker 121 after continuing the prompt message is regarded as the wishing message corresponding to the prompt message. Therefore, after the first speaker device 121 transmits the first voice message to the server 110, the server 110 transmits the first voice message or the converted first text message to the first electronic device ED1 and the second electronic device ED 2. In other words, the server 110 can transmit the user's desire to the first electronic device ED1 and the second electronic device ED2 in the form of voice message or text message.
The embodiment shown in fig. 5 is used to describe a manner in which the speaker device receives the voice message and then transmits the voice message or text message to the electronic device. However, in addition to receiving voice information or text information, after the electronic device (e.g., the first electronic device ED1 or the second electronic device ED2) logs in the server 110 to connect to the speaker device (e.g., the first speaker device 121), the electronic device may also transmit or reply information to the speaker device by voice or text, and the speaker device may play the received information by voice.
Fig. 6 is a flowchart illustrating a communication method according to an embodiment of the invention. In one embodiment, the first electronic device ED1 replies to the first speaker device 121 after receiving the first message, for example, by means of a text message. In another embodiment, in addition to the first electronic device ED1, the second electronic device ED2 can also transmit information to the first speaker 121 in a similar manner, and the invention is not limited thereto.
Referring to fig. 6, in step S610, the server 110 receives the second text message from the at least one electronic device and converts the second text message into a second voice message.
In one embodiment, the user (e.g., dad) inputs the information (e.g., the second text information) to be transmitted to the first speaker device 121 by the first electronic device ED1 as text information, and transmits the input information to the server 110. After receiving the second text message, the server 110 converts the second text message into a second voice message by using text-to-speech (TTS) technology. Those skilled in the art will be able to fully understand the related technology of text-to-speech, and therefore, the detailed description thereof is omitted here.
In another embodiment, similarly, in addition to the first electronic device ED1, the user (e.g., mom) can also input the information (e.g., the third text information) to be transmitted to the first speaker 121 by the second electronic device ED2 in the form of text information, and then transmit the information to the server 110 after the input is completed. After receiving the third text message, the server 110 converts the third text message into a third voice message by using a text-to-voice technology.
Subsequently, in step S630, the server 110 transmits the second voice message to the speaker device corresponding to the first identifier ID1 according to the received login information.
In one embodiment, the source of the second text message is the first electronic device ED1, and the first electronic device ED1 logs in the server 110 with the first login information including the first identification ID1 to connect to the first speaker device 121. Therefore, the server 110 transmits the second voice message converted from the second text message to the first speaker device 121.
In another embodiment, the source of the third text message is the second electronic device ED2, and the second electronic device ED2 logs in the server 110 with the second login information including the first identification ID1 to connect to the first speaker device 121. Therefore, the server 110 also transmits the third voice message converted from the third text message to the first speaker device 121.
Finally, in step S650, the speaker device receiving the second voice message plays the received second voice message in response to the reception of the second voice message.
In one embodiment, after receiving the second voice message, the first speaker device 121 sends a notification in a voice manner to notify the user of the first speaker device 121 that the second voice message and the first identity information (e.g., dad) corresponding to the second voice message are received. Similarly, if the first speaker device 121 receives the third voice message, it sends a notification in a voice manner to notify the user of the first speaker device 121 that the third voice message and the second identity information (e.g., mom) corresponding to the third voice message are received. In another embodiment, the first speaker 121 may additionally be provided with a status light, and the status light is used to send a notification of receiving the second voice message or the third voice message. In other embodiments, the first speaker device 121 may also play the second voice message directly after receiving the second voice message without sending any notification.
In an embodiment, after the first speaker device 121 sends the notification corresponding to the second voice message, the user can select to play the second voice message by voice or by pressing a button through the input element MPH. In another embodiment, the first speaker device 121 can also issue a notification corresponding to the second voice message and the third voice message at the same time, and the user can select to play the second voice message or the third voice message by voice or pressing a button through the input element MPH.
In summary, the communication system and the communication method according to the embodiments of the present invention are implemented by using the identification mark on the speaker device, and the electronic device can establish a connection with the speaker device through the server. The server can convert the voice information from the speaker device into text information and transmit the text information to the electronic device, and can also convert the text information from the electronic device into voice information and transmit the voice information to the speaker device. Therefore, the user of the speaker device can face the speaker device and directly speak with the electronic device connected with the speaker device, and the electronic device is convenient to use and low in cost.
The above description is only for the preferred embodiment of the present invention, and it is not intended to limit the scope of the present invention, and any person skilled in the art can make further modifications and variations without departing from the spirit and scope of the present invention, therefore, the scope of the present invention should be determined by the claims of the present application.