CN112182170A - Remote interaction system - Google Patents
- Publication number
- CN112182170A (application number CN202010946916.1A)
- Authority
- CN
- China
- Prior art keywords
- voice
- information
- image information
- contact person
- standard
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3343—Query execution using phonetics
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4788—Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/142—Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
Abstract
The invention discloses a remote interaction system comprising a plurality of called clients, a voice module for acquiring voice information, a cloud server storing a plurality of contact data sets, a three-dimensional projection module, and a calling client. Each contact data set comprises standard contact information and standard image information. The cloud server receives the voice information and extracts the corresponding contact data set according to it, obtaining the standard contact information and standard image information. The three-dimensional projection module receives the standard image information, then converts and projects it to obtain a three-dimensional image. The calling client receives the standard contact information and establishes a first communication link with the corresponding called client according to that information, so that the calling client and the called client can transmit communication signals to each other. The invention enables three-dimensional remote communication, makes communication between people feel more realistic, and improves the user experience.
Description
Technical Field
The invention relates to the field of information interaction, in particular to a remote interaction system.
Background
At present, remote communication between people is generally carried out by telephone, video or voice, but these modes of communication lack a sense of reality. This is especially true for elderly people living alone who cannot see their children for long periods: such simple modes of communication do little to relieve their loneliness.
Disclosure of Invention
The present invention is directed to solving at least one of the problems of the prior art. Therefore, the invention provides a remote interaction system which can realize three-dimensional remote communication, improve the sense of reality of communication between people and improve the use experience of users.
A remote interaction system according to an embodiment of the present invention includes a plurality of called clients, a voice module, a cloud server, a three-dimensional projection module and a calling client. The cloud server stores a plurality of contact data sets; each contact data set comprises standard contact information and standard image information, and each item of standard contact information corresponds to one called client. The voice module acquires voice information. The cloud server receives the voice information and extracts the corresponding contact data set according to it, obtaining the standard contact information and the standard image information. The three-dimensional projection module receives the standard image information, then converts and projects it to obtain a three-dimensional image. The calling client receives the standard contact information and establishes a first communication link with the corresponding called client according to that information, so that the calling client and the called client transmit communication signals to each other through the first communication link.
The remote interaction system provided by the embodiment of the invention has at least the following beneficial effects. The voice module makes it easy to acquire the user's voice information, and the cloud server uses that voice information to extract the corresponding standard contact information and standard image information, which speeds up information extraction. The calling client and the called client can transmit communication signals to each other through the first communication link, enabling remote communication between the user and the client. At the same time, the three-dimensional projection module can project a three-dimensional image of the client, which makes communication between the user and the client feel more realistic and improves the user experience.
According to some embodiments of the present invention, the called client is further configured to establish a second communication link with the cloud server, and upload initial image information to the corresponding contact data set through the second communication link.
According to some embodiments of the present invention, the initial image information is provided with a contact tag, and the cloud server acquires the contact data set corresponding to the contact tag by identifying the contact tag, so that the initial image information is stored in the corresponding contact data set.
According to some embodiments of the present invention, the cloud server is further configured to perform normalization processing on the initial image information to obtain standard image information.
According to some embodiments of the present invention, the calling client is further configured to establish a third communication link with the cloud server, and upload initial image information to the corresponding contact data set through the third communication link.
According to some embodiments of the invention, the voice module comprises a voice acquisition unit, a voice processing unit and a voice recognition unit. The voice acquisition unit acquires initial voice; the voice processing unit performs conversion processing on the initial voice and transmits the resulting converted voice to the voice recognition unit for recognition, so as to obtain the voice information.
According to some embodiments of the invention, the system further comprises a camera module for acquiring the real-time dynamic state of the user. The camera module transmits the real-time dynamic state to the cloud server for storage, and the called client obtains the real-time dynamic state stored on the cloud server through the second communication link.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
FIG. 1 is a schematic diagram of a remote interactive system according to an embodiment of the present invention.
Detailed Description
The embodiments of the present invention will be further explained with reference to the drawings.
As shown in fig. 1, a remote interaction system according to an embodiment of the present invention includes a plurality of called clients, a voice module, a cloud server, a three-dimensional projection module and a calling client. The cloud server stores a plurality of contact data sets; each contact data set comprises standard contact information and standard image information, and each item of standard contact information corresponds to one called client. The voice module acquires voice information. The cloud server receives the voice information and extracts the corresponding contact data set according to it, obtaining the standard contact information and standard image information. The three-dimensional projection module receives the standard image information, then converts and projects it to obtain a three-dimensional image. The calling client receives the standard contact information and establishes a first communication link with the corresponding called client according to that information, so that the calling client and the called client transmit communication signals to each other through the first communication link.
For example, as shown in fig. 1, the person using the calling client is referred to as the user, and the person using the called client is referred to as the client. The voice module makes it easy to acquire the user's voice information, and the cloud server uses that voice information to extract the corresponding standard contact information and standard image information, which speeds up information extraction. A contact data set may be named after, and distinguished by, the name of the client, for example Xiaoming. The standard contact information may be the client's telephone number, IP number and the like, for example several telephone numbers belonging to Xiaoming. The standard image information consists of images of the client, for example several images of Xiaoming.
For example, if the content of the voice information received from the voice module is "Xiaoming", the cloud server searches the plurality of contact data sets, finds the contact data set belonging to Xiaoming, and thereby obtains the telephone numbers and images belonging to Xiaoming.
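The lookup in the Xiaoming example can be sketched as a keyed search over the stored contact data sets. This is only a minimal illustration: the `ContactDataSet` and `CloudServer` names, the in-memory dictionary, the case-insensitive matching and the sample telephone numbers and file name are all assumptions made for the sketch, not details given in the specification.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ContactDataSet:
    """One contact data set: standard contact information plus standard images."""
    name: str
    contact_info: list = field(default_factory=list)  # e.g. telephone or IP numbers
    images: list = field(default_factory=list)        # e.g. image file names


class CloudServer:
    """Hypothetical in-memory stand-in for the cloud server's contact storage."""

    def __init__(self) -> None:
        self._data_sets = {}

    def add(self, data_set: ContactDataSet) -> None:
        # Index by a case-folded name so lookups ignore letter case (an assumption).
        self._data_sets[data_set.name.casefold()] = data_set

    def extract(self, voice_information: str) -> Optional[ContactDataSet]:
        """Return the data set named by the recognized voice text, or None."""
        return self._data_sets.get(voice_information.strip().casefold())


server = CloudServer()
server.add(ContactDataSet("Xiaoming", ["555-0100", "555-0101"], ["xiaoming_1.png"]))

match = server.extract(" xiaoming ")
print(match.contact_info if match else "no such contact")
```

In this sketch the contact information returned by `extract` would go to the calling client and the images to the three-dimensional projection module.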
The calling client can transmit communication signals with the called client through the first communication link, so as to realize remote communication between the user and the client, wherein the remote communication mode can be telephone communication, voice communication and the like.
Meanwhile, the three-dimensional projection module can convert standard image information into a three-dimensional image through a holographic three-dimensional stereo projection technology and project the three-dimensional image, so that the sense of reality of communication between a user and a client is increased, and the use experience of the user is improved.
Holographic three-dimensional stereo projection is a 3D imaging technology that records and reproduces the light waves of an object using the principles of laser interference and diffraction. Its aim is to record all the information in the object's light waves and to present a 3D virtual image in space through diffraction and refraction of light. The technology can therefore give two-dimensional standard image information three-dimensional parameters and project the result into space, achieving stereo projection of a three-dimensional image.
In some embodiments of the present invention, the called client is further configured to establish a second communication link with the cloud server, and upload the initial image information to the corresponding contact data set through the second communication link.
Specifically, the called client can upload initial image information to the corresponding contact data set through the second communication link, so that the number of standard image information is increased, the user can obtain more types of three-dimensional images, and the experience of the user is improved.
In some embodiments of the present invention, the initial image information has a contact tag, and the cloud server obtains a contact data set corresponding to the contact tag by identifying the contact tag, so that the initial image information is stored in the corresponding contact data set.
Specifically, when the client uploads the initial image information, a contact tag is added to the initial image information, and the contact tag corresponds to the contact data set, so that the cloud server can conveniently store the initial image information into the corresponding contact data set by identifying the contact tag and searching the contact data set corresponding to the contact tag, and the accuracy of storing the initial image information is ensured.
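The tag-based routing described here can be illustrated with a short sketch. The `store_initial_image` function, the dictionary layout of an upload, and the names `Xiaohong`, `beach.png` and `x.png` are hypothetical; the specification does not say how an unmatched tag is handled, so rejecting the upload is an assumption.

```python
def store_initial_image(data_sets, image):
    """Store an uploaded image in the data set named by its contact tag.

    Returns False, and stores nothing, when the tag matches no data set.
    """
    tag = image.get("contact_tag")
    if tag not in data_sets:
        return False
    data_sets[tag].append(image["file"])
    return True


# Two contact data sets, each represented here only by its list of images.
data_sets = {"Xiaoming": [], "Xiaohong": []}

stored = store_initial_image(data_sets, {"contact_tag": "Xiaoming", "file": "beach.png"})
rejected = store_initial_image(data_sets, {"contact_tag": "Unknown", "file": "x.png"})
```

Keying the storage on the tag means the server never has to inspect the image itself to decide where it belongs.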
In some embodiments of the present invention, the cloud server is further configured to perform normalization processing on the initial image information to obtain standard image information.
Specifically, the normalization processing includes pixel normalization, specification normalization and the like. It makes the parameters of the initial image information consistent with those of the standard image information and eliminates differences between images, which improves the speed and accuracy with which the three-dimensional projection module converts and projects the standard image information.
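As a rough illustration of what such normalization might involve, the sketch below resamples a grayscale image to a fixed size and rescales its pixel values to a common range. It assumes that "specification" refers to image dimensions and that pixel normalization means linear rescaling to [0, 1]; the specification fixes neither, and the function names are hypothetical.

```python
def normalize_pixels(image):
    """Rescale grayscale pixel values linearly into the range [0.0, 1.0]."""
    lo = min(min(row) for row in image)
    hi = max(max(row) for row in image)
    span = (hi - lo) or 1  # avoid division by zero for a flat image
    return [[(p - lo) / span for p in row] for row in image]


def resize_nearest(image, width, height):
    """Resample to width x height using nearest-neighbour sampling."""
    src_h, src_w = len(image), len(image[0])
    return [
        [image[y * src_h // height][x * src_w // width] for x in range(width)]
        for y in range(height)
    ]


def normalize(image, width=2, height=2):
    """Bring an initial image to the standard size, then the standard value range."""
    return normalize_pixels(resize_nearest(image, width, height))


initial = [
    [0, 64, 128, 255],
    [0, 64, 128, 255],
    [10, 20, 30, 40],
    [10, 20, 30, 40],
]
standard = normalize(initial)
```

After this step every stored image has the same dimensions and value range, so the projection stage does not need per-image special cases.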
In some embodiments of the present invention, the calling client is further configured to establish a third communication link with the cloud server, and upload the initial image information to the corresponding contact data set through the third communication link.
Specifically, the calling client can upload the required initial image information to the corresponding contact data set through the third communication link, so that the number of standard image information is increased, the user can obtain more types of three-dimensional images, and the experience of the user is improved.
In some embodiments of the invention, the voice module comprises a voice acquisition unit, a voice processing unit and a voice recognition unit. The voice acquisition unit acquires initial voice; the voice processing unit performs conversion processing on the initial voice and transmits the resulting converted voice to the voice recognition unit for recognition to obtain the voice information.
Specifically, the voice acquisition unit collects the user's initial voice. Because the initial voice may come in many varieties, such as regional dialects, Mandarin or Cantonese, the voice processing unit applies natural language processing technology to convert the initial voice into a language that is easy to recognize.
Natural language processing is a field at the intersection of computer science, artificial intelligence and linguistics that concerns the interaction between computers and human natural language. Its aim is to let a computer understand and accept instructions that people give in natural language, and to translate from one language into another. The target language of the conversion can be set as required, for example converting a dialect into Mandarin. The specific conversion method is likewise not limited, as long as the conversion can be achieved.
The voice recognition unit contains a voice database. Using automatic speech recognition technology, the voice recognition unit analyzes the converted voice to obtain voice parameters, then compares those parameters with the data in the voice database to obtain the text closest to the converted voice. In other words, automatic speech recognition turns the converted voice into text, which makes it easier for the cloud server to recognize the content and improves the speed and accuracy of recognition on the cloud server.
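The comparison against the voice database can be pictured as a nearest-match search. The sketch below is a deliberately simplified stand-in: it assumes the voice parameters are short numeric feature vectors and uses Euclidean distance, neither of which the specification states, and the database contents and the `recognize` name are made up for illustration. A real automatic speech recognition system is far more involved.

```python
import math

# Hypothetical voice database: text label -> reference feature vector.
VOICE_DATABASE = {
    "Xiaoming": [0.9, 0.1, 0.3],
    "Xiaohong": [0.2, 0.8, 0.5],
}


def recognize(voice_parameters, database=VOICE_DATABASE):
    """Return the database text whose reference vector is closest (Euclidean)."""
    return min(database, key=lambda text: math.dist(voice_parameters, database[text]))


print(recognize([0.85, 0.15, 0.25]))  # prints "Xiaoming"
```

The returned text is what would be sent to the cloud server as the voice information.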
In some embodiments of the present invention, the system further includes a camera module for acquiring the real-time dynamic state of the user. The camera module transmits the real-time dynamic state to the cloud server for storage, and the called client obtains the real-time dynamic state stored on the cloud server through the second communication link.
Specifically, the camera module records the real-time dynamic state of the user and uploads the recording to the cloud server for storage. The called client can then obtain the real-time dynamic state stored on the cloud server through the second communication link, so that the client can readily follow the user's real-time dynamic state, which strengthens the exchange of information between the client and the user. The real-time dynamic state may take the form of real-time video, real-time images and the like.
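One way to picture the store-then-fetch flow is a bounded buffer on the server that the camera module appends to and a called client reads from. The `DynamicsStore` class, its capacity limit and the frame labels are hypothetical; the specification does not describe how the cloud server stores or serves the recordings.

```python
import time
from collections import deque


class DynamicsStore:
    """Hypothetical cloud-side buffer holding the most recent camera uploads."""

    def __init__(self, capacity=100):
        # A bounded deque drops the oldest entry automatically when full.
        self._frames = deque(maxlen=capacity)

    def upload(self, frame, timestamp=None):
        """Called on behalf of the camera module."""
        stamp = time.time() if timestamp is None else timestamp
        self._frames.append((stamp, frame))

    def latest(self):
        """Called on behalf of a called client; returns None if nothing is stored."""
        return self._frames[-1] if self._frames else None

    def __len__(self):
        return len(self._frames)


store = DynamicsStore(capacity=2)
for i, frame in enumerate(["frame-a", "frame-b", "frame-c"]):
    store.upload(frame, timestamp=i)
```

With a capacity of two, only the two most recent uploads remain, and `store.latest()` returns the newest one.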
Furthermore, this embodiment is particularly useful where the user of the calling client is an elderly person. Because the children cannot always be at the elderly person's side, they can obtain the real-time dynamic state stored on the cloud server through the called client, which lets them keep watch on the elderly person's condition at any time and helps prevent accidents.
Other constructions and operations of the remote interactive system according to the embodiments of the present invention are known to those skilled in the art and will not be described in detail herein.
The remote interaction system according to the embodiment of the present invention is described in detail below as a specific embodiment with reference to fig. 1. It is to be understood that the following description is only exemplary and does not specifically limit the invention.
As shown in fig. 1, a remote interaction system includes a plurality of called clients, a voice acquisition unit, a voice processing unit, a voice recognition unit, a cloud server, a three-dimensional projection module, a calling client and a camera module. The cloud server stores a plurality of contact data sets; each contact data set comprises standard contact information and standard image information, and each item of standard contact information corresponds to one called client.
The voice acquisition unit is used for acquiring initial voice, the voice processing unit is used for carrying out conversion processing on the initial voice, and transmitting the obtained converted voice to the voice recognition unit for recognition to obtain voice information. The cloud server is used for receiving the voice information and extracting a corresponding contact person data set according to the voice information to obtain standard contact person information and standard image information. The three-dimensional projection module is used for receiving the standard image information, and converting and projecting the standard image information to obtain a three-dimensional image; the calling client is used for receiving the standard contact information and establishing a first communication link with the corresponding called client according to the standard contact information so as to enable the calling client and the called client to mutually transmit communication signals through the first communication link.
The called client is also used for establishing a second communication link with the cloud server and uploading the initial image information with the contact person label to the cloud server through the second communication link; the calling client is also used for establishing a third communication link with the cloud server and uploading initial image information with the contact person label to the cloud server through the third communication link; the cloud server identifies the contact person tags, obtains contact person data sets corresponding to the contact person tags, normalizes the initial image information and stores the normalized initial image information into the corresponding contact person data sets.
The camera module acquires the real-time dynamic state of the user and transmits it to the cloud server for storage, and the called client obtains the real-time dynamic state stored on the cloud server through the second communication link.
The remote interaction system provided by this embodiment of the invention achieves at least the following effects. The voice acquisition unit, the voice processing unit and the voice recognition unit acquire the user's dialect speech, then convert and recognize it so that it is processed into text. The cloud server can then readily recognize the text content and search for the contact data set corresponding to it, obtaining the standard contact information and standard image information, which improves the speed and accuracy of information extraction.
The calling client side and the called client side can transmit communication signals mutually through the first communication link so as to realize remote communication between the user and the client, and meanwhile, the three-dimensional projection module can project three-dimensional images of the client, so that the sense of reality of communication between the user and the client is increased, and the use experience of the user is improved. In addition, the calling client and the called client can upload initial image information to the corresponding contact data set, so that the number of standard image information is increased, a user can obtain more types of three-dimensional images, and the experience of the user is improved.
This embodiment is particularly suitable where the user of the calling client is an elderly person: their children can obtain the real-time dynamic state stored on the cloud server through the called client, keep an eye on the elderly person's condition at any time, and so help prevent accidents.
In the description herein, references to the description of "one embodiment," "some embodiments," or "the embodiment" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
While embodiments of the invention have been shown and described, it will be understood by those of ordinary skill in the art that: various changes, modifications, substitutions and alterations can be made to the embodiments without departing from the principles and spirit of the invention, the scope of which is defined by the claims and their equivalents.
Claims (7)
1. A remote interactive system, comprising:
a plurality of called clients;
a voice module for acquiring voice information;
a cloud server storing a plurality of contact data sets, each contact data set comprising standard contact information and standard image information, and each item of standard contact information corresponding to one called client; the cloud server being configured to receive the voice information and extract the corresponding contact data set according to the voice information to obtain the standard contact information and the standard image information;
a three-dimensional projection module for receiving the standard image information, and converting and projecting the standard image information to obtain a three-dimensional image; and
a calling client for receiving the standard contact information and establishing a first communication link with the corresponding called client according to the standard contact information, so that the calling client and the called client transmit communication signals to each other through the first communication link.
2. A remote interaction system as claimed in claim 1, characterized in that: the called client is further configured to establish a second communication link with the cloud server and to upload initial image information to the corresponding contact data set through the second communication link.
3. A remote interaction system as claimed in claim 2, characterized in that: the initial image information is provided with a contact tag, and the cloud server identifies the contact tag to acquire the contact data set corresponding to the contact tag, so that the initial image information is stored in the corresponding contact data set.
4. A remote interaction system as claimed in claim 2, characterized in that: the cloud server is further configured to perform normalization processing on the initial image information to obtain standard image information.
5. A remote interaction system as claimed in claim 1, characterized in that: the calling client is further configured to establish a third communication link with the cloud server and to upload initial image information to the corresponding contact data set through the third communication link.
6. A remote interaction system as claimed in claim 1, characterized in that: the voice module comprises a voice acquisition unit, a voice processing unit and a voice recognition unit; the voice acquisition unit acquires initial voice, and the voice processing unit performs conversion processing on the initial voice and transmits the resulting converted voice to the voice recognition unit for recognition, so as to obtain the voice information.
7. A remote interaction system as claimed in claim 2, characterized in that: the system further comprises a camera module for acquiring the real-time dynamic state of the user; the camera module transmits the real-time dynamic state to the cloud server for storage, and the called client obtains the real-time dynamic state stored on the cloud server through the second communication link.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010946916.1A CN112182170A (en) | 2020-09-10 | 2020-09-10 | Remote interaction system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112182170A true CN112182170A (en) | 2021-01-05 |
Family
ID=73921692
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010946916.1A Pending CN112182170A (en) | 2020-09-10 | 2020-09-10 | Remote interaction system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112182170A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102413306A (en) * | 2011-11-21 | 2012-04-11 | 康佳集团股份有限公司 | 3D television-based three-dimensional video call method and 3D television |
CN107463248A (en) * | 2017-06-20 | 2017-12-12 | 昆明理工大学 | A kind of remote interaction method caught based on dynamic with line holographic projections |
CN108122555A (en) * | 2017-12-18 | 2018-06-05 | 北京百度网讯科技有限公司 | The means of communication, speech recognition apparatus and terminal device |
CN110012257A (en) * | 2019-02-21 | 2019-07-12 | 百度在线网络技术(北京)有限公司 | Call method, device and terminal |
-
2020
- 2020-09-10 CN CN202010946916.1A patent/CN112182170A/en active Pending
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||