CN112182170A - Remote interaction system - Google Patents

Remote interaction system Download PDF

Info

Publication number
CN112182170A
CN112182170A CN202010946916.1A CN202010946916A CN112182170A CN 112182170 A CN112182170 A CN 112182170A CN 202010946916 A CN202010946916 A CN 202010946916A CN 112182170 A CN112182170 A CN 112182170A
Authority
CN
China
Prior art keywords
voice
information
image information
contact person
standard
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010946916.1A
Other languages
Chinese (zh)
Inventor
李伟科
黄永深
黄锐豪
邓辅秦
林淮荣
冯华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuyi University
Original Assignee
Wuyi University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuyi University filed Critical Wuyi University
Priority to CN202010946916.1A priority Critical patent/CN112182170A/en
Publication of CN112182170A publication Critical patent/CN112182170A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142Constructional details of the terminal equipment, e.g. arrangements of the camera and the display

Abstract

The invention discloses a remote interaction system, which comprises: the system comprises a plurality of called clients, a voice module for acquiring voice information, a cloud server for storing a plurality of contact person data sets, a three-dimensional projection module and a calling client; the contact data set comprises standard contact information and standard image information; the cloud server is used for receiving the voice information and extracting a corresponding contact person data set according to the voice information to obtain standard contact person information and standard image information; the three-dimensional projection module is used for receiving the standard image information, and converting and projecting the standard image information to obtain a three-dimensional image; the calling client is used for receiving the standard contact information and establishing a first communication link with the corresponding called client according to the standard contact information so as to enable the calling client and the called client to transmit communication signals mutually. The invention can realize three-dimensional remote communication, improves the sense of reality of communication between people and improves the use experience of users.

Description

Remote interaction system
Technical Field
The invention relates to the field of information interaction, in particular to a remote interaction system.
Background
At present, remote communication between people is generally realized through telephone, video or voice, but the communication mode lacks sense of reality, especially for solitary old people, the communication mode cannot meet children for a long time, and the solitary sense of the old people is difficult to eliminate by a simple communication mode.
Disclosure of Invention
The present invention is directed to solving at least one of the problems of the prior art. Therefore, the invention provides a remote interaction system which can realize three-dimensional remote communication, improve the sense of reality of communication between people and improve the use experience of users.
A remote interactive system according to an embodiment of the present invention includes: the system comprises a plurality of called clients, a voice module, a cloud server, a three-dimensional projection module and a calling client; the cloud server is internally stored with a plurality of contact person data sets, the contact person data sets comprise standard contact person information and standard image information, and each standard contact person information corresponds to one called client; the voice module is used for acquiring voice information; the cloud server is used for receiving the voice information and extracting the corresponding contact person data set according to the voice information to obtain the standard contact person information and the standard image information; the three-dimensional projection module is used for receiving the standard image information, and converting and projecting the standard image information to obtain a three-dimensional image; the calling client is used for receiving the standard contact information and establishing a first communication link with the corresponding called client according to the standard contact information so as to enable the calling client and the called client to mutually transmit communication signals through the first communication link.
The remote interaction system provided by the embodiment of the invention at least has the following beneficial effects: the voice module is convenient for acquiring voice information of a user, and the cloud server extracts corresponding standard contact information and standard image information through the voice information, so that the information extraction speed is greatly improved. The calling client side and the called client side can transmit communication signals mutually through the first communication link so as to realize remote communication between the user and the client, and meanwhile, the three-dimensional projection module can project three-dimensional images of the client, so that the sense of reality of communication between the user and the client is increased, and the use experience of the user is improved.
According to some embodiments of the present invention, the called client is further configured to establish a second communication link with the cloud server, and upload initial image information to the corresponding contact data set through the second communication link.
According to some embodiments of the present invention, the initial image information is provided with a contact tag, and the cloud server acquires the contact data set corresponding to the contact tag by identifying the contact tag, so that the initial image information is stored in the corresponding contact data set.
According to some embodiments of the present invention, the cloud server is further configured to perform normalization processing on the initial image information to obtain standard image information.
According to some embodiments of the present invention, the calling client is further configured to establish a third communication link with the cloud server, and upload initial image information to the corresponding contact data set through the third communication link.
According to some embodiments of the invention, the speech module comprises: the system comprises a voice acquisition unit, a voice processing unit and a voice recognition unit; the voice acquisition unit is used for acquiring initial voice, and the voice processing unit is used for carrying out conversion processing on the initial voice and transmitting the obtained converted voice to the voice recognition unit for recognition so as to obtain the voice information.
According to some embodiments of the invention, the system further comprises a camera module for acquiring real-time dynamic state of the user; the camera module transmits the real-time dynamic state to the cloud server for storage, and the called client acquires the real-time dynamic state of the cloud server through the second communication link.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
FIG. 1 is a schematic diagram of a remote interactive system according to an embodiment of the present invention.
Detailed Description
The embodiments of the present invention will be further explained with reference to the drawings.
As shown in fig. 1, a remote interactive system according to an embodiment of the present invention includes: the system comprises a plurality of called clients, a voice module, a cloud server, a three-dimensional projection module and a calling client; the cloud server stores a plurality of contact person data sets, the contact person data sets comprise standard contact person information and standard image information, and each standard contact person information corresponds to a called client; the voice module is used for acquiring voice information; the cloud server is used for receiving the voice information and extracting a corresponding contact person data set according to the voice information to obtain standard contact person information and standard image information; the three-dimensional projection module is used for receiving the standard image information, and converting and projecting the standard image information to obtain a three-dimensional image; the calling client is used for receiving the standard contact information and establishing a first communication link with the corresponding called client according to the standard contact information so as to enable the calling client and the called client to mutually transmit communication signals through the first communication link.
For example, as shown in fig. 1, the user of the calling client is defined as a subscriber, and the user of the called client is defined as a client. The voice module is convenient for acquiring the voice information of the user, and the cloud server extracts the corresponding standard contact information and the standard image information through the voice information, so that the information extraction speed is greatly improved. Wherein the contact data set may be named and distinguished by the name of the customer, e.g. named Xiaoming; the standard contact information may be the telephone or IP number of the customer, etc., for example, a few telephone numbers; the standard image information is an image of the customer, for example, several images of small and bright.
For example, the voice information content received by the voice module is Xiaoming, and the cloud server searches a plurality of contact person data sets to obtain the contact person data sets belonging to Xiaoming, so as to obtain a plurality of telephone numbers and a plurality of images belonging to Xiaoming.
The calling client can transmit communication signals with the called client through the first communication link, so as to realize remote communication between the user and the client, wherein the remote communication mode can be telephone communication, voice communication and the like.
Meanwhile, the three-dimensional projection module can convert standard image information into a three-dimensional image through a holographic three-dimensional stereo projection technology and project the three-dimensional image, so that the sense of reality of communication between a user and a client is increased, and the use experience of the user is improved.
The holographic three-dimensional stereo projection technology is a 3D imaging technology for recording and reproducing object light waves by utilizing the principles of laser interference and diffraction, and aims to record all information of the object light waves and present a 3D virtual image in space through diffraction and refraction of light. Therefore, the holographic three-dimensional stereo projection technology can convert two-dimensional standard image information to have three-dimensional parameters, and project the three-dimensional parameters into space to realize the stereo projection of three-dimensional images.
In some embodiments of the present invention, the called client is further configured to establish a second communication link with the cloud server, and upload the initial image information to the corresponding contact data set through the second communication link.
Specifically, the called client can upload initial image information to the corresponding contact data set through the second communication link, so that the number of standard image information is increased, the user can obtain more types of three-dimensional images, and the experience of the user is improved.
In some embodiments of the present invention, the initial image information has a contact tag, and the cloud server obtains a contact data set corresponding to the contact tag by identifying the contact tag, so that the initial image information is stored in the corresponding contact data set.
Specifically, when the client uploads the initial image information, a contact tag is added to the initial image information, and the contact tag corresponds to the contact data set, so that the cloud server can conveniently store the initial image information into the corresponding contact data set by identifying the contact tag and searching the contact data set corresponding to the contact tag, and the accuracy of storing the initial image information is ensured.
In some embodiments of the present invention, the cloud server is further configured to perform normalization processing on the initial image information to obtain standard image information.
Specifically, the normalization process includes: the normalization processing of the pixels, the normalization processing of the specification and the like enable the parameters of the initial image information and the standard image information to be consistent, and eliminate the difference between the images, so that the speed and the accuracy of the three-dimensional projection module for converting and projecting the standard image information are improved.
In some embodiments of the present invention, the calling client is further configured to establish a third communication link with the cloud server, and upload the initial image information to the corresponding contact data set through the third communication link.
Specifically, the calling client can upload the required initial image information to the corresponding contact data set through the third communication link, so that the number of standard image information is increased, the user can obtain more types of three-dimensional images, and the experience of the user is improved.
In some embodiments of the invention, the speech module comprises: the system comprises a voice acquisition unit, a voice processing unit and a voice recognition unit; the voice acquisition unit is used for acquiring initial voice, the voice processing unit is used for carrying out conversion processing on the initial voice, and transmitting the obtained converted voice to the voice recognition unit for recognition to obtain voice information.
Specifically, the voice collecting unit can collect initial voice of a user, and since the initial voice has many kinds, such as dialect, mandarin, cantonese, etc., the voice processing unit performs conversion processing on the initial voice through natural language processing technology, so that the initial voice is converted into a language easy to recognize.
The natural language processing technology is the field of interaction between computer science, artificial intelligence, linguistics and human natural language, and aims to make a computer understand and accept instructions input by human in the natural language and complete the translation function from one language to another language. Moreover, the language actually converted by the natural language processing technology can be set according to the requirement, for example, converting the dialect into the mandarin; in addition, the specific conversion method of the natural language processing technique is also not limited as long as conversion can be achieved.
Contain the speech database in the speech recognition unit, the speech recognition unit carries out the analysis to the conversion pronunciation through automatic speech recognition technology, obtains the speech parameter, then compares speech parameter and the data in the speech database, obtains the text that is closest with the conversion pronunciation, moves the speech recognition technology promptly and can convert the conversion pronunciation into the text, is favorable to the discernment of high in the clouds server to the text, has improved the speed and the rate of accuracy of high in the clouds server discernment.
In some embodiments of the present invention, the system further includes a camera module for acquiring real-time dynamics of the user; the camera module transmits the real-time dynamic state to the cloud server for storage, and the called client acquires the real-time dynamic state of the cloud server through the second communication link.
Specifically, the camera module can record the real-time dynamic state of the user and upload the recorded real-time dynamic state to the cloud server for storage; and the called client can acquire the real-time dynamic state of the cloud server through the second communication link, so that the client can conveniently know the real-time dynamic state of the user, and the information interaction between the client and the user is enhanced. Wherein, the real-time dynamic can be set as a real-time video or a real-time image, etc.
Furthermore, this embodiment is particularly useful for calling client's user to be the condition of old man, because son and daughter can't accompany the side of old man constantly, consequently son and daughter can obtain the real-time developments of high in the clouds server through the client of being called to realize constantly paying close attention to old man's state, avoid old man to take place unexpected condition.
Other constructions and operations of the remote interactive system according to the embodiments of the present invention are known to those skilled in the art and will not be described in detail herein.
The remote interactive system according to the embodiment of the present invention is described in detail in a specific embodiment with reference to fig. 1, it is to be understood that the following description is only exemplary and not a specific limitation of the invention.
As shown in fig. 1, a remote interactive system includes: the system comprises a plurality of called clients, a voice acquisition unit, a voice processing unit, a voice recognition unit, a cloud server, a three-dimensional projection module, a calling client and a camera module. The cloud server is internally stored with a plurality of contact person data sets, the contact person data sets comprise standard contact person information and standard image information, and each standard contact person information corresponds to one called client.
The voice acquisition unit is used for acquiring initial voice, the voice processing unit is used for carrying out conversion processing on the initial voice, and transmitting the obtained converted voice to the voice recognition unit for recognition to obtain voice information. The cloud server is used for receiving the voice information and extracting a corresponding contact person data set according to the voice information to obtain standard contact person information and standard image information. The three-dimensional projection module is used for receiving the standard image information, and converting and projecting the standard image information to obtain a three-dimensional image; the calling client is used for receiving the standard contact information and establishing a first communication link with the corresponding called client according to the standard contact information so as to enable the calling client and the called client to mutually transmit communication signals through the first communication link.
The called client is also used for establishing a second communication link with the cloud server and uploading the initial image information with the contact person label to the cloud server through the second communication link; the calling client is also used for establishing a third communication link with the cloud server and uploading initial image information with the contact person label to the cloud server through the third communication link; the cloud server identifies the contact person tags, obtains contact person data sets corresponding to the contact person tags, normalizes the initial image information and stores the normalized initial image information into the corresponding contact person data sets.
The camera module is used for acquiring the real-time dynamic state of the user, transmitting the real-time dynamic state to the cloud server for storage, and the called client acquires the real-time dynamic state of the cloud server through the second communication link.
According to the remote interaction system provided by the embodiment of the invention, at least some effects can be achieved, the voice acquisition unit, the voice processing unit and the voice recognition unit are used for acquiring the dialect of the user, converting and recognizing the dialect to enable the dialect to be processed into the text, so that the text content can be conveniently recognized by the cloud server, the contact person data set corresponding to the text content can be obtained through searching, the standard contact person information and the standard image information can be further obtained, and the speed and the accuracy of information extraction are greatly improved.
The calling client side and the called client side can transmit communication signals mutually through the first communication link so as to realize remote communication between the user and the client, and meanwhile, the three-dimensional projection module can project three-dimensional images of the client, so that the sense of reality of communication between the user and the client is increased, and the use experience of the user is improved. In addition, the calling client and the called client can upload initial image information to the corresponding contact data set, so that the number of standard image information is increased, a user can obtain more types of three-dimensional images, and the experience of the user is improved.
The embodiment is particularly suitable for the condition that the user of the calling client is the old, and is convenient for children to acquire the real-time dynamic state of the cloud server through the called client, so that the state of paying attention to the old at any time is realized, and the condition that the old is in an accident is avoided.
In the description herein, references to the description of "one embodiment," "some embodiments," or "the embodiment" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
While embodiments of the invention have been shown and described, it will be understood by those of ordinary skill in the art that: various changes, modifications, substitutions and alterations can be made to the embodiments without departing from the principles and spirit of the invention, the scope of which is defined by the claims and their equivalents.

Claims (7)

1. A remote interactive system, comprising:
a plurality of called clients;
the voice module is used for acquiring voice information;
the cloud server stores a plurality of contact person data sets, the contact person data sets comprise standard contact person information and standard image information, and each standard contact person information corresponds to one called client; the cloud server is used for receiving the voice information and extracting the corresponding contact person data set according to the voice information to obtain the standard contact person information and the standard image information;
the three-dimensional projection module is used for receiving the standard image information, and converting and projecting the standard image information to obtain a three-dimensional image;
and the calling client is used for receiving the standard contact information and establishing a first communication link with the corresponding called client according to the standard contact information so as to enable the calling client and the called client to mutually transmit communication signals through the first communication link.
2. A remote interactive system as claimed in claim 1, characterized in that: and the called client is also used for establishing a second communication link with the cloud server and uploading initial image information to the corresponding contact data set through the second communication link.
3. A remote interactive system as claimed in claim 2, characterized in that: the initial image information is provided with a contact person tag, and the cloud server identifies the contact person tag to acquire the contact person data set corresponding to the contact person tag so that the initial image information is stored in the corresponding contact person data set.
4. A remote interactive system as claimed in claim 2, characterized in that: and the cloud server is also used for carrying out normalization processing on the initial image information to obtain standard image information.
5. A remote interactive system as claimed in claim 1, characterized in that: the calling client is further used for establishing a third communication link with the cloud server and uploading initial image information to the corresponding contact person data set through the third communication link.
6. A remote interactive system as claimed in claim 1, characterized in that: the voice module includes: the system comprises a voice acquisition unit, a voice processing unit and a voice recognition unit; the voice acquisition unit is used for acquiring initial voice, and the voice processing unit is used for carrying out conversion processing on the initial voice and transmitting the obtained converted voice to the voice recognition unit for recognition so as to obtain the voice information.
7. A remote interactive system as claimed in claim 2, characterized in that: the system also comprises a camera module for acquiring the real-time dynamic state of the user; the camera module transmits the real-time dynamic state to the cloud server for storage, and the called client acquires the real-time dynamic state of the cloud server through the second communication link.
CN202010946916.1A 2020-09-10 2020-09-10 Remote interaction system Pending CN112182170A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010946916.1A CN112182170A (en) 2020-09-10 2020-09-10 Remote interaction system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010946916.1A CN112182170A (en) 2020-09-10 2020-09-10 Remote interaction system

Publications (1)

Publication Number Publication Date
CN112182170A true CN112182170A (en) 2021-01-05

Family

ID=73921692

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010946916.1A Pending CN112182170A (en) 2020-09-10 2020-09-10 Remote interaction system

Country Status (1)

Country Link
CN (1) CN112182170A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102413306A (en) * 2011-11-21 2012-04-11 康佳集团股份有限公司 3D television-based three-dimensional video call method and 3D television
CN107463248A (en) * 2017-06-20 2017-12-12 昆明理工大学 A kind of remote interaction method caught based on dynamic with line holographic projections
CN108122555A (en) * 2017-12-18 2018-06-05 北京百度网讯科技有限公司 The means of communication, speech recognition apparatus and terminal device
CN110012257A (en) * 2019-02-21 2019-07-12 百度在线网络技术(北京)有限公司 Call method, device and terminal

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102413306A (en) * 2011-11-21 2012-04-11 康佳集团股份有限公司 3D television-based three-dimensional video call method and 3D television
CN107463248A (en) * 2017-06-20 2017-12-12 昆明理工大学 A kind of remote interaction method caught based on dynamic with line holographic projections
CN108122555A (en) * 2017-12-18 2018-06-05 北京百度网讯科技有限公司 The means of communication, speech recognition apparatus and terminal device
CN110012257A (en) * 2019-02-21 2019-07-12 百度在线网络技术(北京)有限公司 Call method, device and terminal

Similar Documents

Publication Publication Date Title
JP7062851B2 (en) Voiceprint creation / registration method and equipment
US11138903B2 (en) Method, apparatus, device and system for sign language translation
US20190188903A1 (en) Method and apparatus for providing virtual companion to a user
KR101887637B1 (en) Robot system
KR102276951B1 (en) Output method for artificial intelligence speakers based on emotional values calculated from voice and face
US20240070397A1 (en) Human-computer interaction method, apparatus and system, electronic device and computer medium
CN110853646A (en) Method, device and equipment for distinguishing conference speaking roles and readable storage medium
CN111586469B (en) Bullet screen display method and device and electronic equipment
CN112016367A (en) Emotion recognition system and method and electronic equipment
CN113392270A (en) Video processing method, video processing device, computer equipment and storage medium
CN111564157A (en) Conference record optimization method, device, equipment and storage medium
CN111046148A (en) Intelligent interaction system and intelligent customer service robot
CN113392687A (en) Video title generation method and device, computer equipment and storage medium
US11580971B2 (en) Photo album management method, storage medium and electronic device
EP2503545A1 (en) Arrangement and method relating to audio recognition
CN114419527B (en) Data processing method, equipment and computer readable storage medium
CN114268747A (en) Interview service processing method based on virtual digital people and related device
CN116562270A (en) Natural language processing system supporting multi-mode input and method thereof
CN115687664A (en) Chinese image-text retrieval method and data processing method for Chinese image-text retrieval
Artemov et al. Designing Soft-Hardware Complex for Gesture Language Recognition using Neural Network Methods
JP6843409B1 (en) Learning method, content playback device, and content playback system
CN112182170A (en) Remote interaction system
CN113709364B (en) Camera identifying equipment and object identifying method
CN109740510B (en) Method and apparatus for outputting information
CN115171673A (en) Role portrait based communication auxiliary method and device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination