Detailed description of the invention
To make the objectives, technical solutions, and advantages of the present invention clearer,
the present invention is described in further detail below with reference to the accompanying
drawings. Evidently, the described embodiments are only some, rather than all, of the
embodiments of the present invention. All other embodiments obtained by a person of ordinary
skill in the art based on the embodiments of the present invention without creative effort
fall within the protection scope of the present invention.
The method and apparatus for processing multimedia information proposed by the present
invention can be applied to, but are not limited to, the following scenarios:
Real-time communication: for example, Party A captures himself and his surroundings and
transmits them in real time to Party B, and Party B roams in that environment and interacts
with Party A; for another example, Party A and Party B each capture themselves and their
surroundings and transmit them in real time to the other party, and the two optionally roam
and interact in the environment in which either of them is physically located or in any
third-party environment;
Remote observation and monitoring;
Work: for example, remote working by one or more people, immersive remote meetings,
immersive remote cooperation or helping customers solve problems, or immersive training;
Education: for example, attending a virtual classroom immersively and interacting with a
teacher in a virtual environment;
Medical care: for example, telemedicine and interacting with a doctor in a virtual environment;
Business: for example, remote shopping and interacting with a merchant in a virtual
environment, or an all-around fitting mirror;
Sports: for example, one or more people racing a sprint champion in a virtual environment;
Entertainment: for example, games played by one or more people in a virtual space, immersive
participation in live television, or interaction with film characters;
Personal life: for example, recording and projecting a four-dimensional diary, visiting a
museum remotely, accompanying family members or pets remotely, or remote adult applications;
The present invention can also be applied to the following scenarios:
Generation of virtual reality or augmented reality content, including the production of film,
television, game, and video content;
Making a four-dimensional historical record of a particular time, space, and place.
Figure 1A is a schematic flowchart of a method for presenting multimedia information according
to the present invention. The detailed process is as follows:
Step 100: Receive a four-dimensional spacetime model for characterizing presentation
information, where the four-dimensional spacetime model has attributes that can characterize,
in digitized form, the presentation information as it changes over time, and the presentation
information includes electromagnetic field spectral information that characterizes an object
and that can be observed with the naked eye and/or collected by a device.
In the embodiments of the present invention, the electromagnetic field spectral information
described in step 100 may be emitted by the object, reflected by the object, or refracted by
the object, which is not specifically limited here.
In the embodiments of the present invention, the electromagnetic field spectral information
described in step 100 may include at least one of radio wave information, infrared
information, visible light information, ultraviolet information, X-ray information, and gamma
ray information, where the visible light information may include laser light.
In the embodiments of the present invention, the object corresponding to the presentation
information may be an indoor and/or outdoor object of any field-of-view size and angle.
In the embodiments of the present invention, in terms of content, the four-dimensional
spacetime model includes at least the following attributes:
Spatial position attribute: may refer to the coordinates, at any moment, of every point on an
object in a coordinate system that does not change over time;
Appearance attribute: may refer to the texture and spectral features (for example, color) of
the object surface at any moment, and the geometric characteristics (for example, normal
direction, curvature, and smoothness) of the object surface;
Sound attribute;
Motion attribute: may refer to the velocity vector and acceleration vector of every point on
an object at any moment; or may refer to the angular velocity vector and angular acceleration
vector of each part of the object that can be regarded as a rigid body;
Other attributes: may refer to at least one of the category, identity, material, and
interrelationships of objects, and any other information that can be inferred from the
presentation information and from its change over time.
In terms of form, the four-dimensional spacetime model exists in a storage medium as digitized
data. In this form it can be stored, presented, retrieved, edited, transmitted, encrypted,
and used for more advanced intelligent applications.
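The attributes listed above can be pictured as a simple data layout. The following Python
sketch is illustrative only: the class names `PointSample`, `Frame`, and `SpacetimeModel4D`,
their fields, and the per-frame organization are assumptions made for this example and are
not specified by the present invention.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class PointSample:
    """One surface point of an object at one instant (hypothetical layout)."""
    position: Vec3                          # spatial position attribute, fixed coordinate system
    color: Vec3                             # appearance attribute: spectral feature, here RGB
    normal: Vec3                            # appearance attribute: surface normal direction
    velocity: Vec3 = (0.0, 0.0, 0.0)        # motion attribute
    acceleration: Vec3 = (0.0, 0.0, 0.0)    # motion attribute


@dataclass
class Frame:
    """All samples captured for one timestamp."""
    timestamp: float
    points: List[PointSample] = field(default_factory=list)
    sound: Optional[bytes] = None                         # sound attribute (raw samples)
    labels: Dict[str, str] = field(default_factory=dict)  # other attributes: category, identity, ...


@dataclass
class SpacetimeModel4D:
    """A time-ordered collection of frames: three spatial dimensions plus time."""
    frames: List[Frame] = field(default_factory=list)

    def frame_at(self, t: float) -> Frame:
        """Return the latest frame whose timestamp does not exceed t."""
        candidates = [f for f in self.frames if f.timestamp <= t]
        if not candidates:
            raise ValueError("no frame at or before t")
        return max(candidates, key=lambda f: f.timestamp)
```

Because the model is plain digitized data, a structure of this kind can be stored, retrieved,
edited, or serialized for transmission without special handling.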
Step 110: Decode the four-dimensional spacetime model to obtain a decoded four-dimensional
spacetime model.
In the embodiments of the present invention, the four-dimensional spacetime model received in
step 100 may have been compressed; in that case, the four-dimensional spacetime model is also
decompressed.
Further, to improve transmission security, the received four-dimensional spacetime model may
have been encrypted; in that case, the received four-dimensional spacetime model is also
decrypted.
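The optional decryption, decompression, and decoding described for step 110 can be sketched as
a small round trip. This is a minimal illustration under stated assumptions: JSON stands in
for the model encoding, `zlib` for the compression, and `xor_bytes` is a toy placeholder for a
real cipher. None of these choices, nor the function names `pack_model` and `unpack_model`,
come from the present invention.

```python
import json
import zlib
from itertools import cycle


def xor_bytes(data: bytes, key: bytes) -> bytes:
    """Toy symmetric transform standing in for a real cipher. NOT secure."""
    return bytes(b ^ k for b, k in zip(data, cycle(key)))


def pack_model(model: dict, key: bytes = b"") -> bytes:
    """Sender side (assumed order): encode, compress, then optionally encrypt."""
    payload = zlib.compress(json.dumps(model).encode("utf-8"))
    return xor_bytes(payload, key) if key else payload


def unpack_model(blob: bytes, key: bytes = b"") -> dict:
    """Receiver side: optionally decrypt, then decompress, then decode."""
    payload = xor_bytes(blob, key) if key else blob
    return json.loads(zlib.decompress(payload).decode("utf-8"))
```

The receiver undoes the sender's operations in reverse order, which is why decryption comes
before decompression in `unpack_model`.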
Step 120: Present, according to the decoded four-dimensional spacetime model, the
presentation information characterized by the four-dimensional spacetime model.
In the embodiments of the present invention, the scene at the end where the device presenting
the multimedia information is located may also be presented. Therefore, before the
presentation information characterized by the four-dimensional spacetime model is presented,
the method further includes the following operation:
Fusing the four-dimensional spacetime model with a first space-time model to obtain a target
four-dimensional spacetime model, where the first space-time model characterizes the
presentation information of the place where the multimedia information is presented;
In this case, when the presentation information characterized by the four-dimensional
spacetime model is presented, the following manner may optionally be used:
Presenting, according to the target four-dimensional spacetime model, the presentation
information characterized by the four-dimensional spacetime model and the presentation
information characterized by the first space-time model.
For example, the scene corresponding to the presentation information characterized by the
four-dimensional spacetime model is a seaside scene, and the scene corresponding to the
presentation information characterized by the first space-time model is a desk in an office.
In this case, the presented scene may be a fused scene in which the seashore lies in front of
the desk.
Further, human bodies and objects may also be detected, tracked, and recognized, so that a
real physical region can be superimposed on a virtual region. For example, an observer wearing
a VR helmet faces a grassland, while in reality the room he is in has walls; object detection
can then superimpose the information of the real physical wall onto the grassland shown in the
VR helmet, presenting a translucent wall in the grassland. For another example, with hand
detection, the gesture of the real hand can be detected and a virtual hand then superimposed
into the four-dimensional model. In other words, some virtual scenes may also be fused in.
Therefore, before the presentation information characterized by the four-dimensional spacetime
model is presented, the method further includes the following operation:
Fusing the four-dimensional spacetime model with a first space-time model and a second
space-time model of the local end to obtain a target four-dimensional spacetime model, where
the first space-time model characterizes the presentation information of the place where the
device presenting the multimedia information is located, and the second space-time model
characterizes the presentation information of a virtual object;
In this case, when the presentation information characterized by the four-dimensional
spacetime model is presented, the following manner may optionally be used:
Presenting, according to the target four-dimensional spacetime model, the presentation
information characterized by the four-dimensional spacetime model, the presentation
information characterized by the first space-time model, and the presentation information
characterized by the second space-time model.
For example, the scene corresponding to the presentation information characterized by the
four-dimensional spacetime model is a seaside scene, and the scene corresponding to the
presentation information characterized by the first space-time model is a desk in an office.
In this case, the presented scene may be a fused scene in which the seashore lies in front of
the desk. Further, suppose a pot of flowers is to be shown on the presented desk, although in
reality there are no flowers on the desk. The flowers can then be characterized by the second
space-time model, and the four-dimensional spacetime model is fused with the first space-time
model and the second space-time model of the local end to obtain the target four-dimensional
spacetime model. The scene presented in this case may be a fused scene in which the seashore
lies in front of the desk and the flowers stand on the desk.
In the embodiments of the present invention, the presented scene may include not only
pictures but also sound. Therefore, the presentation information further includes sound field
information that can be perceived by the ear and/or collected by a device, and the
four-dimensional spacetime model is further used to characterize the sound field information
of the object corresponding to the presentation information. In this case, the method further
includes the following operation:
Playing the sound field information characterized by the four-dimensional spacetime model.
In the embodiments of the present invention, to make the scene corresponding to the presented
presentation information more similar to the real scene, the front orientation information
of the device presenting the multimedia information may be referred to when the presentation
information characterized by the four-dimensional spacetime model is presented. Therefore,
before the presentation information characterized by the four-dimensional spacetime model is
presented, the method further includes the following operation:
Determining the front orientation information of the device presenting the multimedia
information;
In this case, when the presentation information characterized by the four-dimensional
spacetime model is presented, the following manner may optionally be used:
Presenting, according to the front orientation information, the presentation information
characterized by the four-dimensional spacetime model.
In the embodiments of the present invention, when the front orientation information of the
device presenting the multimedia information is determined, the following manner may
optionally be used:
Performing attitude calculation on the data of the inertial navigation unit associated with
the device presenting the multimedia information, to obtain the front orientation information
of the device presenting the multimedia information.
The inertial navigation unit may be any one or any combination of a gyroscope, a
magnetometer, and an accelerometer.
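The text does not specify how the attitude calculation is carried out. One common technique,
shown here purely as an illustration and not as the method of the present invention, is a
complementary filter that blends the integrated gyroscope rate with the tilt angle derived
from the accelerometer. The function names, the axis convention, and the blending weight
`alpha` are assumptions made for this sketch.

```python
import math
from typing import Tuple


def accel_pitch_roll(ax: float, ay: float, az: float) -> Tuple[float, float]:
    """Estimate pitch and roll (radians) from the gravity direction measured
    by an accelerometer at rest. Assumes z points up when the device is flat."""
    pitch = math.atan2(-ax, math.sqrt(ay * ay + az * az))
    roll = math.atan2(ay, az)
    return pitch, roll


def complementary_update(
    angle: float,
    gyro_rate: float,
    accel_angle: float,
    dt: float,
    alpha: float = 0.98,
) -> float:
    """One filter step: trust the gyroscope over short intervals (weight alpha)
    and the accelerometer over long intervals (weight 1 - alpha)."""
    return alpha * (angle + gyro_rate * dt) + (1.0 - alpha) * accel_angle
```

Pitch and roll alone do not give the heading; a magnetometer reading would typically be added
to estimate yaw, which is consistent with the sensor combinations the text allows.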
In the embodiments of the present invention, the precision of the part the observer is
interested in can be selectively improved. Further, the method also includes the following
operations:
Determining the front orientation information of the device presenting the multimedia
information and the target multimedia information;
Feeding back the front orientation information and the target multimedia information to the
device that sends the four-dimensional spacetime model.
For example, the scene corresponding to the presentation information contains a beach, a
person, and a sailboat. If the eyes of the user holding the device presenting the multimedia
information are fixed on the person, the person is taken as the target multimedia
information. In this way, when acquiring presentation information, the device that sends the
four-dimensional spacetime model may acquire only the presentation information of the person
and may omit the presentation information of the sailboat.
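The feedback loop in the beach example can be sketched as a small message plus a filter on the
sending side. This is a hypothetical illustration: the message fields, the label-based
matching, and the function names `build_feedback` and `select_for_capture` are assumptions and
are not defined by the present invention.

```python
from typing import Iterable, List, Sequence


def build_feedback(front_orientation: Sequence[float], targets: Iterable[str]) -> dict:
    """Presenting side: package the front orientation and the target labels."""
    return {
        "front_orientation": tuple(front_orientation),
        "targets": sorted(set(targets)),
    }


def select_for_capture(scene_objects: Sequence[dict], feedback: dict) -> List[dict]:
    """Sending side: keep only the objects named as targets.
    With no targets, everything is kept."""
    targets = set(feedback["targets"])
    if not targets:
        return list(scene_objects)
    return [obj for obj in scene_objects if obj["label"] in targets]


scene = [{"label": "beach"}, {"label": "person"}, {"label": "sailboat"}]
feedback = build_feedback((0.0, 0.0, 1.0), ["person"])
kept = select_for_capture(scene, feedback)  # only the person remains
```

A variant could keep all objects and merely assign the targets a higher sampling precision,
which also matches the stated goal of improving precision selectively.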
In the embodiments of the present invention, the target multimedia information may be
determined based on the "eyeball" captured by the camera of the device presenting the
multimedia information.
It should be noted that the first space-time model and the second space-time model described
in the embodiments of the present invention may be established in advance or in real time by
the device presenting the multimedia information, or may be established in advance or in real
time by another device and sent to the device presenting the multimedia information, which is
not specifically limited here.
In the embodiments of the present invention, in some scenarios only the presentation
information characterized by the four-dimensional spacetime model is presented. For example,
in a remote working or remote communication scenario, the user of the device presenting the
multimedia information only wishes to experience the "real remote" scene sent by the device
that sends the four-dimensional spacetime model; in this case, only the presentation
information characterized by the four-dimensional spacetime model is presented. In other
scenarios, in addition to the presentation information characterized by the four-dimensional
spacetime model, the presentation information characterized by the first space-time model or
the second space-time model may also be presented, with some virtual props added at the
presenting end. For example, the user of the device presenting the multimedia information not
only experiences the scene sent by the device that sends the four-dimensional spacetime model
but also adds virtual props to that scene, such as casually drawing a whiteboard in the air
with a wave of the hand, or, for a game, adding virtual props (e.g., casting a bolt of
"lightning" from the hand to hit a stone in the scene).
In the embodiments of the present invention, the first markup information and/or the second
markup information may further be presented.
In the embodiments of the present invention, four-dimensional spacetime models sent by
multiple devices may also be received. For example, the scene corresponding to the
presentation information characterized by a first four-dimensional spacetime model received
from a first sending end is the Temple of Heaven, and the scene corresponding to the
presentation information characterized by a second four-dimensional spacetime model received
from a second sending end is the Eiffel Tower. At presentation time, the Temple of Heaven and
the Eiffel Tower may be presented side by side.
The present invention also gives a detailed process for presenting a four-dimensional
spacetime model. Referring to Figure 1B, the four-dimensional spacetime model, the first
space-time model, and the second space-time model are fused to obtain a target
four-dimensional spacetime model; the front orientation information of the device presenting
the multimedia information and the target multimedia information are determined; the
presentation information characterized by the four-dimensional spacetime model is presented
according to the front orientation information and the target four-dimensional spacetime
model; and the front orientation information and the target multimedia information are fed
back to the device that sends the four-dimensional spacetime model.
The embodiments of the present invention disclose a method for presenting multimedia
information: receiving a four-dimensional spacetime model for characterizing presentation
information, where the four-dimensional spacetime model has attributes that can characterize,
in digitized form, the presentation information as it changes over time; decoding the
four-dimensional spacetime model to obtain a decoded four-dimensional spacetime model; and
presenting, according to the decoded four-dimensional spacetime model, the presentation
information characterized by the four-dimensional spacetime model. In this solution, because
the four-dimensional spacetime model has attributes that can characterize, in digitized form,
the presentation information as it changes over time, the problem that the presented
presentation information is delayed is solved to a certain extent, and the delay defect in
the prior art is therefore addressed to a certain extent.
Referring to Figure 2A, an embodiment of the present invention further provides a device for
presenting multimedia information, including:
A receiving unit 20, configured to receive a four-dimensional spacetime model for
characterizing presentation information, where the four-dimensional spacetime model has
attributes that can characterize, in digitized form, the presentation information as it
changes over time, and the presentation information includes electromagnetic field spectral
information that characterizes an object and that can be observed with the naked eye and/or
collected by a device;
A four-dimensional spacetime model processing unit 21, configured to decode the
four-dimensional spacetime model to obtain a decoded four-dimensional spacetime model;
A display unit 22, configured to present, according to the decoded four-dimensional spacetime
model, the presentation information characterized by the four-dimensional spacetime model.
In the embodiments of the present invention, the four-dimensional spacetime model received by
the receiving unit 20 may have been compressed; in that case, the four-dimensional spacetime
model is also decompressed.
Further, to improve transmission security, the four-dimensional spacetime model received by
the receiving unit 20 may have been encrypted; in that case, the received four-dimensional
spacetime model is also decrypted.
In the embodiments of the present invention, the scene at the end where the device presenting
the multimedia information is located may also be presented. Therefore, the device further
includes a model fusion unit 23, configured to fuse the four-dimensional spacetime model with
a first space-time model to obtain a target four-dimensional spacetime model, where the first
space-time model characterizes the presentation information of the place where the device
presenting the multimedia information is located;
In this case, when presenting the presentation information characterized by the
four-dimensional spacetime model, the display unit 22 may optionally use the following manner:
Presenting, according to the target four-dimensional spacetime model, the presentation
information characterized by the four-dimensional spacetime model and the presentation
information characterized by the first space-time model.
For example, the scene corresponding to the presentation information characterized by the
four-dimensional spacetime model is a seaside scene, and the scene corresponding to the
presentation information characterized by the first space-time model is a desk in an office.
In this case, the scene presented by the display unit 22 may be a fused scene in which the
seashore lies in front of the desk.
Further, human bodies and objects may also be detected, tracked, and recognized, so that a
real physical region can be superimposed on a virtual region. For example, an observer wearing
a VR helmet faces a grassland, while in reality the room he is in has walls; object detection
can then superimpose the information of the real physical wall onto the grassland shown in the
VR helmet, presenting a translucent wall in the grassland. For another example, with hand
detection, the gesture of the real hand can be detected and a virtual hand then superimposed
into the four-dimensional model. In other words, some virtual scenes may also be fused in. To
this end, the device further includes a model fusion unit 23, configured to fuse the
four-dimensional spacetime model with a first space-time model and a second space-time model
of the device presenting the multimedia information to obtain a target four-dimensional
spacetime model, where the first space-time model characterizes the presentation information
of the place where the device presenting the multimedia information is located, and the second
space-time model characterizes the presentation information of a virtual object;
In this case, when presenting the presentation information characterized by the
four-dimensional spacetime model, the display unit 22 may optionally use the following manner:
Presenting, according to the target four-dimensional spacetime model, the presentation
information characterized by the four-dimensional spacetime model, the presentation
information characterized by the first space-time model, and the presentation information
characterized by the second space-time model.
For example, the scene corresponding to the presentation information characterized by the
four-dimensional spacetime model is a seaside scene, and the scene corresponding to the
presentation information characterized by the first space-time model is a desk in an office.
In this case, the scene presented by the display unit 22 may be a fused scene in which the
seashore lies in front of the desk. Further, suppose a pot of flowers is to be shown on the
presented desk, although in reality there are no flowers on the desk. The flowers can then be
characterized by the second space-time model, and the four-dimensional spacetime model is
fused with the first space-time model and the second space-time model of the local end to
obtain the target four-dimensional spacetime model. The scene presented by the display unit 22
in this case may be a fused scene in which the seashore lies in front of the desk and the
flowers stand on the desk.
In the embodiments of the present invention, the presented scene may include not only
pictures but also sound. Therefore, the presentation information further includes sound field
information that can be perceived by the ear and/or collected by a device, and the
four-dimensional spacetime model is further used to characterize the sound field information
of the object corresponding to the presentation information;
In this case, the device further includes a playback unit 24, configured to play the sound
field information characterized by the four-dimensional spacetime model.
In the embodiments of the present invention, to make the scene corresponding to the presented
presentation information more similar to the real scene, the display unit 22 may refer to the
front orientation information of the device presenting the multimedia information when
presenting the presentation information characterized by the four-dimensional spacetime
model. Therefore, the device further includes a processing unit 25, configured to determine
the front orientation information of the device presenting the multimedia information;
When presenting the presentation information characterized by the four-dimensional spacetime
model, the display unit 22 may optionally use the following manner:
Presenting, according to the front orientation information, the presentation information
characterized by the four-dimensional spacetime model.
In the embodiments of the present invention, when determining the front orientation
information of the device presenting the multimedia information, the processing unit 25 may
optionally use the following manner:
Performing attitude calculation on the data of the inertial navigation unit associated with
the device presenting the multimedia information, to obtain the front orientation information
of the device presenting the multimedia information.
The inertial navigation unit may be any one or any combination of a gyroscope, a
magnetometer, and an accelerometer.
In the embodiments of the present invention, the precision of the part the observer is
interested in can be selectively improved. Further, the device also includes a processing
unit 25, configured to determine the front orientation information of the device presenting
the multimedia information and the target multimedia information;
The device further includes a feedback unit 26, configured to feed back the front orientation
information and the target multimedia information to the device that sends the
four-dimensional spacetime model.
For example, the scene corresponding to the presentation information contains a beach, a
person, and a sailboat. If the eyes of the user holding the device presenting the multimedia
information are fixed on the person, the person is taken as the target multimedia
information. In this way, when acquiring presentation information, the device that sends the
four-dimensional spacetime model may acquire only the presentation information of the person
and may omit the presentation information of the sailboat.
In the embodiments of the present invention, the processing unit 25 may determine the target
multimedia information based on the "eyeball" captured by the camera of the device presenting
the multimedia information.
It should be noted that the first space-time model and the second space-time model described
in the embodiments of the present invention may be established in advance or in real time by
the device presenting the multimedia information, or may be established in advance or in real
time by another device and sent to the device presenting the multimedia information, which is
not specifically limited here.
In the embodiments of the present invention, in some scenarios the display unit 22 presents
only the presentation information characterized by the four-dimensional spacetime model. For
example, in a remote working or remote communication scenario, the user of the device
presenting the multimedia information only wishes to experience the "real remote" scene sent
by the device that sends the four-dimensional spacetime model; in this case, only the
presentation information characterized by the four-dimensional spacetime model is presented.
In other scenarios, in addition to the presentation information characterized by the
four-dimensional spacetime model, the display unit 22 may also present the presentation
information characterized by the first space-time model or the second space-time model, with
some virtual props added at the presenting end. For example, the user of the device
presenting the multimedia information not only experiences the scene sent by the device that
sends the four-dimensional spacetime model but also adds virtual props to that scene, such as
casually drawing a whiteboard in the air with a wave of the hand, or, for a game, adding
virtual props (e.g., casting a bolt of "lightning" from the hand to hit a stone in the scene).
In the embodiments of the present invention, the receiving unit 20 may also receive
four-dimensional spacetime models sent by multiple devices. For example, the scene
corresponding to the presentation information characterized by a first four-dimensional
spacetime model received from a first sending end is the Temple of Heaven, and the scene
corresponding to the presentation information characterized by a second four-dimensional
spacetime model received from a second sending end is the Eiffel Tower. At presentation time,
the Temple of Heaven and the Eiffel Tower may be presented side by side.
The embodiments of the present invention disclose a device for presenting multimedia
information: a receiving unit 20, configured to receive a four-dimensional spacetime model for
characterizing presentation information, where the four-dimensional spacetime model has
attributes that can characterize, in digitized form, the presentation information as it
changes over time, and the presentation information includes electromagnetic field spectral
information that characterizes an object and that can be observed with the naked eye and/or
collected by a device; a four-dimensional spacetime model processing unit 21, configured to
decode the four-dimensional spacetime model to obtain a decoded four-dimensional spacetime
model; and a display unit 22, configured to present, according to the decoded
four-dimensional spacetime model, the presentation information characterized by the
four-dimensional spacetime model. In this solution, because the four-dimensional spacetime
model has attributes that can characterize, in digitized form, the presentation information
as it changes over time, the problem that the presented presentation information is delayed is
solved to a certain extent, and the delay defect in the prior art is therefore addressed to a
certain extent.
Referring to Figure 2B, in the embodiments of the present invention, the received
four-dimensional spacetime model may be established in the following manner:
Step 200: Acquire presentation information, where the presentation information includes
electromagnetic field spectral information that characterizes an object and that can be
observed with the naked eye and/or collected by a device;
Step 210: Establish, according to the acquired presentation information, a four-dimensional
spacetime model for characterizing the presentation information, where the four-dimensional
spacetime model has attributes that can characterize, in digitized form, the presentation
information as it changes over time;
Step 220: Encode the established four-dimensional spacetime model, and send the encoded
four-dimensional spacetime model.
The electromagnetic field spectral information described in the embodiments of the present
invention may be emitted by the object, reflected by the object, or refracted by the object,
which is not specifically limited here.
The electromagnetic field spectral information described in the embodiments of the present
invention may include at least one of radio wave information, infrared information, visible
light information, ultraviolet information, X-ray information, and gamma ray information,
where the visible light information may include laser light.
In the embodiments of the present invention, the object corresponding to the presentation
information may be an indoor and/or outdoor object of any field-of-view size and angle.
In the embodiments of the present invention, when the presentation information is acquired,
24 to 120 frames may be acquired per second.
In the embodiments of the present invention, the acquired presentation information may be
presentation information acquired at different spatial points and/or different time points.
In the embodiments of the present invention, the four-dimensional spacetime model includes, in
content, at least the following attributes:
Spatial position attribute: may refer to the coordinates of every point on an object at any
moment, in a coordinate system that does not change over time;
Appearance attribute: may refer to the texture and spectral features (for example, color) of
the object surface at any moment, and the geometric properties of the object surface (for
example, normal direction, curvature, smoothness, etc.);
Sound attribute;
Motion attribute: may refer to the velocity vector and acceleration vector of every point on
the object at any moment; or may refer to the angular velocity vector and angular acceleration
vector of every part of the object that can be regarded as a rigid body;
Other attributes: may refer to at least one of the category, identity, material, mutual
relations, etc. of the object, that is, any information that can be inferred from the
presentation information and from the change of the presentation information over time.
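The attribute list above can be pictured as a small data structure. The following is a minimal sketch only: the class and field names (`PointSample`, `Frame`, `SpacetimeModel`, `labels`, and so on) are illustrative assumptions, not names defined by this description, and the layout is one of many possible ways to hold time-varying attributes.

```python
import bisect
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class PointSample:
    """One surface point of an object at a single moment (hypothetical layout)."""
    position: Vec3                         # spatial position, fixed world frame
    color: Vec3 = (0.0, 0.0, 0.0)          # appearance: spectral feature (RGB)
    normal: Vec3 = (0.0, 0.0, 1.0)         # appearance: surface geometry
    velocity: Vec3 = (0.0, 0.0, 0.0)       # motion attribute
    acceleration: Vec3 = (0.0, 0.0, 0.0)   # motion attribute


@dataclass
class Frame:
    """All attributes captured for one moment in time."""
    timestamp: float
    points: List[PointSample] = field(default_factory=list)
    audio: bytes = b""                                     # sound attribute
    labels: Dict[str, str] = field(default_factory=dict)   # other attributes


@dataclass
class SpacetimeModel:
    """Frames kept sorted by timestamp, so the model can be queried over time."""
    frames: List[Frame] = field(default_factory=list)

    def add_frame(self, frame: Frame) -> None:
        times = [f.timestamp for f in self.frames]
        self.frames.insert(bisect.bisect_right(times, frame.timestamp), frame)

    def frame_at(self, t: float) -> Optional[Frame]:
        """Return the latest frame whose timestamp is <= t, or None."""
        times = [f.timestamp for f in self.frames]
        index = bisect.bisect_right(times, t)
        return self.frames[index - 1] if index > 0 else None
```

Keeping the frames ordered by time is what makes the structure "four-dimensional": a query such as `model.frame_at(t)` selects the three-dimensional state at a chosen moment.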
In form, the four-dimensional spacetime model exists in a storage medium as digitized data;
in this form it can be stored, presented, retrieved, edited, transmitted, encrypted, and used
in more advanced intelligent applications.
In the embodiments of the present invention, after the four-dimensional spacetime model is
established, it may further be modified, enhanced, and optimized.
In the embodiments of the present invention, the four-dimensional spacetime model for
characterizing the presentation information may optionally be established according to the
obtained presentation information in the following manner:
Process the presentation information to obtain first markup information;
Obtain, according to the first markup information and the presentation information, first point
cloud information including geometric information and second point cloud information including
texture information;
Fuse the first point cloud information and the second point cloud information to obtain target
point cloud information;
Obtain visual information according to the target point cloud information;
Obtain a spatial model according to the visual information, and fuse the spatial models of
different moments;
Obtain the four-dimensional spacetime model according to the fused spatial model, the first
markup information, and the second markup information.
In practical applications, in addition to the electromagnetic field spectral information for
characterizing an object that can be observed with the naked eye and/or collected with a
device, the presentation information may also include sound field information. In this case,
before the spatial model is obtained according to the visual information, the method further
includes the following operation:
Calculate, according to the presentation information, the sound field information of the object
corresponding to the presentation information, where the presentation information further
includes sound field information that can be perceived by the ear and/or collected with a device;
In this case, the spatial model may optionally be obtained according to the visual information
in the following manner:
Fuse the visual information and the sound field information to obtain the spatial model.
The sound field information described in the embodiments of the present invention refers not
only to the audio information itself but also to the spatial position information of the sound
source implied in it, and may include collectable sound wave information and/or ultrasonic
wave information.
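The description does not say how the spatial position of a sound source is recovered. One widely used approach, shown here purely as an assumed illustration, estimates the time difference of arrival between two microphones by cross-correlation and converts it to a bearing under a far-field assumption. The function names, the two-microphone setup, and the default speed of sound are assumptions of this sketch.

```python
import math
from typing import Sequence


def best_lag(a: Sequence[float], b: Sequence[float], max_lag: int) -> int:
    """Return the lag (in samples) by which signal b trails signal a.

    The lag is chosen to maximize the cross-correlation of the two signals.
    """
    def correlation(lag: int) -> float:
        return sum(a[i] * b[i + lag]
                   for i in range(len(a)) if 0 <= i + lag < len(b))

    return max(range(-max_lag, max_lag + 1), key=correlation)


def bearing_rad(lag: int, sample_rate_hz: float, mic_distance_m: float,
                speed_of_sound: float = 343.0) -> float:
    """Convert a sample lag into an arrival angle (far-field assumption).

    A positive angle means the sound reached microphone a first.
    """
    delay_s = lag / sample_rate_hz
    ratio = speed_of_sound * delay_s / mic_distance_m
    # Clamp to the valid domain of asin to absorb measurement noise.
    return math.asin(max(-1.0, min(1.0, ratio)))
```

With more than two microphones, several such pairwise bearings could be combined to estimate a position rather than a single direction.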
In the embodiments of the present invention, after the first point cloud information and the
second point cloud information are fused to obtain the target point cloud information, and
before the visual information is obtained, the above method further includes the following
operation:
Process the target point cloud information to obtain second markup information;
In this case, the visual information may optionally be obtained according to the target point
cloud information in the following manner:
Obtain the visual information according to the second markup information and the target point
cloud information.
In the embodiments of the present invention, the visual information may optionally be obtained
according to the second markup information and the target point cloud information in the
following manner:
Perform geometric vertex position optimization and normal calculation on the target point cloud
information to obtain a first result;
Perform surface fitting and triangular meshing on the first result to obtain a second result;
Obtain the visual information according to the second result.
In the embodiments of the present invention, the presentation information may optionally be
processed to obtain the first markup information in the following manner:
Perform digital image processing and analysis on the presentation information to obtain the
first markup information.
In the embodiments of the present invention, the digital image processing and analysis of the
presentation information may optionally be performed in the following manner:
Perform processing such as segmentation, detection, tracking, and recognition on the
presentation information.
In the embodiments of the present invention, there is no fixed order among the segmentation,
detection, tracking, and recognition steps. For example, the presentation information may
first be segmented and then detected, or first detected and then segmented. To improve the
accuracy of the obtained first markup information, segmentation, detection, tracking, and
recognition may be performed cyclically several times. For example, after one round of
segmentation, detection, tracking, and recognition has been performed, at least one further
round is performed on the basis of the current results, which improves precision.
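The cyclic execution described above can be sketched as a simple loop in which every stage sees the results accumulated so far. This is a schematic outline only: the `annotate` function, the `Stage` signature, and the dictionary used to hold results are assumptions made for illustration, and the actual segmentation, detection, tracking, and recognition algorithms are left to the caller.

```python
from typing import Any, Callable, Dict, List, Tuple

Markup = Dict[str, Any]
# A stage receives the raw frame and the current markup, and returns its result.
Stage = Callable[[Any, Markup], Any]


def annotate(frame: Any, stages: List[Tuple[str, Stage]],
             rounds: int = 2) -> Markup:
    """Run the stages in the given order, repeated for `rounds` rounds.

    From the second round onward, each stage can refine its output using the
    results that all stages produced in the previous round.
    """
    if rounds < 1:
        raise ValueError("rounds must be at least 1")
    markup: Markup = {}
    for _ in range(rounds):
        for name, stage in stages:
            markup[name] = stage(frame, markup)
    return markup
```

Because the stage order is just the order of the `stages` list, "segment then detect" and "detect then segment" are both expressed by reordering that list.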
In the embodiments of the present invention, segmentation may refer to dividing an image into
foreground and background, for example, into sky, ground, or other regions; detection may
refer to detecting pedestrians or license plates; tracking may refer to tracking the arm
motion of a person; and recognition may refer to recognizing vehicles.
In the embodiments of the present invention, the first point cloud information including
geometric information may optionally be obtained according to the first markup information and
the presentation information in the following manner:
Process the presentation information according to the first markup information to obtain
coordinate information of the object corresponding to the presentation information;
Generate, according to the coordinate information, the first point cloud information including
geometric information.
In the embodiments of the present invention, the coordinate information of the object
corresponding to the presentation information may correspond to different coordinate systems
at different moments. In this case, to improve the accuracy of the obtained first point cloud
information, after the coordinate information of the object corresponding to the presentation
information is obtained, the coordinate information of the object under the different local
coordinate systems of different moments may also be fused into the same coordinate system, and
the first point cloud information including geometric information is then generated according
to the coordinate information fused into the same coordinate system.
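Fusing coordinates from several local coordinate systems into one common system amounts to applying a rigid transform (rotation plus translation) to each set of points. The sketch below assumes the pose of every local frame is already known; the description does not state how those poses are obtained, and the function names are hypothetical.

```python
import math
from typing import Iterable, List, Sequence, Tuple

Point = Tuple[float, float, float]
Matrix = Sequence[Sequence[float]]


def rotation_z(angle_rad: float) -> List[List[float]]:
    """3x3 rotation matrix about the z axis (used here only as an example pose)."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def to_common_frame(points: Iterable[Point], rotation: Matrix,
                    translation: Point) -> List[Point]:
    """Map local points into the common frame: p_common = R * p_local + t."""
    return [
        tuple(sum(rotation[i][j] * p[j] for j in range(3)) + translation[i]
              for i in range(3))
        for p in points
    ]


def fuse_frames(frames: Iterable[Tuple[Iterable[Point], Matrix, Point]]
                ) -> List[Point]:
    """Merge point sets captured in different local frames into one list."""
    fused: List[Point] = []
    for points, rotation, translation in frames:
        fused.extend(to_common_frame(points, rotation, translation))
    return fused
```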
In the embodiments of the present invention, the second point cloud information including
texture information may optionally be obtained according to the first markup information and
the presentation information in the following manner:
Extract information from the presentation information according to the first markup
information, in a point-by-point manner and/or by image synthesis, to obtain the second point
cloud information including texture information.
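As an illustration of point-by-point texture extraction, the following sketch projects each 3D point into a color image with a pinhole camera model and attaches the sampled pixel color to the point, which also shows what merging geometric and texture information per point can look like. The pinhole model, the intrinsic parameters `fx`, `fy`, `cx`, `cy`, and the image layout (rows of RGB tuples, assumed non-empty) are assumptions of this sketch and are not specified in the description.

```python
from typing import Iterable, List, Sequence, Tuple

Point = Tuple[float, float, float]
Color = Tuple[int, int, int]
ColoredPoint = Tuple[float, float, float, int, int, int]


def colorize(points: Iterable[Point], image: Sequence[Sequence[Color]],
             fx: float, fy: float, cx: float, cy: float) -> List[ColoredPoint]:
    """Attach a pixel color to every point visible in the image.

    Points are given in the camera coordinate system (z pointing forward).
    Points behind the camera or projecting outside the image are skipped.
    """
    height, width = len(image), len(image[0])
    colored: List[ColoredPoint] = []
    for x, y, z in points:
        if z <= 0.0:
            continue  # behind the camera, cannot be seen
        u = int(round(fx * x / z + cx))  # pixel column
        v = int(round(fy * y / z + cy))  # pixel row
        if 0 <= u < width and 0 <= v < height:
            r, g, b = image[v][u]
            colored.append((x, y, z, r, g, b))
    return colored
```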
In the embodiments of the present invention, the visual information may optionally be obtained
according to the second markup information and the target point cloud information in the
following manner:
Calculate the surface normal information of the object according to the second markup
information and the target point cloud information;
Obtain the visual information according to the surface normal information.
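The description does not fix a method for computing surface normals. A common choice, shown here as an assumed example, fits a plane to a small neighborhood of points and takes the direction of least variance as the normal. The sketch finds that direction by power iteration on `trace(C) * I - C`, where `C` is the covariance matrix of the neighborhood; the function name and the iteration count are illustrative.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def estimate_normal(points: Sequence[Point], iterations: int = 100) -> List[float]:
    """Estimate the unit normal of a roughly planar patch of points.

    The sign of the returned vector is arbitrary. Power iteration can fail to
    converge if the start vector is orthogonal to the true normal or if the
    patch is not close to planar; this sketch does not handle those cases.
    """
    n = len(points)
    mean = [sum(p[i] for p in points) / n for i in range(3)]
    # Covariance matrix of the neighborhood.
    cov = [[sum((p[i] - mean[i]) * (p[j] - mean[j]) for p in points) / n
            for j in range(3)] for i in range(3)]
    trace = cov[0][0] + cov[1][1] + cov[2][2]
    # The largest eigenvector of (trace * I - C) is the smallest of C.
    m = [[(trace if i == j else 0.0) - cov[i][j] for j in range(3)]
         for i in range(3)]
    v = [1.0, 1.0, 1.0]
    for _ in range(iterations):
        w = [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            raise ValueError("degenerate neighborhood: all points coincide")
        v = [x / norm for x in w]
    return v
```

In practice the neighborhood passed to `estimate_normal` would be the nearest neighbors of each point, and the sign would be fixed by orienting normals toward the capturing camera.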
The present invention gives a detailed process for establishing the four-dimensional spacetime
model. Referring to Fig. 2C, the first markup information and the sound field information are
obtained according to the presentation information, and the first point cloud information and
the second point cloud information are obtained according to the presentation information and
the first markup information. The first point cloud information and the second point cloud
information are fused to obtain the target point cloud information. The second markup
information is obtained according to the target point cloud information, and geometric vertex
position optimization and normal calculation are performed on the target point cloud
information to obtain the first result. Surface fitting and triangular meshing are performed
on the first result to obtain the second result, and the visual information is obtained
according to the second result and the second markup information. The visual information and
the sound field information are fused to obtain the spatial model, the spatial models are fused
to obtain a fused spatial model, and the fused spatial model, the first markup information, and
the second markup information are processed to obtain the four-dimensional spacetime model.
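The data flow of Fig. 2C, as described above, can be summarized as an orchestration function in which every processing step is supplied by the caller. The sketch only fixes the order and the inputs of each step; the keys of the `ops` dictionary and the function name `build_spacetime_model` are hypothetical, and no particular algorithm is implied for any step.

```python
from typing import Any, Callable, Dict, Iterable, List, Tuple


def build_spacetime_model(frames: Iterable[Any],
                          ops: Dict[str, Callable[..., Any]]) -> Any:
    """Chain the processing steps for each moment, then fuse over time."""
    spatial_models: List[Any] = []
    markups: List[Tuple[Any, Any]] = []
    for presentation in frames:
        markup1 = ops["annotate_images"](presentation)        # first markup
        sound = ops["compute_sound_field"](presentation)      # sound field
        geometry = ops["geometry_cloud"](presentation, markup1)
        texture = ops["texture_cloud"](presentation, markup1)
        target = ops["fuse_clouds"](geometry, texture)        # target point cloud
        markup2 = ops["annotate_cloud"](target)               # second markup
        first = ops["optimize_and_normals"](target)           # first result
        second = ops["fit_and_mesh"](first)                   # second result
        visual = ops["visual_info"](second, markup2)
        spatial_models.append(ops["fuse_visual_and_sound"](visual, sound))
        markups.append((markup1, markup2))
    fused = ops["fuse_over_time"](spatial_models)
    return ops["assemble"](fused, markups)
```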
Refering to shown in Fig. 2 D, the embodiment of the present invention also proposes a kind of multimedia messages device that processes, bag
Include:
Acquiring unit 2100, is used for obtaining presentation information, and described presentation information includes with the naked eye seeing
The electromagnetic field spectral information for characterizing object that is that observe and/or that can collect with equipment;
Unit 2200 set up by model, for setting up for characterizing described according to the presentation information got
The four-dimensional spacetime model of presentation information, described four-dimensional spacetime model has and can characterize in digitized form
The attribute of time dependent described presentation information;
Processing unit 2300, for carrying out coded treatment by the four-dimensional spacetime model of foundation;
Transmitting element 2400, for being transmitted the four-dimensional spacetime model after encoded process.
In the embodiments of the present invention, the electromagnetic field spectral information
obtained by the acquiring unit 2100 may be emitted by the object, reflected by the object, or
refracted by the object; no specific limitation is imposed here.
In the embodiments of the present invention, the electromagnetic field spectral information
obtained by the acquiring unit 2100 may include at least one of radio wave information,
infrared information, visible light information, ultraviolet information, X-ray information,
and gamma ray information, where the visible light information may include laser light.
In the embodiments of the present invention, the object corresponding to the presentation
information may be an indoor and/or outdoor object of any field-of-view size and at any angle.
In the embodiments of the present invention, when the acquiring unit 2100 obtains the
presentation information, it obtains 24 to 120 frames per second.
In the embodiments of the present invention, the presentation information obtained by the
acquiring unit 2100 may be presentation information obtained at different spatial points and
different time points.
In the embodiments of the present invention, the four-dimensional spacetime model includes, in
content, at least the following attributes:
Spatial position attribute: may refer to the coordinates of every point on an object at any
moment, in a coordinate system that does not change over time;
Appearance attribute: may refer to the texture and spectral features (for example, color) of
the object surface at any moment, and the geometric properties of the object surface (for
example, normal direction, curvature, smoothness, etc.);
Sound attribute;
Motion attribute: may refer to the velocity vector and acceleration vector of every point on
the object at any moment; or may refer to the angular velocity vector and angular acceleration
vector of every part of the object that can be regarded as a rigid body;
Other attributes: may refer to at least one of the category, identity, material, mutual
relations, etc. of the object, that is, any information that can be inferred from the
presentation information and from the change of the presentation information over time.
In form, the four-dimensional spacetime model exists in a storage medium as digitized data;
in this form it can be stored, presented, retrieved, edited, transmitted, encrypted, and used
in more advanced intelligent applications.
In the embodiments of the present invention, after the model establishing unit 2200
establishes the four-dimensional spacetime model, it may further modify, enhance, and optimize
the four-dimensional spacetime model.
In practical applications, in addition to the electromagnetic field spectral information for
characterizing an object that can be observed with the naked eye and/or collected with a
device, the presentation information may also include sound field information. In this case,
in the embodiments of the present invention, the device may further include a sound field
information computing unit 2500, configured to calculate, according to the presentation
information, the sound field information of the object corresponding to the presentation
information, where the presentation information further includes sound field information that
can be perceived by the ear and/or collected with a device;
When the model establishing unit 2200 establishes, according to the presentation information,
the four-dimensional spacetime model for characterizing the presentation information, it
specifically does the following:
Establish, according to the presentation information and the sound field information, a
four-dimensional spacetime model for characterizing the presentation information and the sound
field information.
The sound field information described in the embodiments of the present invention refers not
only to the audio information itself but also to the spatial position information of the sound
source implied in it, and may include collectable sound wave information and/or ultrasonic
wave information.
In the embodiments of the present invention, optionally, the model establishing unit 2200
includes a first markup information generating unit 2200A, a point cloud information
generating unit 2200B, a point cloud information fusion unit 2200C, a visual information
generating unit 2200D, and a four-dimensional spacetime model generating unit 2200E, where:
The first markup information generating unit 2200A is configured to process the presentation
information to obtain first markup information;
The point cloud information generating unit 2200B is configured to obtain, according to the
first markup information and the presentation information, first point cloud information
including geometric information and second point cloud information including texture
information;
The point cloud information fusion unit 2200C is configured to fuse the first point cloud
information and the second point cloud information to obtain target point cloud information;
The visual information generating unit 2200D is configured to obtain visual information
according to the target point cloud information;
The four-dimensional spacetime model generating unit 2200E is configured to obtain a spatial
model according to the visual information, fuse the spatial models of different moments, and
obtain the four-dimensional spacetime model according to the fused spatial model, the first
markup information, and the second markup information.
In the embodiments of the present invention, the device further includes a sound field
information computing unit 2500, configured to calculate, according to the presentation
information, the sound field information of the object corresponding to the presentation
information, where the presentation information further includes sound field information that
can be perceived by the ear and/or collected with a device;
When the four-dimensional spacetime model generating unit 2200E obtains the spatial model
according to the visual information, it may optionally do so in the following manner:
Fuse the visual information and the sound field information to obtain the spatial model.
In the embodiments of the present invention, optionally, the point cloud information generating
unit 2200B is further configured to process the target point cloud information to obtain
second markup information;
When the visual information generating unit 2200D obtains the visual information according to
the target point cloud information, it may optionally do so in the following manner:
Obtain the visual information according to the second markup information and the target point
cloud information.
In the embodiments of the present invention, the visual information generating unit 2200D is
further configured to:
Perform geometric vertex position optimization and normal calculation on the target point cloud
information to obtain a first result;
Perform surface fitting and triangular meshing on the first result to obtain a second result;
Obtain the visual information according to the second result.
In the embodiments of the present invention, when the first markup information generating unit
2200A processes the presentation information to obtain the first markup information, it may
optionally do so in the following manner:
Perform digital image processing and analysis on the presentation information to obtain the
first markup information.
In the embodiments of the present invention, when the first markup information generating unit
2200A performs digital image processing and analysis on the presentation information, it
performs processing such as segmentation, detection, tracking, and recognition on the
presentation information.
In the embodiments of the present invention, there is no fixed order among the segmentation,
detection, tracking, and recognition steps. For example, the presentation information may
first be segmented and then detected, or first detected and then segmented. To improve the
accuracy of the obtained first markup information, segmentation, detection, tracking, and
recognition may be performed cyclically several times. For example, after one round of
segmentation, detection, tracking, and recognition has been performed, at least one further
round is performed on the basis of the current results, which improves precision.
In the embodiments of the present invention, segmentation may refer to dividing an image into
foreground and background, for example, into sky, ground, or other regions; detection may
refer to detecting pedestrians or license plates; tracking may refer to tracking the arm
motion of a person; and recognition may refer to recognizing vehicles.
In the embodiments of the present invention, when the point cloud information generating unit
2200B obtains the first point cloud information including geometric information according to
the first markup information and the presentation information, it may optionally do so in the
following manner:
Process the presentation information according to the first markup information to obtain
coordinate information of the object corresponding to the presentation information;
Generate, according to the coordinate information, the first point cloud information including
the geometric information.
In the embodiments of the present invention, the coordinate information of the object
corresponding to the presentation information may correspond to different coordinate systems
at different moments. In this case, to improve the accuracy of the obtained first point cloud
information, after the coordinate information of the object corresponding to the presentation
information is obtained, the point cloud information generating unit 2200B may also fuse the
coordinate information of the object under the different local coordinate systems of different
moments into the same coordinate system, and then generate the first point cloud information
including geometric information according to the coordinate information fused into the same
coordinate system.
In the embodiments of the present invention, optionally, when the point cloud information
generating unit 2200B obtains the second point cloud information including texture information
according to the first markup information and the presentation information, it may do so in
the following manner:
Extract information from the presentation information according to the first markup
information, in a point-by-point manner and/or by image synthesis, to obtain the second point
cloud information including texture information.
In the embodiments of the present invention, when the visual information generating unit 2200D
obtains the visual information according to the second markup information and the target point
cloud information, it may do so in the following manner:
Calculate the surface normal information of the object according to the second markup
information and the target point cloud information;
Obtain the visual information according to the surface normal information.
In the embodiments of the present invention, after encoding the established four-dimensional
spacetime model, the processing unit 2300 further compresses the encoded four-dimensional
spacetime model, and the transmitting unit 2400 transmits the compressed four-dimensional
spacetime model.
Further, to improve the security of transmission, before the transmitting unit 2400 sends the
encoded four-dimensional spacetime model, the processing unit 2300 may encrypt the encoded
four-dimensional spacetime model; or, before the compressed four-dimensional spacetime model
is sent, the compressed four-dimensional spacetime model may be encrypted.
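The encode, compress, and optionally encrypt sequence described above can be sketched with the Python standard library. JSON encoding, zlib compression, and the 4-byte length prefix are choices made for this sketch only; the description does not name any codec or framing. Encryption is represented by a caller-supplied `encrypt` hook, since a real cipher should come from a vetted cryptographic library rather than be written by hand.

```python
import json
import struct
import zlib
from typing import Any, Callable, Optional

Transform = Callable[[bytes], bytes]


def pack_for_transmission(model: Any,
                          encrypt: Optional[Transform] = None) -> bytes:
    """Encode, compress, and optionally encrypt a model, then frame it."""
    encoded = json.dumps(model, sort_keys=True).encode("utf-8")  # encode
    payload = zlib.compress(encoded)                              # compress
    if encrypt is not None:
        payload = encrypt(payload)                                # encrypt
    # Prefix the payload with its length so a receiver can delimit messages.
    return struct.pack(">I", len(payload)) + payload


def unpack_received(message: bytes,
                    decrypt: Optional[Transform] = None) -> Any:
    """Reverse pack_for_transmission on the receiving side."""
    (length,) = struct.unpack(">I", message[:4])
    payload = message[4:4 + length]
    if len(payload) != length:
        raise ValueError("truncated message")
    if decrypt is not None:
        payload = decrypt(payload)
    return json.loads(zlib.decompress(payload).decode("utf-8"))
```

Compressing before encrypting, as in the second alternative above, matters in practice because well-encrypted data is close to incompressible.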
In the embodiments of the present invention, the acquiring unit 2100 may have any one of a
cylindrical, cuboid, prismatic, ring, spherical, or hemispherical shape, and includes at least
one camera, which may be a color camera, a depth camera, or an infrared camera.
Further, the acquiring unit 2100 may also include at least one microphone, as shown in
Figs. 2E and 2F, where Fig. 2G is a top view of Fig. 2E or Fig. 2F, and Fig. 2H is a side view
of Fig. 2E or Fig. 2F.
Optionally, the acquiring unit 2100 includes 8 pairs of color cameras and 8 microphones, where:
the top has 1 pair of color cameras, each with a viewing angle of 180 degrees; the side has
6 pairs of color cameras, each with a viewing angle of 70 degrees; the bottom has 1 pair of
color cameras, each with a viewing angle of 180 degrees; and there is 1 microphone between the
two cameras of each pair.
Optionally, the acquiring unit 2100 may also take the following form:
The top has 1 color camera or 1 pair of color cameras, with a viewing angle of 45 to
180 degrees; the side has 2 or 8 pairs of color cameras, each with a viewing angle of 45 to
180 degrees; the bottom has 1 color camera or 1 pair of color cameras, with a viewing angle of
45 to 180 degrees; there is 1 microphone in total, or 1 microphone between the two cameras of
each pair; optionally, the number of microphones is between 1 and 8.
In the embodiments of the present invention, optionally, the camera at the top may also be one
or any combination of a stereo camera, a multi-focal-length camera, a structured light camera,
a time-of-flight (ToF) camera, and a light field camera group.
In the embodiments of the present invention, optionally, the camera at the side may be one or
any combination of a stereo camera, a multi-focal-length camera, a structured light camera, a
time-of-flight (ToF) camera, and a light field camera group.
For example, the acquiring unit 2100 is cylindrical, with six pairs of binocular cameras on
the side surface of the cylinder, each camera having a field of view of 70 degrees; the top
and bottom surfaces of the cylinder each have one pair of binocular cameras, each camera
having a field of view of 180 degrees, so that stereoscopic panoramic field-of-view coverage
can be achieved. All cameras have been calibrated in advance to obtain their parameter
matrices. The acquiring unit 2100 may have eight built-in microphones.
In the embodiments of the present invention, a color camera may consist of an optical lens, an
image sensor, and an ISP (Image Signal Processing Unit, an image signal processing chip).
A VPU (Vision Processing Unit, a vision processor) may include the model establishing unit
2200 and the processing unit 2300, where the cameras may be connected to the VPU chip through
MIPI (Mobile Industry Processor Interface). One VPU chip processes the data transmitted from
two pairs of cameras, so that one such cylinder contains four VPU chips.
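The figures quoted for the cylindrical example (six side pairs at 70 degrees, eight pairs in total, two pairs per VPU) can be checked with simple arithmetic. The sketch below assumes the side pairs are evenly spaced around the cylinder, which the description does not state explicitly; the function names are illustrative.

```python
import math


def seam_overlap_deg(num_views: int, fov_deg: float) -> float:
    """Angular overlap between neighboring views spaced evenly over 360 degrees.

    A negative result would mean there is a gap between neighboring views.
    """
    return fov_deg - 360.0 / num_views


def vpu_count(camera_pairs: int, pairs_per_vpu: int) -> int:
    """Number of VPU chips needed when each chip serves a fixed number of pairs."""
    return math.ceil(camera_pairs / pairs_per_vpu)


# Six side views of 70 degrees each cover 420 degrees in total, leaving an
# overlap at every seam under the even-spacing assumption.
side_overlap = seam_overlap_deg(6, 70.0)
# Eight camera pairs (1 top, 6 side, 1 bottom) at two pairs per VPU.
chips = vpu_count(8, 2)
```

Under these assumptions the overlap at each seam is 10 degrees and four VPU chips are needed, which matches the count given above.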
In the embodiments of the present invention, the model establishing unit 2200 may include a
processor, a graphics card, memory, video memory, flash memory, a hard disk, wireless
transmission, wired transmission, and multiple bus interface chips.
Scenarios to which the embodiments of the present invention are applicable are given below.
The scenario shown in Fig. 3A is: Party A is in a first scene, Party B is in a second scene,
and through the method provided by the embodiments of the present invention, Party A and
Party A's surroundings are "remotely presented" in front of Party B in real time, and Party B
can interact with Party A.
Further, the device for processing multimedia information may also first store the
four-dimensional spacetime model in a storage device; a device held by Party B that can
receive and process the four-dimensional spacetime model can obtain the four-dimensional
spacetime model from the storage device, as shown in Fig. 3B. In this case, the scene that
Party B "sees" can be the same as the scene seen in the case shown in Fig. 3A.
When the device for processing multimedia information stores the four-dimensional spacetime
model in the storage device, Party A may also hold a device that can receive and process the
four-dimensional spacetime model, obtain the four-dimensional spacetime model from the storage
device, and perceive the first scene in which Party A was located at a past point in time, as
shown in Fig. 3C.
Scene shown in Fig. 3 D is: first is in the first scene, and second is in the second scene, and first and second are passed through
Present example one makes the surrounding of first and first " remotely present " in face of second in real time, and second can
To carry out interaction with first;First and second realize two-way " remotely reality " in real time by present example one and " mix
Close reality ", what first was perceived is the first scene and the superposition of second at first place, and what second was perceived is
First and first scene at first place;It should be noted that first and second are all right for the scene of wanted perception
Having multiple choices, both sides can select the first scene seeing first place can also select to see second place
Second scene, or see the 3rd scene at other side place;First and second it can be seen that same true or
Person's virtual scene is it can also be seen that different true or virtual scene.
Scene shown in Fig. 3 E is: first realizes telecommuting by embodiment provided by the present invention.
Scene shown in Fig. 3 F is: first and second can realize sense by embodiment provided by the present invention
To virtual environment, further, it is also possible to realize interaction, as " on the spot in person ".
The methods and devices provided herein are not inherently related to any particular computer,
virtual system, or other equipment. Various general-purpose systems may also be used with the
teachings herein. From the description above, the structure required to construct such a
device is apparent. Moreover, the present invention is not directed to any particular
programming language. It should be understood that the content of the invention described
herein may be implemented in various programming languages, and that the above description of
a specific language is given in order to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is to be
understood, however, that embodiments of the invention may be practiced without these specific
details. In some instances, well-known methods, structures, and techniques have not been shown
in detail so as not to obscure the understanding of this description.
Similarly, it should be understood that, in order to streamline the disclosure and aid the
understanding of one or more of the various inventive aspects, in the above description of
exemplary embodiments of the invention, features of the invention are sometimes grouped
together in a single embodiment, figure, or description thereof. However, this method of
disclosure should not be interpreted as reflecting an intention that the claimed invention
requires more features than are expressly recited in each claim. Rather, as the following
claims reflect, inventive aspects lie in fewer than all features of a single embodiment
disclosed above. Therefore, the claims following the detailed description are hereby expressly
incorporated into this detailed description, with each claim standing on its own as a separate
embodiment of the invention.
Those skilled in the art will appreciate that the modules in the device of an embodiment may
be adaptively changed and arranged in one or more devices different from that embodiment.
Several modules in an embodiment may be combined into one module, unit, or component, and may
furthermore be divided into a plurality of sub-modules, sub-units, or sub-components. Except
where at least some of such features and/or processes or modules are mutually exclusive, all
features disclosed in this specification (including the accompanying claims, abstract, and
drawings), and all processes or units of any method or device so disclosed, may be combined in
any combination. Unless expressly stated otherwise, each feature disclosed in this
specification (including the accompanying claims, abstract, and drawings) may be replaced by
an alternative feature serving the same, equivalent, or similar purpose.
Furthermore, those skilled in the art will appreciate that although some embodiments described
herein include some features included in other embodiments but not others, combinations of
features of different embodiments are meant to be within the scope of the invention and to
form different embodiments. For example, in the claims, any of the claimed embodiments may be
used in any combination.
The various device embodiments of the present invention may be implemented in hardware, in
software modules running on one or more processors, or in a combination thereof. Those skilled
in the art will understand that a microprocessor or a digital signal processor (DSP) may be
used in practice to implement some or all of the functions of some or all of the modules in a
device according to the embodiments of the present invention. The present invention may also
be implemented as a device program (for example, a computer program and a computer program
product) for performing part or all of the method described herein. Such a program
implementing the present invention may be stored on a computer-readable medium, or may take
the form of one or more signals. Such signals may be downloaded from an Internet website,
provided on a carrier signal, or provided in any other form.
It should be noted that the above embodiments illustrate rather than limit the present
invention, and that those skilled in the art can design alternative embodiments without
departing from the scope of the appended claims. In the claims, any reference signs placed
between parentheses shall not be construed as limiting the claim. The word "comprising" does
not exclude the presence of elements or steps not listed in a claim. The word "a" or "an"
preceding an element does not exclude the presence of a plurality of such elements. The
present invention may be implemented by means of hardware comprising several distinct
elements, and by means of a suitably programmed computer. In a unit claim enumerating several
devices, several of these devices may be embodied by one and the same item of hardware. The
use of the words first, second, and third does not indicate any order. These words may be
interpreted as names.