CN102572217A - Visual-attention-based multimedia processing method and device

Info

Publication number: CN102572217A
Application number: CN2011104538310A
Authority: CN
Inventors: 王荣泽
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huang Zhenqiang
Priority date: 2011-12-29
Filing date: 2011-12-29
Publication date: 2012-07-11
Anticipated expiration: 2031-12-29
Also published as: CN102572217B

Abstract

The invention discloses a visual-attention-based multimedia processing method and device, and relates to the technical field of multimedia processing. The multimedia display is controlled by determining the user's sight line focus, without affecting the user's experience. The method comprises: detecting the sight line focal position of a viewer on a display screen; acquiring a sight line associated region corresponding to that sight line focal position; and performing video enhancement processing on the video image corresponding to the sight line associated region. The embodiments of the invention are mainly applied to multimedia processing.

Description

Multimedia processing method and device based on visual attention

Technical field

The present invention relates to the field of multimedia processing technology, and in particular to a multimedia processing method and device based on visual attention.

Background

As users demand an ever better audio and video experience, audio and video processing relies more and more on the user's intention. At present, audio and video are processed according to a manually configured processing scheme: a background program processes the audio/video file according to that scheme, and the processed file is then displayed. With this approach, the result meets the user's intention only if the configured processing scheme is complete and accurate.

Summary of the invention

Embodiments of the invention provide a multimedia processing method and device based on visual attention, which control the multimedia display by determining the user's sight line focus, without affecting the user's experience.

To achieve the above object, the embodiments of the invention adopt the following technical solutions:

A multimedia processing method based on visual attention comprises:

detecting the sight line focal position of a viewer on the display screen;

obtaining, according to the sight line focal position, the sight line associated region corresponding to the sight line focal position; and

performing video enhancement processing on the video image corresponding to the sight line associated region.

A multimedia processing device based on visual attention comprises:

a detecting unit, configured to detect the sight line focal position of a viewer on the display screen;

an acquiring unit, configured to obtain, according to the sight line focal position, the sight line associated region corresponding to the sight line focal position; and

an adjustment unit, configured to perform video enhancement processing on the video image corresponding to the sight line associated region.

The multimedia processing method and device based on visual attention provided by the embodiments of the invention obtain the viewer's sight line focal position, determine the region the viewer is watching from the sight line associated region derived from that position, and then adjust the sight line associated region directly to meet the user's expectations. The multimedia display is thus controlled by determining the user's sight line focus, without affecting the user's experience.

Brief description of the drawings

To describe the technical solutions of the embodiments of the invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below are obviously only some embodiments of the invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.

Fig. 1 is a flow chart of a multimedia processing method based on visual attention in Embodiment 1 of the invention;

Fig. 2 is a flow chart of a multimedia processing method based on visual attention in Embodiment 2 of the invention;

Fig. 3 is a block diagram of a multimedia processing device based on visual attention in Embodiment 3 of the invention;

Fig. 4 is a block diagram of another multimedia processing device based on visual attention in Embodiment 3 of the invention;

Fig. 5 is a block diagram of another multimedia processing device based on visual attention in Embodiment 3 of the invention;

Fig. 6 is a block diagram of another multimedia processing device based on visual attention in Embodiment 3 of the invention;

Fig. 7 is a block diagram of another multimedia processing device based on visual attention in Embodiment 3 of the invention.

Detailed description of the embodiments

The technical solutions in the embodiments of the invention are described clearly and completely below with reference to the drawings. The described embodiments are obviously only some, not all, of the embodiments of the invention. All other embodiments obtained by a person of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the invention.

Embodiment 1

This embodiment of the invention provides a multimedia processing method based on visual attention. As shown in Fig. 1, the method comprises:

101. Detect the sight line focal position of a viewer on the display screen.

The viewer's sight line focal position can be detected by the pupil-corneal reflection vector method, implemented as follows:

The face is illuminated with an auxiliary infrared light source, which forms a reflection on the corneal surface of the eye; this reflection is called the Purkinje spot. When the eye gazes at different positions on the screen, the eyeball rotates accordingly. Assume the viewer does not move. Because the position of the infrared light-emitting diode is fixed and the eyeball is approximately spherical, the absolute position of the Purkinje spot can be considered constant when the eyeball rotates, while the positions of the iris and pupil change accordingly. The relative position of the Purkinje spot with respect to the pupil and iris therefore changes as well. This relative position can be determined by image processing; the direction of the sight line is then derived from it, and the sight line focal position is obtained from that direction.
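The last step of the pupil-corneal reflection method, going from the relative position of pupil and Purkinje spot to a point on the screen, can be sketched as follows. This is only an illustration: the patent does not specify the mapping, so the affine model and its coefficients (which would normally come from a per-viewer calibration) are assumptions, as are the function names.

```python
from typing import Sequence, Tuple

Point = Tuple[float, float]


def pupil_glint_vector(pupil_center: Point, glint_center: Point) -> Point:
    """Vector from the Purkinje spot (glint) to the pupil centre, in camera pixels."""
    return (pupil_center[0] - glint_center[0], pupil_center[1] - glint_center[1])


def vector_to_screen(vector: Point,
                     coeffs_x: Sequence[float],
                     coeffs_y: Sequence[float]) -> Point:
    """Map a pupil-glint vector to screen coordinates.

    Assumed affine model (not from the patent):
        screen_x = a0 + a1 * vx + a2 * vy
        screen_y = b0 + b1 * vx + b2 * vy
    The coefficients are hypothetical calibration results.
    """
    vx, vy = vector
    screen_x = coeffs_x[0] + coeffs_x[1] * vx + coeffs_x[2] * vy
    screen_y = coeffs_y[0] + coeffs_y[1] * vx + coeffs_y[2] * vy
    return (screen_x, screen_y)


# Example: pupil slightly right of and above the glint.
v = pupil_glint_vector((105.0, 98.0), (100.0, 100.0))
focus = vector_to_screen(v, (960.0, 40.0, 0.0), (540.0, 0.0, 50.0))
```

In practice a higher-order polynomial is often fitted instead of an affine map; the affine form is used here only to keep the sketch short.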

Based on the above method of detecting a viewer's sight line focal position, detecting the sight line focal positions of viewers on the display screen specifically comprises: using the pupil-corneal reflection vector method to detect the sight line focal positions of a plurality of viewers, and obtaining all sight line focal positions that lie within the display screen.

The sight line focal position of a viewer on the display screen can also be detected in other ways, which the embodiments of the invention do not limit. Such other ways are techniques well known to those skilled in the art and are not described further here.

102. According to the sight line focal position, obtain the sight line associated region corresponding to the sight line focal position.

Obtaining the sight line associated region corresponding to the sight line focal position can be implemented in, but is not limited to, the following manner:

Obtain a sight line central area according to the sight line focal position, the sight line central area being a region centered on the sight line focal position; then generate the sight line associated region according to the sight line central area and a preset region association relation.

The size of the sight line central area is set in advance. For example, it can be set to a region centered on the sight line focal position whose width is 1/9 of the screen width and whose height is 1/5 of the screen height. The user can also set the size according to actual needs; the embodiments of the invention do not limit this.
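The sizing rule for the sight line central area can be sketched as below, using the 1/9 width and 1/5 height fractions given above as defaults. Clamping the rectangle so that it stays on the screen is an added assumption; the patent does not say what happens when the focal position is near a screen edge. The `Rect` type and function name are hypothetical.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    left: int
    top: int
    width: int
    height: int


def central_area(focus_x: int, focus_y: int,
                 screen_w: int, screen_h: int,
                 w_frac: float = 1 / 9, h_frac: float = 1 / 5) -> Rect:
    """Rectangle centred on the focal position, sized as a fraction of the screen."""
    width = max(1, round(screen_w * w_frac))
    height = max(1, round(screen_h * h_frac))
    left = focus_x - width // 2
    top = focus_y - height // 2
    # Assumption: shift the rectangle back onto the screen near the edges.
    left = min(max(left, 0), screen_w - width)
    top = min(max(top, 0), screen_h - height)
    return Rect(left, top, width, height)


area = central_area(960, 540, 1920, 1080)
```

For a 1920x1080 screen and a focal position at the screen centre, this gives a 213x216 pixel rectangle around (960, 540).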

The preset region association relation is any one of the following: the image pixels of the sight line associated region are identical or similar to those of the sight line central area; the image content of the sight line associated region is similar to that of the sight line central area; the image shape of the sight line associated region is identical or similar to that of the sight line central area; or the text in the sight line associated region belongs to the same paragraph as the text in the sight line central area. The user can choose one or more region association relations according to actual needs; the embodiments of the invention do not limit this.

103. Perform video enhancement processing on the video image corresponding to the sight line associated region.

As an example, the video enhancement processing of the video image corresponding to the sight line associated region can be implemented by the following two methods:

First method: perform image enhancement processing on the image information in the sight line associated region.

Performing image enhancement processing on the video image corresponding to the sight line associated region specifically comprises applying processing such as sharpening to the part of the video content to be displayed that lies within the sight line associated region, so that this content appears clearer when shown on the display screen.
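As a rough illustration of sharpening only inside the sight line associated region, the sketch below applies a common 3x3 sharpening kernel to a grayscale image stored as a list of rows. The kernel choice, the grayscale representation and the function name are assumptions; the patent only says "processing such as sharpening".

```python
from typing import List, Tuple

Image = List[List[int]]
Region = Tuple[int, int, int, int]  # (left, top, width, height)


def sharpen_region(img: Image, region: Region) -> Image:
    """Sharpen pixels inside `region`; pixels outside it are copied unchanged.

    Uses the kernel [[0,-1,0],[-1,5,-1],[0,-1,0]]. Image border pixels are
    skipped because they lack a full 4-neighbourhood.
    """
    left, top, width, height = region
    rows, cols = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(max(top, 1), min(top + height, rows - 1)):
        for x in range(max(left, 1), min(left + width, cols - 1)):
            value = (5 * img[y][x]
                     - img[y - 1][x] - img[y + 1][x]
                     - img[y][x - 1] - img[y][x + 1])
            out[y][x] = min(255, max(0, value))  # clamp to the 8-bit range
    return out


frame = [[100, 100, 100],
         [100, 120, 100],
         [100, 100, 100]]
sharpened = sharpen_region(frame, (0, 0, 3, 3))
```

The slightly brighter centre pixel is pushed further from its neighbours, which is the local contrast boost that makes edges look crisper.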

Second method: perform video codec enhancement processing on the video information in the sight line associated region.

Performing video codec enhancement processing on the video information in the sight line associated region specifically comprises:

At the video encoding end, when the video file is encoded, more bit rate and computing resources are allocated to encoding the image inside the sight line associated region, and less bit rate and computing resources are allocated to encoding the image outside it.
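One common way to give a region more bit rate is to lower the quantization parameter (QP) of the blocks that cover it. The sketch below builds such a per-block QP map. It is an assumed mechanism, not one named in the patent, and the block size, base QP and QP offset are hypothetical values; how the map is handed to an encoder depends on the encoder and is not shown.

```python
from typing import List, Tuple

Region = Tuple[int, int, int, int]  # (left, top, width, height) in pixels


def qp_map(block_cols: int, block_rows: int, roi: Region,
           base_qp: int = 32, roi_delta: int = -6,
           block_size: int = 16) -> List[List[int]]:
    """Per-block QP values: a lower QP (finer quantization, more bits) in the ROI."""
    left, top, width, height = roi
    right, bottom = left + width, top + height
    rows = []
    for r in range(block_rows):
        row = []
        for c in range(block_cols):
            x0, y0 = c * block_size, r * block_size
            overlaps = (x0 < right and x0 + block_size > left and
                        y0 < bottom and y0 + block_size > top)
            row.append(base_qp + roi_delta if overlaps else base_qp)
        rows.append(row)
    return rows


qps = qp_map(4, 2, (16, 0, 16, 16))
```

Only the block that overlaps the region gets the reduced QP; every other block keeps the base value.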

At the video decoding end, when the video file is decoded, the encoded video file is decoded in combination with bilateral filtering.
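Bilateral filtering smooths an image while preserving edges, because each neighbour is weighted both by its spatial distance and by how close its intensity is to the centre pixel. A minimal pure-Python sketch on a grayscale image follows; the parameter values are illustrative assumptions, and the patent does not say at which point of the decoding pipeline the filter is applied.

```python
import math
from typing import List


def bilateral_filter(img: List[List[float]], radius: int = 1,
                     sigma_space: float = 1.0,
                     sigma_range: float = 25.0) -> List[List[float]]:
    """Edge-preserving smoothing of a grayscale image given as a list of rows."""
    rows, cols = len(img), len(img[0])
    out = [[0.0] * cols for _ in range(rows)]
    for y in range(rows):
        for x in range(cols):
            centre = img[y][x]
            total, norm = 0.0, 0.0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    ny, nx = y + dy, x + dx
                    if 0 <= ny < rows and 0 <= nx < cols:
                        value = img[ny][nx]
                        # Spatial weight: nearer pixels count more.
                        w_s = math.exp(-(dx * dx + dy * dy) / (2 * sigma_space ** 2))
                        # Range weight: similar intensities count more.
                        w_r = math.exp(-((value - centre) ** 2) / (2 * sigma_range ** 2))
                        total += w_s * w_r * value
                        norm += w_s * w_r
            out[y][x] = total / norm
    return out


edge = [[0, 0, 200, 200] for _ in range(3)]
smoothed = bilateral_filter(edge)
```

On the step-edge example, pixels on each side of the edge stay close to their original values, because pixels across the edge receive a near-zero range weight.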

It can be understood that the video enhancement processing of the video image corresponding to the sight line associated region may differ between application scenarios. For example, when the associated region contains not only video image content but also a text region, the text can be extracted by OCR (Optical Character Recognition), video enhancement processing can be applied to the image that remains after the text is extracted, and the enhanced image can then be superimposed with the recognized text to reconstruct the video image corresponding to the sight line associated region. Other implementations are also possible, and other implementations that a person of ordinary skill in the art can conceive of also fall within the protection scope of the embodiments of the invention.

In addition, it should be noted that after video enhancement processing has been performed on the video image corresponding to the sight line associated region, steps 101 to 103 can be executed repeatedly to improve the user's experience, so that the display of the audio and video the user is paying attention to is adjusted to the optimum.

The multimedia processing method based on visual attention provided by this embodiment of the invention obtains the viewer's sight line focal position, determines the region the viewer is watching from the sight line associated region derived from that position, and then adjusts the sight line associated region directly to meet the user's expectations. The multimedia display is thus controlled by determining the user's sight line focus, without affecting the user's experience.

Embodiment 2

This embodiment of the invention provides a multimedia processing method based on visual attention. As shown in Fig. 2, the method comprises:

201. Detect the sight line focal position of a viewer on the display screen. When the sight line focal position is the sight line focal position of a single viewer, execute step 202; when the sight line focal positions are a plurality of sight line focal positions corresponding to a plurality of viewers, execute step 203 or step 204.

The detection of the viewer's sight line focal position, as captured by the camera device, is implemented in the same way as described for step 101 and is not repeated here.

202. According to the single viewer's sight line focal position, obtain the sight line associated region corresponding to that position, and execute step 207.

Obtaining the sight line associated region corresponding to the single viewer's sight line focal position is implemented in the same way as described for step 102 and is not repeated here.

203. According to the plurality of sight line focal positions, obtain a plurality of sight line associated regions, one for each sight line focal position; merge the plurality of sight line associated regions to obtain a merged sight line associated region; and execute step 207.

Obtaining the sight line associated region for each of the plurality of sight line focal positions is implemented in the same way as obtaining the sight line associated region for one viewer's sight line focal position, described for step 102, and is not repeated here.

It is worth noting that merging the plurality of sight line associated regions to obtain the merged sight line associated region can be implemented in the following manner:

Combine the plurality of sight line associated regions according to their respective positions to generate a new sight line associated region, which serves as the merged sight line associated region. The merged sight line associated region covers all of the plurality of sight line associated regions.
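One simple reading of the merge in step 203 is the smallest axis-aligned rectangle that covers every individual region. The sketch below implements that reading; the rectangular representation and the function name are assumptions, since the patent only requires that the merged region cover all of the individual regions.

```python
from typing import Sequence, Tuple

Region = Tuple[int, int, int, int]  # (left, top, width, height)


def merge_regions(regions: Sequence[Region]) -> Region:
    """Smallest rectangle that covers all the given regions."""
    if not regions:
        raise ValueError("at least one region is required")
    left = min(r[0] for r in regions)
    top = min(r[1] for r in regions)
    right = max(r[0] + r[2] for r in regions)
    bottom = max(r[1] + r[3] for r in regions)
    return (left, top, right - left, bottom - top)


merged = merge_regions([(0, 0, 10, 10), (20, 5, 10, 10)])
```

A bounding rectangle may also include screen area that nobody is looking at; a union of separate rectangles would avoid that at the cost of a more complex region description.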

204. Obtain the usage rights of the plurality of viewers through face recognition, and determine whether the usage rights of the plurality of viewers are the same. If they differ, execute step 205; if they are the same, execute step 206.

The usage rights of the plurality of viewers can be obtained by combining a face recognition method with the rights settings stored in the database of the multimedia processing system; iris recognition can also be used. The specific implementation is a technique well known to those skilled in the art and is not described in detail here.

205. Obtain the sight line associated region according to the sight line focal position of the viewer with the highest usage rights, take the obtained region as the sight line associated region corresponding to the sight line focal positions of the plurality of viewers, and execute step 207.

206. According to the plurality of sight line focal positions, obtain a plurality of sight line associated regions, one for each sight line focal position. If the plurality of sight line associated regions have an overlap region, determine that the overlap region is the sight line associated region corresponding to the plurality of sight line focal positions. If they have no overlap region, determine that the sight line associated region corresponding to the sight line focal position nearest to the center of the camera picture is the sight line associated region corresponding to the plurality of sight line focal positions. Then execute step 207.
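The decision logic of steps 204 to 206 can be sketched as follows. The numeric encoding of usage rights (larger means higher) and the `camera_offset` field, read here as the distance between a viewer's entry and the centre of the camera picture, are assumptions made for illustration; face recognition itself is out of scope.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Region = Tuple[int, int, int, int]  # (left, top, width, height)


@dataclass
class Viewer:
    rights: int           # assumed encoding: larger value = higher usage rights
    region: Region        # this viewer's sight line associated region
    camera_offset: float  # assumed: distance from the camera picture centre


def intersect(a: Region, b: Region) -> Optional[Region]:
    """Overlap of two rectangles, or None when they do not overlap."""
    left, top = max(a[0], b[0]), max(a[1], b[1])
    right = min(a[0] + a[2], b[0] + b[2])
    bottom = min(a[1] + a[3], b[1] + b[3])
    if right <= left or bottom <= top:
        return None
    return (left, top, right - left, bottom - top)


def select_region(viewers: List[Viewer]) -> Region:
    if not viewers:
        raise ValueError("at least one viewer is required")
    # Step 205: rights differ, so the viewer with the highest rights wins.
    if len({v.rights for v in viewers}) > 1:
        return max(viewers, key=lambda v: v.rights).region
    # Step 206: equal rights, so prefer the region shared by all viewers.
    common: Optional[Region] = viewers[0].region
    for v in viewers[1:]:
        common = intersect(common, v.region)
        if common is None:
            break
    if common is not None:
        return common
    # No overlap: fall back to the entry nearest the camera picture centre.
    return min(viewers, key=lambda v: v.camera_offset).region
```

For example, two viewers with equal rights and regions (0, 0, 10, 10) and (5, 5, 10, 10) yield the overlap (5, 5, 5, 5).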

207. Perform video enhancement processing on the video image corresponding to the sight line associated region.

The video enhancement processing of the video image corresponding to the sight line associated region is the same as described for step 103, and can be applied specifically in the following scenarios:

Optionally, the video enhancement processing of the video image corresponding to the sight line associated region can be a switch between the main picture and an auxiliary picture. For example, in a video conference, the MCU (Multipoint Control Unit) transmits multiple pictures to the terminal, where they are displayed locally. The camera at the terminal detects the viewer's sight line, obtains the viewer's sight line focal position, and obtains the sight line associated region centered on that position. If, within a preset time, the viewer's sight line focal position does not move outside the sight line associated region, the position information of the sight line associated region is sent to the MCU. The MCU compares this position information with the picture layout. If the position lies not in the main picture but in an auxiliary picture, the MCU enlarges that auxiliary picture so that it is displayed as the main picture and raises its sound volume, and shrinks the former main picture so that it is displayed as an auxiliary picture and lowers its sound volume.
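The dwell check and the picture swap in this scenario can be sketched as below. The sample format `(timestamp, x, y)`, the picture identifiers and the function names are hypothetical; the volume change that accompanies the swap is omitted.

```python
from typing import Iterable, List, Sequence, Tuple

Region = Tuple[int, int, int, int]  # (left, top, width, height)


def dwelt_in_region(samples: Iterable[Tuple[float, int, int]],
                    region: Region, dwell_seconds: float) -> bool:
    """True if the gaze stayed inside `region` continuously for `dwell_seconds`.

    `samples` are (timestamp, x, y) tuples in increasing time order.
    """
    left, top, width, height = region
    run_start = None
    for t, x, y in samples:
        inside = left <= x < left + width and top <= y < top + height
        if not inside:
            run_start = None  # the gaze left the region: restart the timer
            continue
        if run_start is None:
            run_start = t
        if t - run_start >= dwell_seconds:
            return True
    return False


def swap_main(main: str, auxiliaries: Sequence[str],
              watched: str) -> Tuple[str, List[str]]:
    """Promote the watched auxiliary picture to main; demote the old main."""
    if watched == main or watched not in auxiliaries:
        return main, list(auxiliaries)
    return watched, [main if a == watched else a for a in auxiliaries]


layout = swap_main("A", ["B", "C"], "B")
```

The former main picture takes the slot of the promoted auxiliary picture, so the rest of the layout is left undisturbed.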

Optionally, the video enhancement processing of the video image corresponding to the sight line associated region can be an enhancement of the image frame rate. For example, the camera detects the viewer's sight line focal position and reports it to the MCU. The MCU takes two consecutively reported sight line focal positions and subtracts their corresponding horizontal and vertical coordinates to obtain the movement of the sight line focal position along the horizontal and vertical axes. This operation is carried out three times. If the three computed movements are the same, the direction in which the viewer's sight line is moving is determined; otherwise detection continues. The subtitle playback speed is then adjusted according to the direction in which the viewer's sight line moves: if the direction is the same as the direction of subtitle movement, the subtitles are playing too fast and the subtitle movement speed needs to be reduced; otherwise the subtitle movement speed needs to be increased. After the subtitle movement speed has been adjusted, detection is performed again and the speed is adjusted again according to the result. Through this continuous detection and adjustment, the adjustment ends once the viewer's sight line position is at the middle of the screen and no longer moves.
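The subtitle adjustment loop can be sketched as follows. Treating "three identical movements" as three consecutive differences with the same sign pattern, computed over four reported positions, is an interpretation; the step size and function names are assumptions.

```python
from typing import List, Optional, Tuple

Point = Tuple[int, int]
Direction = Tuple[int, int]  # sign of movement along x and y: -1, 0 or 1


def _sign(value: int) -> int:
    return (value > 0) - (value < 0)


def movement(prev: Point, curr: Point) -> Direction:
    """Direction of movement between two reported focal positions."""
    return (_sign(curr[0] - prev[0]), _sign(curr[1] - prev[1]))


def stable_direction(positions: List[Point], repeats: int = 3) -> Optional[Direction]:
    """Direction shared by the last `repeats` movements, else None.

    Also returns None when the gaze is stationary, which is the case where
    the adjustment is considered finished.
    """
    if len(positions) < repeats + 1:
        return None
    recent = positions[-(repeats + 1):]
    moves = [movement(a, b) for a, b in zip(recent, recent[1:])]
    if all(m == moves[0] for m in moves) and moves[0] != (0, 0):
        return moves[0]
    return None


def adjust_speed(speed: float, gaze_dir: Optional[Direction],
                 subtitle_dir: Direction, step: float = 0.1) -> float:
    """Slow down when the gaze follows the subtitles, speed up otherwise."""
    if gaze_dir is None:
        return speed
    if gaze_dir == subtitle_dir:
        return max(0.0, speed - step)
    return speed + step


direction = stable_direction([(100, 50), (90, 50), (80, 50), (70, 50)])
```

With subtitles scrolling to the left, `(-1, 0)`, the gaze in the example moves in the same direction, so `adjust_speed` lowers the speed by one step.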

Optionally, the video enhancement processing of the video image corresponding to the sight line associated region can be an enhancement of the audio/video codec resources. For example, the camera detects the viewer's current sight line focal position and the sight line associated region is obtained. The sight line associated region is marked, and its coordinate information is sent to the MCU. According to this information, the MCU enhances the encoding and decoding of the image in the sight line associated region, using a higher pixel count, a wider color gamut and a higher transmission bandwidth to improve the codec result, achieve a more realistic effect and improve the user's visual experience. When the user's sight line moves, the region moves with it, so that the image within the user's sight line remains the better one.

It should be noted that, besides the video enhancement processing described above, the video enhancement processing of the video image corresponding to the sight line associated region can also be carried out by other methods; the embodiments of the invention do not limit this.

In addition, it should be noted that before video enhancement processing is performed on the video image corresponding to the sight line associated region, the relevant information of the sight line associated region can be sent to a remote server, so that the remote server adjusts the audio and video display of the sight line associated region according to that information.

The relevant information of the sight line associated region can be the relevant information of the sight line associated region corresponding to a single viewer's sight line focal position, of the merged sight line associated region, of the sight line associated region corresponding to the sight line focal position of the viewer with the highest usage rights, or of the overlap region. It can specifically include information such as the center coordinates and the boundary size of the sight line associated region. The user can configure and add further information according to actual needs; the embodiments of the invention do not enumerate these one by one.
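As an illustration of what such relevant information might look like on the wire, the sketch below packs the center coordinates and boundary size into a JSON message. The field names, the `source` label and the use of JSON are all hypothetical; the patent does not define a message format.

```python
import json
from typing import Tuple

Region = Tuple[int, int, int, int]  # (left, top, width, height)


def region_message(region: Region, source: str) -> str:
    """Serialize a sight line associated region for the remote server.

    `source` is a hypothetical label saying how the region was obtained,
    e.g. "single", "merged", "highest_rights" or "overlap".
    """
    left, top, width, height = region
    payload = {
        "source": source,
        "center": [left + width / 2, top + height / 2],
        "size": [width, height],
    }
    return json.dumps(payload, sort_keys=True)


message = region_message((854, 432, 213, 216), "merged")
```

The receiving side can rebuild the rectangle from the center and size, so the message stays independent of how the sender stores regions internally.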

The remote server can be an MCU, and the relevant information of the sight line associated region can be sent to the remote server over a communication channel such as an IP network; the embodiments of the invention do not limit this.

Moreover, the multimedia processing method based on visual attention can perform different processing depending on the number of viewers, which improves the utilization of the equipment and the user's experience.

Furthermore, because the relevant information of the viewer's sight line associated region is sent to the remote server, the remote server can process the encoding and decoding at the source end of the multimedia file and provide the user with a better audio and video display, so that the user obtains a better experience.

Embodiment 3

This embodiment of the invention provides a multimedia processing device based on visual attention. As shown in Fig. 3, the device comprises: a receiving unit 31, an acquiring unit 32 and an adjustment unit 33.

The receiving unit 31 is configured to detect the sight line focal position of a viewer on the display screen.

The acquiring unit 32 is configured to obtain, according to the sight line focal position, the sight line associated region corresponding to the sight line focal position.

The adjustment unit 33 is configured to perform video enhancement processing on the video image corresponding to the sight line associated region.

Further, as shown in Fig. 4, the acquiring unit 32 comprises: a first acquisition module 321 and a second acquisition module 322.

The first acquisition module 321 is configured to, when the sight line focal positions are a plurality of sight line focal positions corresponding to a plurality of viewers, obtain a plurality of sight line associated regions, one for each sight line focal position, and merge the plurality of sight line associated regions to obtain a merged sight line associated region.

The second acquisition module 322 is configured to, when the sight line focal positions are a plurality of sight line focal positions corresponding to a plurality of viewers, obtain the usage rights of the plurality of viewers through face recognition, and obtain the sight line associated region corresponding to the sight line focal positions according to the usage rights of the plurality of viewers and the plurality of sight line focal positions.

Further, as shown in Fig. 5, the second acquisition module comprises: a rights determination submodule 3221 and a region determination submodule 3222.

The rights determination submodule 3221 is configured to determine whether the usage rights of the plurality of viewers are the same.

The region determination submodule 3222 is configured to, when the usage rights of the plurality of viewers differ, obtain the sight line associated region according to the sight line focal position of the viewer with the highest usage rights, and take the obtained region as the sight line associated region corresponding to the sight line focal positions of the plurality of viewers.

The region determination submodule 3222 can also be configured to, when the usage rights of the plurality of viewers are the same, obtain a plurality of sight line associated regions, one for each of the plurality of sight line focal positions. If the plurality of sight line associated regions have an overlap region, the overlap region is determined to be the sight line associated region corresponding to the plurality of sight line focal positions. If they have no overlap region, the sight line associated region corresponding to the sight line focal position nearest to the center of the camera picture is determined to be the sight line associated region corresponding to the plurality of sight line focal positions.

Further, as shown in Fig. 6, the adjustment unit 33 also comprises: a first execution module 331 and a second execution module 332.

The first execution module 331 is configured to perform image enhancement processing on the image information in the sight line associated region.

The second execution module 332 is configured to perform video codec enhancement processing on the video information in the sight line associated region.

Further, as shown in Fig. 7, the device also comprises: a sending unit 34.

The sending unit 34 is configured to send the relevant information of the sight line associated region to a remote server, so that the remote server adjusts the audio and video display of the sight line associated region according to that information.

Further, the acquiring unit 32 is also configured to obtain a sight line central area according to the sight line focal position, the sight line central area being a region centered on the sight line focal position, and to generate the sight line associated region according to the sight line central area and a preset region association relation.

The multimedia processing device based on visual attention provided by this embodiment of the invention obtains the viewer's sight line focal position, determines the region the viewer is watching from the sight line associated region derived from that position, and then adjusts the sight line associated region directly to meet the user's expectations. The audio and video display is thus controlled by determining the user's sight line focus, without affecting the user's experience.

Furthermore, because the relevant information of the viewer's sight line associated region is sent to the remote server, the remote server can process the encoding and decoding at the source end of the audio and video and provide the user with a better audio and video display, so that the user obtains a better experience.

From the description of the above embodiments, those skilled in the art can clearly understand that the invention can be implemented by software plus the necessary general-purpose hardware, or of course by hardware alone, although in many cases the former is the better implementation. Based on this understanding, the part of the technical solution of the invention that is essential, or that contributes to the prior art, can be embodied in the form of a software product. The computer software product is stored in a readable storage medium, such as a computer floppy disk, hard disk or optical disc, and includes instructions that cause a computer device (which can be a personal computer, a server, a network device or the like) to execute the methods described in the embodiments of the invention.

The above are merely specific embodiments of the invention, but the protection scope of the invention is not limited to them. Any change or replacement that a person skilled in the art can readily conceive of within the technical scope disclosed by the invention shall fall within the protection scope of the invention. Therefore, the protection scope of the invention shall be subject to the protection scope of the claims.

Claims

1. A multimedia processing method based on visual attention, characterized by comprising:

2. The method according to claim 1, characterized in that, when a plurality of sight-line focal positions corresponding to a plurality of viewers are collected, said obtaining, according to the sight-line focal position, the sight-line associated region corresponding to the sight-line focal position comprises:

obtaining, according to the plurality of sight-line focal positions, a plurality of sight-line associated regions respectively corresponding to the plurality of sight-line focal positions, and merging the plurality of sight-line associated regions to obtain a merged sight-line associated region; or

obtaining usage permissions of the plurality of viewers through face recognition, and obtaining, according to the usage permissions of the plurality of viewers and the plurality of sight-line focal positions, the sight-line associated region corresponding to the plurality of sight-line focal positions.
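
As a non-limiting illustration of the merging alternative in claim 2, the sketch below merges several regions into one. The claim does not specify how the merge is performed; the rectangular `Region` type and the bounding-box merge rule are assumptions made only for this example.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Region:
    """Hypothetical axis-aligned region in screen coordinates."""
    left: int
    top: int
    right: int
    bottom: int


def merge_regions(regions: Sequence[Region]) -> Region:
    """Merge regions into the smallest rectangle containing all of them.

    Assumption: "merging" is read as taking the bounding box of the
    per-viewer regions. Other merge rules are equally compatible with
    the claim wording.
    """
    if not regions:
        raise ValueError("at least one region is required")
    return Region(
        left=min(r.left for r in regions),
        top=min(r.top for r in regions),
        right=max(r.right for r in regions),
        bottom=max(r.bottom for r in regions),
    )
```

With two viewers whose regions are `Region(0, 0, 10, 10)` and `Region(5, 5, 20, 15)`, this rule yields a single merged region spanning both.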

3. The method according to claim 2, characterized in that said obtaining, according to the usage permissions of the plurality of viewers and the plurality of sight-line focal positions, the sight-line associated region corresponding to the plurality of sight-line focal positions comprises:

determining whether the usage permissions of the plurality of viewers are identical;

if the usage permissions of the plurality of viewers are different, obtaining the corresponding sight-line associated region according to the sight-line focal position of the viewer having the highest usage permission, and taking the obtained sight-line associated region as the sight-line associated region corresponding to the sight-line focal positions of the plurality of viewers;

if the usage permissions of the plurality of viewers are identical, obtaining, according to the plurality of sight-line focal positions, a plurality of sight-line associated regions respectively corresponding to the plurality of sight-line focal positions; if an overlapping area exists among the plurality of sight-line associated regions, determining that the overlapping area is the sight-line associated region corresponding to the plurality of sight-line focal positions; if no overlapping area exists among the plurality of sight-line associated regions, determining that the sight-line associated region corresponding to the sight-line focal position nearest to the center of the picture of the camera device is the sight-line associated region corresponding to the plurality of sight-line focal positions.
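
The decision logic of claim 3 can be sketched as follows. This is only an illustrative reading: the `Rect` and `Viewer` types, the integer permission levels (larger meaning higher), the use of Euclidean distance, and the assumption that focal positions and the camera picture center share one coordinate system are not stated in the claim.

```python
from dataclasses import dataclass
from math import hypot
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Rect:
    """Hypothetical axis-aligned sight-line associated region."""
    left: float
    top: float
    right: float
    bottom: float


@dataclass(frozen=True)
class Viewer:
    permission: int              # assumption: larger value = higher permission
    focus: Tuple[float, float]   # sight-line focal position
    region: Rect                 # region already derived from that position


def intersect(a: Rect, b: Rect) -> Optional[Rect]:
    """Return the overlapping area of two rectangles, or None if disjoint."""
    left, top = max(a.left, b.left), max(a.top, b.top)
    right, bottom = min(a.right, b.right), min(a.bottom, b.bottom)
    if left < right and top < bottom:
        return Rect(left, top, right, bottom)
    return None


def select_region(viewers: Sequence[Viewer],
                  camera_center: Tuple[float, float]) -> Rect:
    """Choose one region for several viewers, following the claim 3 branches."""
    if not viewers:
        raise ValueError("at least one viewer is required")
    # Different permissions: the viewer with the highest permission wins.
    # Ties among the highest level are not addressed by the claim; max()
    # simply keeps the first such viewer.
    if len({v.permission for v in viewers}) > 1:
        return max(viewers, key=lambda v: v.permission).region
    # Identical permissions: use the area common to all regions if it exists.
    overlap: Optional[Rect] = viewers[0].region
    for v in viewers[1:]:
        overlap = intersect(overlap, v.region)
        if overlap is None:
            break
    if overlap is not None:
        return overlap
    # No common area: take the focal position nearest the camera picture center.
    cx, cy = camera_center
    nearest = min(viewers, key=lambda v: hypot(v.focus[0] - cx, v.focus[1] - cy))
    return nearest.region
```

In this sketch each viewer's region is computed beforehand, so the function only arbitrates between viewers; how a single region is derived from a focal position is outside its scope.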

4. The method according to claim 1, characterized in that said performing video enhancement processing on the sight-line associated region comprises:

performing image enhancement processing on the image information in the sight-line associated region; or

performing video codec enhancement processing on the video information in the sight-line associated region.

5. The method according to claim 1, characterized in that, after the sight-line associated region corresponding to the sight-line focal position is obtained according to the sight-line focal position, the method further comprises:

sending information about the sight-line associated region to a remote server, so that the remote server performs video enhancement processing on the sight-line associated region according to the information about the sight-line associated region.

6. The method according to any one of claims 1 to 5, characterized in that said obtaining, according to the sight-line focal position, the sight-line associated region corresponding to the sight-line focal position further comprises:

obtaining a sight-line central area according to the sight-line focal position, the sight-line central area being an area centered on the sight-line focal position;

generating the sight-line associated region according to the sight-line central area and a preset area association relation.
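
The first step of claim 6, deriving a central area centered on the focal position, might look like the sketch below. The square shape, the `half_size` parameter, and the clamping to the screen bounds are assumptions for illustration; the claim only requires that the area be centered on the sight-line focal position.

```python
from typing import Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), hypothetical layout


def central_region(focus: Tuple[int, int],
                   half_size: int,
                   screen_size: Tuple[int, int]) -> Box:
    """Return a square area around the focal position, clipped to the screen.

    Near a screen edge the clipped area is no longer exactly centered on
    the focal position; this is a simplification of the sketch.
    """
    fx, fy = focus
    width, height = screen_size
    left = max(0, fx - half_size)
    top = max(0, fy - half_size)
    right = min(width, fx + half_size)
    bottom = min(height, fy + half_size)
    return (left, top, right, bottom)
```

The resulting box would then be passed to whichever preset area association relation is configured in order to generate the sight-line associated region.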

7. The method according to claim 6, characterized in that the preset area association relation is any one of the following relations:

the image pixels of the sight-line associated region are identical or similar to those of the sight-line central area; the image content of the sight-line associated region is similar to that of the sight-line central area; the image shape of the sight-line associated region is identical or similar to that of the sight-line central area; the text in the sight-line associated region and the text in the sight-line central area belong to the same paragraph.
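
One possible reading of the "identical or similar image pixels" relation in claim 7 is a region-growing pass that starts from the central area and absorbs neighboring pixels of similar value. The sketch below is an assumption-laden illustration: the grayscale grid representation, the `tolerance` threshold, 4-connectivity, and comparison against the mean of the seed pixels are all choices made for this example and are not taken from the claims.

```python
from collections import deque
from typing import List, Sequence, Set, Tuple

Cell = Tuple[int, int]  # (row, column)


def grow_similar_region(image: List[List[int]],
                        seed_cells: Sequence[Cell],
                        tolerance: int) -> Set[Cell]:
    """Grow a region from the seed cells over 4-connected similar pixels.

    A neighboring pixel joins the region when its value differs from the
    mean value of the seed cells by at most `tolerance`.
    """
    if not seed_cells:
        raise ValueError("at least one seed cell is required")
    rows, cols = len(image), len(image[0])
    reference = sum(image[r][c] for r, c in seed_cells) / len(seed_cells)
    region: Set[Cell] = set(seed_cells)
    queue = deque(seed_cells)
    while queue:
        r, c = queue.popleft()
        for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and (nr, nc) not in region:
                if abs(image[nr][nc] - reference) <= tolerance:
                    region.add((nr, nc))
                    queue.append((nr, nc))
    return region
```

Here the seed cells play the role of the sight-line central area and the returned set plays the role of the sight-line associated region; the other relations listed in the claim (content, shape, paragraph membership) would need different predicates.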

8. A multimedia processing apparatus based on visual attention, characterized by comprising:

an adjustment unit, configured to perform video enhancement processing on the video image corresponding to the sight-line associated region.

9. The multimedia processing apparatus based on visual attention according to claim 8, characterized in that the acquiring unit comprises:

a first acquisition module, configured to, when the sight-line focal position is a plurality of sight-line focal positions corresponding to a plurality of viewers, obtain, according to the plurality of sight-line focal positions, a plurality of sight-line associated regions respectively corresponding to the plurality of sight-line focal positions, and merge the plurality of sight-line associated regions to obtain a merged sight-line associated region;

a second acquisition module, configured to, when the sight-line focal position is a plurality of sight-line focal positions corresponding to a plurality of viewers, obtain usage permissions of the plurality of viewers through face recognition, and obtain, according to the usage permissions of the plurality of viewers and the plurality of sight-line focal positions, the sight-line associated region corresponding to the sight-line focal positions.

10. The multimedia processing apparatus based on visual attention according to claim 9, characterized in that the second acquisition module comprises:

a permission determining submodule, configured to determine whether the usage permissions of the plurality of viewers are identical;

an area determining submodule, configured to, when the usage permissions of the plurality of viewers are different, obtain the corresponding sight-line associated region according to the sight-line focal position of the viewer having the highest usage permission, and take the obtained sight-line associated region as the sight-line associated region corresponding to the sight-line focal positions of the plurality of viewers;

the area determining submodule being further configured to, when the usage permissions of the plurality of viewers are identical, obtain, according to the plurality of sight-line focal positions, a plurality of sight-line associated regions respectively corresponding to the plurality of sight-line focal positions; if an overlapping area exists among the plurality of sight-line associated regions, determine that the overlapping area is the sight-line associated region corresponding to the plurality of sight-line focal positions; if no overlapping area exists among the plurality of sight-line associated regions, determine that the sight-line associated region corresponding to the sight-line focal position nearest to the center of the picture of the camera device is the sight-line associated region corresponding to the plurality of sight-line focal positions.

11. The multimedia processing apparatus based on visual attention according to claim 8, characterized in that the adjustment unit comprises:

a first enhancement module, configured to perform image enhancement processing on the image information in the sight-line associated region;

a second enhancement module, configured to perform video codec enhancement processing on the video information in the sight-line associated region.

12. The multimedia processing apparatus based on visual attention according to claim 8, characterized in that the apparatus further comprises:

a sending unit, configured to send information about the sight-line associated region to a remote server, so that the remote server performs video enhancement processing on the sight-line associated region according to the information about the sight-line associated region.

13. The multimedia processing apparatus based on visual attention according to any one of claims 8 to 12, characterized in that the acquiring unit is further configured to obtain a sight-line central area according to the sight-line focal position, the sight-line central area being an area centered on the sight-line focal position, and to generate the sight-line associated region according to the sight-line central area and a preset area association relation.