CN106303366A - A kind of method and device of Video coding based on territorial classification coding - Google Patents
A kind of method and device of Video coding based on territorial classification coding Download PDFInfo
- Publication number
- CN106303366A CN106303366A CN201610685073.8A CN201610685073A CN106303366A CN 106303366 A CN106303366 A CN 106303366A CN 201610685073 A CN201610685073 A CN 201610685073A CN 106303366 A CN106303366 A CN 106303366A
- Authority
- CN
- China
- Prior art keywords
- region
- pretreatment
- computer
- picture
- carries out
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
The invention discloses the method and device of a kind of Video coding based on territorial classification coding, relate to technical field of video transmission;Solve the technical problem how more effectively to transmit under cbr (constant bit rate);This technical scheme includes: step one, identifies each content area in video pictures;Step 2, carries out pretreatment to each region, reduces image noise.
Description
Technical field
The present invention relates to technical field of video transmission, particularly to the side of a kind of Video coding based on territorial classification coding
Method and device.
Background technology
Under normal circumstances, video conference main frame connects high-definition camera shooting meeting-place picture, as it is shown in figure 1, carry out video
Coding transmission.But it is because the impact such as light, photographic head sampling, the slide region photographed and original slide image phase
Ratio, has bigger noise, and color also can change, such as, the solid color regions on lantern slide, take with photographic head,
Not being the most pure color, this causes the compression ratio after information distortion and Video coding to reduce.How to carry out more under cbr (constant bit rate)
Effective transmission becomes technical problem urgently to be resolved hurrily.
Summary of the invention
The present invention is to solve the technical problem how more effectively to transmit under cbr (constant bit rate).
In order to solve the problems referred to above, a kind of method that the invention provides Video coding based on territorial classification coding, bag
Include:
Step one, identifies each content area in video pictures;
Step 2, carries out pretreatment respectively to each region, reduces image noise.
Present invention also offers the device of a kind of Video coding based on territorial classification coding, including:
Recognition unit, identifies each content area in video pictures;
Pretreatment unit, carries out pretreatment respectively to each region, reduces image noise.
Technical scheme achieves the method and device of a kind of Video coding based on territorial classification coding, uses
Different modes carries out pretreatment to image zones of different, can reduce image noise, thus highlight the content that user is interested,
Improve the perceived quality of user.
Accompanying drawing explanation
The existing photographic head of Fig. 1 and video conference main frame connection diagram;
Fig. 2 photographic head of the present invention and video conference main frame connection diagram;
The method schematic diagram of a kind of Video coding based on territorial classification coding of Fig. 3;
The method flow schematic diagram of a kind of Video coding based on territorial classification coding of Fig. 4;
Fig. 5 reduces the preprocess method schematic diagram of spatial resolution;
The device schematic diagram of a kind of Video coding based on territorial classification coding of Fig. 6.
Detailed description of the invention
Below in conjunction with drawings and Examples, technical scheme is described in detail.
If it should be noted that do not conflict, each feature in the embodiment of the present invention and embodiment can mutually be tied
Close, all within protection scope of the present invention.Although it addition, show logical order in flow charts, but in some situation
Under, can be to be different from the step shown or described by order execution herein.
Embodiment one, a kind of method of Video coding based on territorial classification coding, as it is shown on figure 3, include:
Step one, identifies each content area in video pictures;
Step 2, carries out pretreatment respectively to each region, reduces image noise.
Technical scheme achieves the method and device of a kind of Video coding based on territorial classification coding, uses
Different modes carries out pretreatment to image zones of different, can reduce image noise, thus highlight the content that user is interested,
Improve the perceived quality of user.
Embodiment two, a kind of method of Video coding based on territorial classification coding, as shown in Figure 4, in embodiment one
On the basis of, including:
Further, described step one, each content area is divided into: human face region, computer viewing area, zone of action, no
The combination in the one or more regions in zone of action.
Computer viewing area, human face region, zone of action and inactive region, human eye is at the emphasis perceptually paid close attention to not
With.Human face region is of greatest concern.For zone of action, human eye more preferably pays close attention to its motion.And to inactive region, human eye
Focus more on its details.Therefore, computer viewing area, human face region, zone of action and inactive region, in pretreatment link
Treat with a certain discrimination.
By mark or image analysis technology in advance, identify the human face region in video pictures, computer viewing area, work
Dynamic region and inactive region, before traditional coding flow process, carry out pre-place to image zones of different in different ways
Reason, reduces image noise, the content that prominent user is interested, improves the perceived quality of user.
Further, described step 2, each region is carried out pretreatment, described human face region does not carry out pretreatment.
Use human face detection tech, detect the human face region in picture, be A by this area marking;Human face region is
Of greatest concern, so human face region does not carry out pretreatment.
Further, described step 2, each region is carried out pretreatment, described computer viewing area, at camera collection
To picture on, mark out computer picture, then by affine transformation, use the picture collected from computer to replace shooting
The computer viewing area of mark in the picture that machine photographs.As shown in Figure 2.
If using Fig. 2 structure, video conference main frame connection photographic head and speech computer, pass through API on speech computer
Directly collect original desktop images.By mark form, camera collection to picture on, mark out computer picture
Four angle points, then by affine transformation, use the picture collected from computer to replace the picture that video camera photographs
In the computer viewing area of mark, it is possible to the effective computer viewing area display quality promoted in the final picture of video conference,
And can effectively improve compression ratio.
Because in video conference, photographic head is usually fixed, and can mark out computer and show by the way of mark in advance
Show four focuses of region B;To region B, the real-time pictures that will obtain in speech computer, through affine transformation, cover frame
On image;Video conference main frame is directly connected to video camera and computer equipment, by obtaining computer picture in real time, uses affine transformation
Camera views corresponding content, strengthens picture.
Further, described step 2, each region is carried out pretreatment, described zone of action, carries out reducing spatial discrimination
The pretreatment of rate.
Use frame difference method, in non-A, non-B region, identify zone of action C.
Further, the preprocess method reducing spatial resolution is: image pixel is divided into the little lattice of M*N, will be the least
Image pixel in lattice, in employing grid, the meansigma methods of each pixel value substitutes.
The preprocess method reducing spatial resolution is:
Image pixel is divided into the little lattice of M*N, is typically 2*2.By the image pixel in every little lattice, each in using grid
The meansigma methods of pixel value substitutes, as it is shown in figure 5, so reduce spatial resolution, improves Video coding compression ratio.
Further, described step 2, each region is carried out pretreatment, described inertia region, carrying out the reduction time divides
The pretreatment of resolution.
Identify and mark out inactive region D.
Further, the preprocess method reducing temporal resolution is: assume that certain some pixel value is V, its front n frame pretreatment
After pixel value be respectively V1, V2 ..., Vn, its meansigma methods is Vm, sets threshold value t, as V and Vm difference absolute value not higher than
Threshold value t, then after pretreatment, this pixel value is Vm, is otherwise V.So reduce temporal resolution, improve Video coding compression
Rate.
Embodiment three, the device of a kind of Video coding based on territorial classification coding, as shown in Figure 6, including:
Recognition unit, identifies each content area in video pictures;
Pretreatment unit, carries out pretreatment respectively to each region, reduces image noise.
Technical scheme achieves the method and device of a kind of Video coding based on territorial classification coding, uses
Different modes carries out pretreatment to image zones of different, can reduce image noise, thus highlight the content that user is interested,
Improve the perceived quality of user.
Embodiment four, the device of a kind of Video coding based on territorial classification coding, as shown in Figure 6, in embodiment three
On the basis of farther include:
Further, described recognition unit, each content area is divided into: human face region, computer viewing area, zone of action,
The combination in the one or more regions in inertia region.
Computer viewing area, human face region, zone of action and inactive region, human eye is at the emphasis perceptually paid close attention to not
With.Human face region is of greatest concern.For zone of action, human eye more preferably pays close attention to its motion.And to inactive region, human eye
Focus more on its details.Therefore, computer viewing area, human face region, zone of action and inactive region, in pretreatment link
Treat with a certain discrimination.
By mark or image analysis technology in advance, identify the human face region in video pictures, computer viewing area, work
Dynamic region and inactive region, before traditional coding flow process, carry out pre-place to image zones of different in different ways
Reason, reduces image noise, the content that prominent user is interested, improves the perceived quality of user.
Further, described pretreatment unit, each region is carried out pretreatment, described human face region does not carry out pretreatment.
Use human face detection tech, detect the human face region in picture, be A by this area marking;Human face region is
Of greatest concern, so human face region does not carry out pretreatment.
Further, described pretreatment unit, each region is carried out pretreatment, described computer viewing area, at photographic head
On the picture collected, mark out computer picture, then by affine transformation, use the picture collected from computer to replace
The computer viewing area of mark in the picture that video camera photographs.As shown in Figure 2.
If using Fig. 2 structure, video conference main frame connection photographic head and speech computer, pass through API on speech computer
Directly collect original desktop images.By mark form, camera collection to picture on, mark out computer picture
Four angle points, then by affine transformation, use the picture collected from computer to replace the picture that video camera photographs
In the computer viewing area of mark, it is possible to the effective computer viewing area display quality promoted in the final picture of video conference,
And can effectively improve compression ratio.
Because in video conference, photographic head is usually fixed, and can mark out computer and show by the way of mark in advance
Show four focuses of region B;To region B, the real-time pictures that will obtain in speech computer, through affine transformation, cover frame
On image;Video conference main frame is directly connected to video camera and computer equipment, by obtaining computer picture in real time, uses affine transformation
Camera views corresponding content, strengthens picture.
Further, described pretreatment unit, each region is carried out pretreatment, described zone of action, carries out reducing space
The pretreatment of resolution.Use frame difference method, in non-A, non-B region, identify zone of action C.
Further, the preprocess method reducing spatial resolution is: image pixel is divided into the little lattice of M*N, will be the least
Image pixel in lattice, in employing grid, the meansigma methods of each pixel value substitutes.
The preprocess method reducing spatial resolution is:
Image pixel is divided into the little lattice of M*N, is typically 2*2.By the image pixel in every little lattice, each in using grid
The meansigma methods of pixel value substitutes, as it is shown in figure 5, so reduce spatial resolution, improves Video coding compression ratio.
Further, described pretreatment unit, each region is carried out pretreatment, described inertia region, when reducing
Between the pretreatment of resolution.Identify and mark out inactive region D.
Further, the preprocess method reducing temporal resolution is: assume that certain some pixel value is V, its front n frame pretreatment
After pixel value be respectively V1, V2 ..., Vn, its meansigma methods is Vm, sets threshold value t, as V and Vm difference absolute value not higher than
Threshold value t, then after pretreatment, this pixel value is Vm, is otherwise V.So reduce temporal resolution, improve Video coding compression
Rate.
Image in HD video meeting, according to the difference of the focus of user, is divided into four class regions by the present invention: face
Region, computer viewing area, zone of action and region, four, inertia region, before traditional coding flow process, use difference
Mode image zones of different is carried out pretreatment, reduce image noise, the content that prominent user is interested, improve the sense of user
Know quality.
One of ordinary skill in the art will appreciate that all or part of step in said method can be instructed by program
Related hardware completes, and described program can be stored in computer-readable recording medium, such as read only memory, disk or CD
Deng.Alternatively, all or part of step of above-described embodiment can also use one or more integrated circuit to realize.Accordingly
Ground, each module/unit in above-described embodiment can realize to use the form of hardware, it would however also be possible to employ the shape of software function module
Formula realizes.The present invention is not restricted to the combination of the hardware and software of any particular form.
Certainly, the present invention also can have other various embodiments, in the case of without departing substantially from present invention spirit and essence thereof, ripe
Know those skilled in the art to work as and can make various corresponding change and deformation according to the present invention, but these change accordingly and become
Shape all should belong to the scope of the claims of the present invention.
Claims (16)
1. the method for a Video coding based on territorial classification coding, it is characterised in that including:
Step one, identifies each content area in video pictures;
Step 2, carries out pretreatment respectively to each region, reduces image noise.
2. the method for claim 1, it is characterised in that described step one, each content area is divided into: human face region, electricity
The combination in the one or more regions in brain viewing area, zone of action, inertia region.
3. method as claimed in claim 2, it is characterised in that described step 2, carries out pretreatment, described face to each region
Region does not carry out pretreatment.
4. method as claimed in claim 2, it is characterised in that described step 2, carries out pretreatment, described computer to each region
Viewing area, camera collection to picture on, mark out computer picture, then by affine transformation, use from computer
The picture collected replaces the computer viewing area of mark in the picture that video camera photographs.
5. method as claimed in claim 2, it is characterised in that described step 2, carries out pretreatment, described activity to each region
Region, carries out reducing the pretreatment of spatial resolution.
6. method as claimed in claim 5, it is characterised in that the preprocess method reducing spatial resolution is: by image slices
Element is divided into the little lattice of M*N, and by the image pixel in every little lattice, in employing grid, the meansigma methods of each pixel value substitutes.
7. method as claimed in claim 2, it is characterised in that described step 2, carries out pretreatment to each region, described does not lives
Dynamic region, carries out reducing the pretreatment of temporal resolution.
8. method as claimed in claim 7, it is characterised in that the preprocess method reducing temporal resolution is: assume certain point
Pixel value is V, and its front pretreated pixel value of n frame is respectively V1, V2 ..., Vn, its meansigma methods is Vm, sets threshold value t, such as V
Be not higher than threshold value t with the absolute value of the difference of Vm, then after pretreatment, this pixel value is Vm, is otherwise V.
9. the device of a Video coding based on territorial classification coding, it is characterised in that including:
Recognition unit, identifies each content area in video pictures;
Pretreatment unit, carries out pretreatment respectively to each region, reduces image noise.
10. device as claimed in claim 9, it is characterised in that described recognition unit, each content area is divided into: human face region,
The combination in the one or more regions in computer viewing area, zone of action, inertia region.
11. devices as claimed in claim 10, it is characterised in that described pretreatment unit, carry out pretreatment, institute to each region
State human face region and do not carry out pretreatment.
12. devices as claimed in claim 10, it is characterised in that described pretreatment unit, carry out pretreatment, institute to each region
State computer viewing area, camera collection to picture on, mark out computer picture, then by affine transformation, use from
The picture collected on computer replaces the computer viewing area of mark in the picture that video camera photographs.
13. devices as claimed in claim 10, it is characterised in that described pretreatment unit, carry out pretreatment, institute to each region
State zone of action, carry out reducing the pretreatment of spatial resolution.
14. devices as claimed in claim 13, it is characterised in that the preprocess method reducing spatial resolution is: by image
Pixel is divided into the little lattice of M*N, and by the image pixel in every little lattice, in employing grid, the meansigma methods of each pixel value substitutes.
15. devices as claimed in claim 10, it is characterised in that described pretreatment unit, carry out pretreatment, institute to each region
State inertia region, carry out reducing the pretreatment of temporal resolution.
16. devices as claimed in claim 15, it is characterised in that the preprocess method reducing temporal resolution is: assume certain
Point pixel value is V, and its front pretreated pixel value of n frame is respectively V1, V2 ..., Vn, its meansigma methods is Vm, sets threshold value t,
Absolute value such as the difference of V and Vm is not higher than threshold value t, then after pretreatment, this pixel value is Vm, is otherwise V.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610685073.8A CN106303366B (en) | 2016-08-18 | 2016-08-18 | Video coding method and device based on regional classification coding |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610685073.8A CN106303366B (en) | 2016-08-18 | 2016-08-18 | Video coding method and device based on regional classification coding |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106303366A true CN106303366A (en) | 2017-01-04 |
CN106303366B CN106303366B (en) | 2020-06-19 |
Family
ID=57679842
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610685073.8A Active CN106303366B (en) | 2016-08-18 | 2016-08-18 | Video coding method and device based on regional classification coding |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106303366B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109561239A (en) * | 2018-08-20 | 2019-04-02 | 张亮 | Piece caudal flexure intelligent selection platform |
WO2022148142A1 (en) * | 2021-01-05 | 2022-07-14 | 华为技术有限公司 | Image processing method and apparatus |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101389014A (en) * | 2007-09-14 | 2009-03-18 | 浙江大学 | Resolution variable video encoding and decoding method based on regions |
CN103310411A (en) * | 2012-09-25 | 2013-09-18 | 中兴通讯股份有限公司 | Image local reinforcement method and device |
CN103888710A (en) * | 2012-12-21 | 2014-06-25 | 深圳市捷视飞通科技有限公司 | Video conferencing system and method |
CN103929640A (en) * | 2013-01-15 | 2014-07-16 | 英特尔公司 | Techniques For Managing Video Streaming |
-
2016
- 2016-08-18 CN CN201610685073.8A patent/CN106303366B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101389014A (en) * | 2007-09-14 | 2009-03-18 | 浙江大学 | Resolution variable video encoding and decoding method based on regions |
CN103310411A (en) * | 2012-09-25 | 2013-09-18 | 中兴通讯股份有限公司 | Image local reinforcement method and device |
CN103888710A (en) * | 2012-12-21 | 2014-06-25 | 深圳市捷视飞通科技有限公司 | Video conferencing system and method |
CN103929640A (en) * | 2013-01-15 | 2014-07-16 | 英特尔公司 | Techniques For Managing Video Streaming |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109561239A (en) * | 2018-08-20 | 2019-04-02 | 张亮 | Piece caudal flexure intelligent selection platform |
WO2022148142A1 (en) * | 2021-01-05 | 2022-07-14 | 华为技术有限公司 | Image processing method and apparatus |
Also Published As
Publication number | Publication date |
---|---|
CN106303366B (en) | 2020-06-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6257840B2 (en) | System and method for liveness analysis | |
US8982180B2 (en) | Face and other object detection and tracking in off-center peripheral regions for nonlinear lens geometries | |
TWI496109B (en) | Image processor and image merging method thereof | |
US8942509B2 (en) | Apparatus and method creating ghost-free high dynamic range image using filtering | |
US8896703B2 (en) | Superresolution enhancment of peripheral regions in nonlinear lens geometries | |
US9961272B2 (en) | Image capturing apparatus and method of controlling the same | |
KR102289261B1 (en) | Apparatus and method detecting motion mask | |
JP2010136032A (en) | Video monitoring system | |
US8896670B2 (en) | Image processing device, image processing method, and program | |
JP2010503006A5 (en) | ||
CN106470313B (en) | Image generation system and image generation method | |
US10616502B2 (en) | Camera preview | |
JP2024504270A (en) | Image fusion of scenes with objects at multiple depths | |
KR101890134B1 (en) | The analysis system and controlling method of moving image data by a CCTV monitor | |
US20100021008A1 (en) | System and Method for Face Tracking | |
CN106303366A (en) | A kind of method and device of Video coding based on territorial classification coding | |
KR20160137289A (en) | Photographing apparatus and method for controlling the same | |
EP4375925A1 (en) | Photographic image processing method and device | |
CN103155002A (en) | Method and arrangement for identifying virtual visual information in images | |
JP2016208355A (en) | Image monitoring device, image monitoring method, and image monitoring program | |
JP2011101161A (en) | Imaging device, control method of the same, reproducing device, and program | |
WO2024062971A1 (en) | Information processing device, information processing method, and information processing program | |
US20180211365A1 (en) | Efficient path-based method for video denoising | |
Bacci et al. | Forensic Facial Comparison: Current Status, Limitations, and Future Directions. Biology 2021, 10, 1269 | |
US20170289485A1 (en) | Method and device for displaying a plurality of videos |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 100040 Shijingshan District railway building, Beijing, the 16 floor Applicant after: Chinese translation language through Polytron Technologies Inc Address before: 100040 Shijingshan District railway building, Beijing, the 16 floor Applicant before: Mandarin Technology (Beijing) Co., Ltd. |
|
CB02 | Change of applicant information | ||
GR01 | Patent grant | ||
GR01 | Patent grant |