WO2017088479A1 - Method of identifying digital on-screen graphic and device - Google Patents

Publication number
WO2017088479A1
Authority
WO
WIPO (PCT)
Prior art keywords
logo
station
area
identified
standard
Prior art date
Application number
PCT/CN2016/088466
Other languages
French (fr)
Chinese (zh)
Inventor
何小坤
Original Assignee
乐视控股(北京)有限公司
乐视致新电子科技(天津)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 乐视控股(北京)有限公司, 乐视致新电子科技(天津)有限公司 filed Critical 乐视控股(北京)有限公司
Publication of WO2017088479A1 publication Critical patent/WO2017088479A1/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/414 Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75 Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10016 Video; Image sequence

Definitions

  • the embodiments of the present invention relate to the field of information recognition technologies, and in particular, to a station logo identification method and apparatus.
  • Smart TV is adapting to the trend of “high definition”, “networking” and “intelligence”. It has the ability to obtain program content from various channels such as the Internet, video equipment, computers, etc., and to meet the needs of consumers through information fusion. The ability to clearly show content on a large screen. Compared with traditional TV, smart TV provides users with faster, smarter and more humanized application services.
  • Smart TVs contain a large number of on-demand videos and live programs, and most TV programs retain the logo of the producers.
  • The station logo is an important symbol for distinguishing TV stations. It carries important semantic information such as the TV station name, program source, and program orientation, and is an important semantic source for video analysis, understanding, and retrieval. Station logo recognition makes functions such as the program list practical, and is of great significance for understanding user preferences and delivering value-added services.
  • The first scheme: a logo recognition scheme based on a single-frame image.
  • This method takes the edge distance transformation matrix as the feature and adopts template matching, including global edge matching (a sliding window traverses the entire logo area), block matching (non-edge regions are filtered out manually), and color-shape matching (color and shape features are matched at the same time), with the search results then sorted.
  • The second scheme: a logo recognition method based on multi-frame images.
  • This scheme generally adopts one of the following three methods. Method 1: segment the image (i.e., obtain the logo area) using the changes in pixel feature values across a multi-frame video frame sequence, perform edge detection, and perform matching identification using a sliding-window and block method. Method 2: use color and shape features together, segment the logo from the frame sequence according to its spatio-temporal invariance, describe the features by combining the spatial distribution histogram with the HSV color histogram, and finally use an SVM to complete the logo recognition.
  • Method 3: calculate the change between adjacent frame images, extract the station logo and its Hu invariant moments, and obtain the recognition result according to the candidate set and a preset criterion.
  • the CCTV logo, the satellite logo, and the local logo are identified in the same manner, resulting in low recognition efficiency and low recognition accuracy.
  • the embodiment of the invention provides a method and a device for identifying a station logo, which are used to solve the defects of low recognition efficiency and low recognition accuracy in the prior art.
  • An embodiment of the present invention provides a station logo identification method, where the method includes:
  • the to-be-identified station logo is identified according to a first preset policy
  • the to-be-identified station logo is identified according to a second preset policy.
  • An embodiment of the present invention provides a station identification device, and the device includes:
  • the station logo determining unit is configured to determine whether the station logo to be identified in the station logo area is a CCTV station logo
  • a first identifying unit configured to: when the to-be-identified station logo is a CCTV logo, identify the to-be-identified station logo according to a first preset policy;
  • the second identifying unit is configured to identify the to-be-identified station logo according to the second preset policy when the to-be-identified station logo is a non-CCTV station logo.
  • An embodiment of the present invention provides a server, including:
  • the communication interface is used for information transmission between the user equipment and the server;
  • the processor is configured to invoke logic instructions in the memory to perform the following method
  • An embodiment of the present invention provides a computer program, including program code, where the program code is used to do the following:
  • the to-be-identified station logo is identified according to a first preset policy
  • the to-be-identified station logo is identified according to a second preset policy.
  • Embodiments of the present invention provide a storage medium for storing the above computer program.
  • The station logo identification method and device provided by the embodiments of the present invention no longer identify every station logo in the same manner. Instead, they first determine whether the station logo to be identified in the station logo area is a CCTV station logo, and then identify CCTV station logos and non-CCTV station logos using different strategies. This allows CCTV and non-CCTV logos to be identified effectively and improves recognition efficiency and recognition accuracy.
  • FIG. 1 is a flow chart of a method for identifying a station logo according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of a video frame image before area division in an embodiment of the present invention
  • FIG. 3 is a schematic diagram of a video frame image after region division in an embodiment of the present invention.
  • FIG. 4 is an overall effect diagram of combining the edges of each video frame image by taking CCTV5 as an example
  • FIG. 5 is a flowchart of a method for identifying a station logo according to an embodiment of the present invention
  • FIG. 6 is a block diagram showing the structure of a station identification device according to an embodiment of the present invention.
  • FIG. 7 is a schematic structural diagram of a server according to an embodiment of the present invention.
  • FIG. 1 is a flowchart of a method for identifying a logo of an embodiment of the present invention. Referring to FIG. 1, the method includes:
  • S101 determining whether the to-be-identified station logo in the station logo area is a CCTV station logo
  • the logo area is an area including only the logo to be identified.
  • the logo area can be extracted in a plurality of manners.
  • The station logo area including the station logo to be identified is obtained by the following steps:
  • The TV station logo is generally located in the upper left corner of the video frame image (if it is in another position, this can be adjusted as needed), so when the logo is detected, only the fixed upper-left area (i.e., the preset area) needs to be extracted.
  • the existing logo recognition method generally acquires the logo area according to the optimal area rule (GSR).
  • The partition of the station logo area is: line start position 80 (1/24), column start position 40 (1/27), line width 450 (15/64), and column width 180 (1/6). The effect of the area division is shown in FIG. 2 and FIG. 3. The proportional position can be adjusted as needed, and this embodiment does not limit it.
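  • As a rough illustration of the proportional partition above, the following minimal sketch converts the quoted ratios into pixel coordinates. It assumes that the "line" ratios (1/24 and 15/64) apply to the frame width and the "column" ratios (1/27 and 1/6) apply to the frame height, which is consistent with the quoted pixel values for a 1920×1080 frame; the function names `logo_region` and `crop` are hypothetical.

```python
from fractions import Fraction

# Ratios quoted in the text. Which frame dimension each ratio applies to
# is an assumption inferred from the quoted pixel values.
X_START = Fraction(1, 24)   # horizontal start / frame width
Y_START = Fraction(1, 27)   # vertical start / frame height
WIDTH = Fraction(15, 64)    # region width / frame width
HEIGHT = Fraction(1, 6)     # region height / frame height


def logo_region(frame_width, frame_height):
    """Return (x, y, w, h) of the preset upper-left logo region in pixels."""
    x = int(frame_width * X_START)
    y = int(frame_height * Y_START)
    w = int(frame_width * WIDTH)
    h = int(frame_height * HEIGHT)
    return x, y, w, h


def crop(frame, region):
    """Crop a frame given as a list of pixel rows to the given region."""
    x, y, w, h = region
    return [row[x:x + w] for row in frame[y:y + h]]
```

  • Using ratios rather than fixed pixel values keeps the preset area usable at other resolutions, which matches the remark that the proportional position can be adjusted.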
  • each video frame image can be preprocessed.
  • the pre-processing includes at least one of area segmentation, gradation, and image enhancement.
  • other processes may be included, which is not limited in this embodiment.
  • Gray is the gray value of the pixel, R is the red component of the pixel, G is the green component of the pixel, and B is the blue component of the pixel.
  • image enhancement uses grayscale stretching from 0 to 255 grayscale, and can also be replaced by histogram transformation.
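  • The grayscale stretching mentioned above can be sketched as a linear rescaling of the observed gray range to 0 to 255. This is only one common form of stretching and is an assumption, since the text does not give the exact mapping; the function name `stretch_gray` is hypothetical.

```python
def stretch_gray(pixels, lo=0, hi=255):
    """Linearly stretch a grayscale image (list of rows) to the range [lo, hi]."""
    flat = [v for row in pixels for v in row]
    vmin, vmax = min(flat), max(flat)
    if vmax == vmin:
        # A flat image has no contrast to stretch; map everything to lo.
        return [[lo for _ in row] for row in pixels]
    span = vmax - vmin
    return [[round(lo + (v - vmin) * (hi - lo) / span) for v in row]
            for row in pixels]
```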
  • edge is the sharp change of image gray scale.
  • Edge extraction is the key to the identification of the logo.
  • the integrity of the edge directly affects the logo recognition result.
  • There are many methods for edge extraction, such as the Canny, LoG, Sobel, and Laplacian operators. Considering the requirements of denoising, edge integrity, and edge positioning accuracy, the Canny edge detection method is adopted in this embodiment.
  • The parameters of the Canny edge detection method are set to a weak edge threshold of 50 and a strong edge threshold of 200. These may be adjusted as needed, for example within a range of ±10.
  • The corresponding preset image threshold may be determined according to the number of video frame images, and whether to retain each edge point is then decided according to whether the number of video frame images containing that edge point is lower than the preset image threshold.
  • That is, the correspondence between the number of video frame images and the preset image threshold is established in advance, the correspondence is looked up according to the number of video frame images to determine the corresponding preset image threshold, and the number of video frame images containing each edge point is counted.
  • When the number of video frame images containing an edge point is lower than the preset image threshold, the edge point is not retained; when it is higher than or equal to the preset image threshold, the edge point is retained.
  • N is the number of video frame images
  • X is a preset image threshold
  • the parameters in the corresponding relationship may be adjusted according to the resolution of the image, which is not limited in this embodiment.
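  • The multi-frame edge synthesis described above amounts to a per-pixel vote. The sketch below assumes each frame's edge map is a binary grid (1 for an edge point) and takes the preset image threshold X as a parameter, because the correspondence between N and X is not reproduced in this text; the function name `vote_edges` is hypothetical.

```python
def vote_edges(edge_maps, threshold):
    """Merge N binary edge maps: keep a point if at least `threshold` maps contain it."""
    rows, cols = len(edge_maps[0]), len(edge_maps[0][0])
    merged = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # Number of video frame images in which this point is an edge point.
            count = sum(m[r][c] for m in edge_maps)
            if count >= threshold:
                merged[r][c] = 1
    return merged
```

  • Points that appear in only a few frames typically belong to moving program content, while the static logo edges survive the vote.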
  • Figure 4 shows the overall effect of the synthesis.
  • the edge noise, the black border, and the non-essential characters all affect the recognition accuracy.
  • the synthesized edge is optimized.
  • The optimization process includes at least one of edge noise deletion, black border removal, and unnecessary text deletion.
  • the logos to be identified may be identified according to the first preset policy for the features of the CCTV logo.
  • the to-be-identified station logo can be identified according to the second preset policy for the feature of the non-CCTV logo.
  • the identification of each station label is not performed in the same manner, but it is first determined whether the station identifier to be identified in the station label area is a CCTV station label, and then different strategies are used for identifying the CCTV station logo and the non-CCTV station logo. It can effectively identify the logos of CCTV logos and non-CCTV logos, which improves the recognition efficiency and recognition accuracy.
  • FIG. 5 is a flowchart of a method for identifying a logo of an embodiment of the present invention. Referring to FIG. 5, the method includes:
  • S501 determining whether the to-be-identified station logo in the logo area is a CCTV station logo
  • To determine whether the to-be-identified station logo in the station logo area is a CCTV station logo, a plurality of methods may be adopted. To ensure the accuracy of the judgment, in the present embodiment this is determined according to the aspect ratio, grayscale, and color of the station logo area.
  • The CCTV logo differs from other logos in the following respects: (1) aspect ratio: the aspect ratio of the CCTV logo (length: vertical length; width: horizontal width) is generally significantly smaller than that of other logos;
  • (2) the color image of the CCTV logo has a large area of white pixels, especially in the left 2/3; (3) after the image of the CCTV logo is divided into blocks, the sub-blocks satisfy constraints on grayscale proximity, such as constraints on the mean and variance.
  • the logo area can be classified by the aspect ratio, the gradation, and the color of the logo area.
  • The logo area can first be screened by aspect ratio: the length H and the width W of each station logo area are calculated, and the aspect ratio is ratio = W / H.
  • The ratio of the CCTV logo is usually below 0.3. Therefore, the initial judgment condition that can be constructed is: ratio ≤ 0.3.
  • The ratios of the logos of Inner Mongolia Satellite TV, Chongqing Satellite TV, Beijing Satellite TV, and so on are also all below 0.3, so logos are first screened by aspect ratio and then filtered again by grayscale and color.
  • the mean value of the red component in the first predetermined range in the upper left corner of the logo area and the red component in the second preset range in the lower right corner of the logo area is smaller than the preset red component.
  • The station logo area can be divided into 5*3 sub-blocks by row*column (it can also be divided into 6*3, 4*3, and so on), and the first sub-block in the upper left corner, area1 (i.e., the first preset range), and the first sub-block in the lower right corner, area2 (i.e., the second preset range), are extracted.
  • the red distribution of the CCTV logo and the Chongqing Satellite TV logo and the Beijing Satellite TV logo are quite different in these two regions.
  • condition 1 ie, Condition1
  • the red mean values of area1 and area2 are less than 150.
  • the sub-area area3 is taken. 50 pixels in the width direction of the logo area can be used as the standard, and the leftmost 8 columns of pixels (that is, 4/25 in width) of the logo area are formed into area3 (ie, the third preset range).
  • Condition1 and Condition2 only contain the information of each sub-block, the constraint needs to be extended to the relationship between sub-blocks.
  • Condition 3 (Condition3) - The absolute difference between the mean values of area4, area5, area6, and area7 does not exceed 100.
  • the CCTV station standard satisfies the following conditions at the four sub-blocks:
  • Condition 4 (Condition4) - The variance of the mean sequence of area4, area5, area6, and area7 does not exceed 1600.
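  • Conditions 3 and 4 above can be checked together once the mean gray values of area4 to area7 are available. The sketch below treats "absolute difference between the mean values" as the largest pairwise difference and uses the population variance; both readings are assumptions, and the function name `blocks_are_close` is hypothetical.

```python
from statistics import pvariance


def blocks_are_close(means, max_diff=100, max_var=1600):
    """Check Condition 3 (pairwise mean difference) and Condition 4 (variance of means)."""
    # The largest pairwise absolute difference equals max - min.
    cond3 = max(means) - min(means) <= max_diff
    # Population variance of the sequence of sub-block means (assumed definition).
    cond4 = pvariance(means) <= max_var
    return cond3 and cond4
```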
  • S502: When the to-be-identified station logo is a CCTV logo, extract the digital area in the logo area, perform bit number discrimination on the extracted digital area, identify the number in the digital area according to the bit number discrimination result, and use the combination of the CCTV logo mark and the identified number as the recognition result of the to-be-identified logo;
  • the CCTV logo includes a logo (ie, CCTV), characters, and numbers.
  • the difference between the common CCTV logos is text and numbers.
  • digital recognition is easier, more stable, and faster than text recognition.
  • The number in the CCTV logo is sufficient on its own to describe the specific channel, so this embodiment removes the text in the logo area (such as "comprehensive" or "finance") and extracts the number in the logo area (i.e., 1, 2, etc.).
  • The text is below the logo mark and is separated from the mark by a clear pixel interval. A preset pixel interval may therefore be set, and it is determined whether the pixel interval between an extracted edge and the mark exceeds the preset pixel interval; if it does, the edge is confirmed as text and deleted. The logo area referred to below is therefore an area including only the number and the mark.
  • The digital area is located within the logo area at a fixed relative position. Therefore, the positional relationship between the logo area and the digital area may be established in advance, and the image is then divided according to the position information of the digital area to obtain the digital area.
  • For the logo area, the following relationship exists between the digital area and the mark area (i.e., the area where "CCTV" is located):
  • the digital area is located on the right side of the mark area, and its width is approximately equal to 1/4 of the mark area;
  • the digital area in the logo area can be extracted according to the positional relationship between the logo area and the digital area.
  • the digital portion and the background portion in the digital area may be binarized, with the digital portion set to white and the background portion set to black.
  • Interference information may affect digit recognition. Therefore, after binarization, interference information may be deleted from the digital area.
  • The white pixel blocks/points at the four corners of the digital area may be deleted in the following manner: the horizontal width of the digital area is W (equal to 0.25 W_A, where W_A is the width of the logo area), the vertical length is H (equal to H_A, where H_A is the height of the logo area), each
  • noise filtering can be performed to further attenuate and reduce noise point effects.
  • the extracted digital region is first subjected to bit number discrimination, and the number in the digital region is identified based on the bit number discrimination result.
  • The bit number discrimination of the extracted digital area can be performed in various manners.
  • The gray value of each pixel in the digital area is projected by column to form a projection vector along the horizontal width of the digital area.
  • When more than a preset number of pixels in a column of the projection vector belong to the digit part, that column is marked. If the distance between two adjacent marked columns is greater than the preset distance, the bit number discrimination result is set to two digits; otherwise, it is set to one digit.
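  • A minimal sketch of the column projection described above follows. It assumes the digital area has already been binarized (1 for digit pixels, 0 for background) and reads the rule as "two digits if any two adjacent marked columns are separated by more than the preset distance"; the parameter names `min_pixels` and `min_gap` and the function name `count_digits` are hypothetical.

```python
def count_digits(binary, min_pixels, min_gap):
    """Return 1 or 2 depending on whether the marked columns form one group or two."""
    cols = len(binary[0])
    # Column projection: number of digit pixels in each column.
    projection = [sum(row[c] for row in binary) for c in range(cols)]
    # Mark columns that contain more than the preset number of digit pixels.
    marked = [c for c, n in enumerate(projection) if n > min_pixels]
    # Distances between adjacent marked columns.
    gaps = [b - a for a, b in zip(marked, marked[1:])]
    return 2 if any(g > min_gap for g in gaps) else 1
```

  • A wide gap between marked columns indicates the blank space between two separate digits, as in CCTV10 versus CCTV5.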
  • the numbers in the digital area can be identified according to the number of digits determination result by the following three steps:
  • The digital area is matched against the standard numbers according to the bit number discrimination result; when the highest matching rate and the second highest matching rate are not equal, the standard number corresponding to the highest matching rate is used as the number in the digital area.
  • S503: When the to-be-identified station logo is a non-CCTV station logo, calculate the matching rate between the edge of the logo area and each standard station logo;
  • Non-CCTV station logos usually include satellite TV station logos and local station logos. In this embodiment, considering how they differ from the CCTV logo, they are divided into logos without text (such as Oriental TV), logos with separable text (such as Southeast Satellite TV; this is the "non-essential text" mentioned above), and logos with text that cannot be completely separated (such as Hebei Satellite TV; this is the "necessary text" mentioned above). The standard library is therefore constructed with standard logos of the corresponding type for each of these three types of satellite station logo.
  • The edge in the station logo area is matched with each standard station logo by the spatial distribution histogram to calculate the matching rate between the edge of the station logo area and each standard station logo.
  • For most satellite stations the aspect ratio of the logo is relatively small, whereas for stations such as Inner Mongolia Satellite TV and Xinjiang Satellite TV (such logos are defined as long station logos) the aspect ratio is significantly larger than that of the other satellite stations. Therefore, the aspect ratio can be used as a classification condition.
  • The ratio of a long station logo is above 1.5, so the aspect ratio condition for identifying a long station logo is: ratio ≥ 1.5.
  • the edge Edge of the logo area is divided into the same number of sub-blocks A1, A2, A3, ... Am, where m is the number of sub-blocks.
  • the edge point probability is counted, that is, the spatial distribution histogram is obtained.
  • the Edge of the short station is divided into 5*5 sub-blocks, that is, 25 sub-blocks; the Edge of the long-station label is divided into 5*10 (rows*columns) sub-blocks, that is, 50 sub-blocks.
  • Each standard station mark in the standard library is also calculated according to the short and long spatial distribution histogram, and the spatial distribution histogram of the station label area is matched with the standard station label sub-block.
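  • The spatial distribution histogram described above can be sketched as the share of edge points falling into each sub-block of a rows×columns grid. The sketch assumes a binary edge map and applies the 1.5 aspect ratio boundary with "greater than or equal"; the function names `grid_for` and `spatial_histogram` are hypothetical, and the sub-block matching formula itself is not reproduced in this text, so it is not shown.

```python
def grid_for(ratio, long_threshold=1.5):
    """Choose the (rows, cols) grid: 5*10 for long logos, 5*5 for short ones."""
    return (5, 10) if ratio >= long_threshold else (5, 5)


def spatial_histogram(edge, rows, cols):
    """Return the probability of an edge point falling in each sub-block A1..Am."""
    h, w = len(edge), len(edge[0])
    counts = [0] * (rows * cols)
    total = 0
    for y in range(h):
        for x in range(w):
            if edge[y][x]:
                r = y * rows // h   # sub-block row index
                c = x * cols // w   # sub-block column index
                counts[r * cols + c] += 1
                total += 1
    if total == 0:
        return [0.0] * (rows * cols)
    return [n / total for n in counts]
```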
  • Δp is used to determine whether pixel information matching is required.
  • The judgment condition is set by comparing Δp with 0.25 (i.e., the first preset difference; other values may also be used).
  • S505: When the difference between the highest matching rate and the second highest matching rate is less than or equal to the first preset difference, determine whether the standard station logo corresponding to the highest matching rate or the second highest matching rate is a local station logo;
  • Under the determination condition Δp ≤ 0.25, it must be determined whether the to-be-identified station logo is a local station logo. In this embodiment, this is done by determining whether the standard logo corresponding to the highest matching rate or the second highest matching rate is a local station logo.
  • If the standard station logo corresponding to the highest matching rate or the second highest matching rate is not a local station logo, the standard station logos corresponding to the highest and second highest matching rates are used as the standard logos to be matched. The color information of the logo area is matched against these standard logos to update their matching rates, and the one with the higher matching rate is used as the recognition result of the to-be-identified station logo.
  • the to-be-identified station logo does not belong to the local station logo, and the possibility of belonging to the satellite station logo is very high.
  • In step (3), the average red gray value m_Ri, the average green gray value m_Gi, and the average blue gray value m_Bi of each sub-block region are calculated; the same sub-block averages are calculated for the standard logos in the standard library, and a three-channel mean match is performed.
  • S507 If the standard station label corresponding to the highest matching rate or the second highest matching rate belongs to the local station label, determining the area to which the to-be-identified station label belongs according to the matching ratio of the edge of the station label area and each standard station label Separating the text area in the logo area, separating the single word part in the text area, identifying the separated single word part, and matching the single word part recognition result with the standard station logo corresponding to the area, The matching standard station logo is used as the recognition result of the to-be-identified station logo.
  • the standard logo corresponding to the region can be understood as all standard logos of the region.
  • the standard station label corresponding to the highest matching rate or the second highest matching rate belongs to the local station label, at this time, the possibility that the to-be-identified station label belongs to the local station label is very high, and usually corresponds to a plurality of standard stations in the same area. Each has a high matching rate, and in this case, the area to which the to-be-identified station logo belongs can be determined according to the matching rates.
  • the area to which the to-be-identified station label belongs is determined according to the number of standard station labels in each area in the obtained standard station logo and the total number of standard station labels in each area, and the following criteria can be adopted:
  • Obtain the standard station logos whose matching rate differs from the highest matching rate by less than the first preset difference (including the standard station logo corresponding to the highest matching rate); determine, from the regions to which the obtained standard station logos belong, the target region with the most standard station logos; determine whether the ratio between the number of obtained standard logos belonging to the target region and the total number of standard logos corresponding to the target region exceeds a preset ratio; and if so, use the target region as the region to which the to-be-identified logo belongs.
  • For example, suppose 10 standard station logos have a matching rate that differs from the highest matching rate by less than the first preset difference, and of these 10 logos, 8 belong to Beijing TV, 1 belongs to Sichuan TV, and 1 belongs to Xinjiang TV.
  • The preset ratio is 50%.
  • The target region is then Beijing, and the ratio between the number of obtained Beijing standard logos (i.e., 8) and the total number of standard logos corresponding to Beijing (i.e., 12) is 66.7%, which exceeds the preset ratio (50%).
  • Beijing is therefore used as the region to which the logo to be identified belongs.
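  • The region decision illustrated by the Beijing example can be sketched as a majority count followed by a ratio check. The input is assumed to be the list of regions of the standard logos that passed the matching-rate filter, plus a table of the total number of standard logos per region; the function name `pick_region` is hypothetical, and the totals used for Sichuan and Xinjiang in the usage lines are made-up placeholders.

```python
from collections import Counter


def pick_region(candidate_regions, totals, preset_ratio=0.5):
    """Return the target region if its share of standard logos exceeds preset_ratio."""
    if not candidate_regions:
        return None
    # Target region: the region with the most obtained standard logos.
    target, hits = Counter(candidate_regions).most_common(1)[0]
    if hits / totals[target] > preset_ratio:
        return target
    return None


# Usage with the numbers from the example: 8 of the 12 Beijing logos were obtained.
candidates = ["Beijing"] * 8 + ["Sichuan", "Xinjiang"]
totals = {"Beijing": 12, "Sichuan": 5, "Xinjiang": 4}  # non-Beijing totals are placeholders
region = pick_region(candidates, totals)
```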
  • To improve the extraction efficiency and accuracy of the text area, in this embodiment the position information of the text area is determined according to the positional relationship between the logo area and the text area and the position information of the logo area, and the text area is extracted from the logo area according to the position information of the text area.
  • Other methods may also be used; this embodiment is not limited in this respect.
  • Taking Beijing TV as an example, the text area is at the same height as the mark "BTV". Therefore, a region to the right of the mark "BTV" whose length is twice the height of the station logo area can be cut out
  • as the text area; that is, the height of the text area is the same as the height of the mark "BTV", and the length of the text area is twice the height of the logo area;
  • The text region is subjected to Otsu binarization processing, and the connected-domain method is used to remove noise interference.
  • the difference lies in the text, such as "Beijing Satellite TV”, “sports”, “life” and so on.
  • the difference can be uniquely described by the first character in the text area.
  • The separated single-word part is the single-word part where the first character in the text area is located, and can be separated by the following process:
  • the identification of the separated single-word portions includes:
  • the single word portion is refined to obtain a refined edge map of the single word portion
  • For the refinement (thinning) process, reference may be made to the practical parallel thinning algorithm described by Lu Yue and Shi Pengfei; its implementation is not repeated here.
  • the standard words in the single word set can also be refined to obtain a refined edge map of each standard word
  • The refined edge map of the single-word portion and the refined edge map of each standard word may be converted to a size of 50×50 pixels.
  • A2 calculating an edge matching ratio between the single word portion and the standard word according to the refined edge of the refined word portion
  • The edge features of the refined text contain the strokes that depict the character, which narrows the range of matching standard data, while the regional features describe the refined text content, enriching the area description and correcting the matched character. Because the amount of refined data is small, the edge features are matched first and the regional features are then used for correction, thereby reducing the amount of calculation.
  • A3 determining whether the matching rate difference between the highest edge matching rate and the second highest edge matching rate is higher than the second preset difference
  • Ps_m1 - Ps_m2 ≥ 0.25 (0.25 can be set to other values)
  • A4 when the matching rate difference is higher than the second preset difference, the standard word corresponding to the highest edge matching rate is used as the word recognition result of the single word area;
  • A5 when the matching ratio difference is not higher than the second preset difference, the edge matching rate is updated according to the regional feature of the refined word part and the standard word matching;
  • Ps_i+ (pi_ds+2*pi_sd)/3 (Ps_i is the i-th element of Ps), a new probability array Ps is formed;
  • A6 The standard word corresponding to the highest edge matching rate is used as the word recognition result of the single word area.
  • the character corresponding to the maximum value Ps_m of Ps is used as the recognized character, and the matching ratio is corrected.
  • Table 1 shows the average recognition time for several typical stations using the method of the present embodiment.
  • the traversal recognition time is short and the difference is not large, both are around 2s.
  • CCTV1: 1.44 s
  • CCTV5: 1.29 s
  • Southeast Satellite TV: 1.93 s
  • Beijing Life: 2.17 s
  • Beijing Sports: 2.25 s
  • Table 2 shows the average matching ratio and recognition rate of several typical stations using the method of the present embodiment. Table 2 demonstrates the effectiveness, stability, and reliability of the method of the present embodiment.
  • the method of the present embodiment is suitable for matching identification of CCTV stations, satellite stations and local stations, achieving high recognition rate recognition of more than 95%, and controlling the traverse recognition time within 3s.
  • FIG. 6 is a structural block diagram of a station logo identification device according to an embodiment of the present invention. Referring to FIG. 6, the device includes:
  • the station logo determining unit 601, configured to determine whether the station logo to be identified in the station logo area is a CCTV station logo;
  • the first identifying unit 602, configured to identify the station logo to be identified according to the first preset policy when the station logo to be identified is a CCTV station logo;
  • the second identifying unit 603, configured to identify the station logo to be identified according to the second preset policy when the station logo to be identified is a non-CCTV station logo.
  • the station logo determining unit is further configured to determine, according to the aspect ratio, the gray scale, and the color of the station logo area, whether the station logo to be identified in the station logo area is a CCTV station logo.
  • the first identifying unit is further configured to extract the digital area in the station logo area, perform digit-count discrimination on the extracted digital area, identify the number in the digital area according to the digit-count discrimination result, and use the combination of the CCTV mark and the recognized number as the identification result of the station logo to be identified.
  • the first identifying unit is further configured to extract the digital area in the station logo area according to the positional relationship between the station logo area and the digital area, and to binarize the digital portion and the background portion in the digital area.
  • the first identifying unit is further configured to project the gray values of the pixels in the digital area column by column to form a projection vector across the horizontal width of the digital area; when a column of the projection vector contains more than a preset number of pixels belonging to the digital part, that column is marked; if the distance between two adjacent marked columns is greater than a preset distance, the digit-count discrimination result is set to two digits, otherwise it is set to one digit.
  • the second identifying unit is further configured to calculate a matching rate between the edges of the station logo area and each standard station logo; when the difference between the highest matching rate and the second highest matching rate is greater than the first preset difference, the standard station logo corresponding to the highest matching rate is used as the recognition result of the station logo to be identified; when the difference between the highest matching rate and the second highest matching rate is less than or equal to the first preset difference, it is determined whether the standard station logo corresponding to the highest matching rate or the second highest matching rate is a local station logo; if the standard station logo corresponding to the highest matching rate or the second highest matching rate is not a local station logo, the standard station logos corresponding to the highest and second highest matching rates are used as the standard station logos to be matched, the color information of the station logo area is matched against them to update their matching rates, and the standard station logo to be matched with the higher matching rate is used as the recognition result of the station logo to be identified.
  • the second identifying unit is further configured to match the edges in the station logo area against each standard station logo by means of a spatial distribution histogram, so as to calculate the matching rate between the edges of the station logo area and each standard station logo.
  • the second identifying unit is further configured to: when the standard station logo corresponding to the highest matching rate or the second highest matching rate is a local station logo, determine, according to the station logo area, the region to which the station logo to be identified belongs; separate the text region in the station logo area; separate the single word portions in the text region; recognize the separated single word portions; match the word recognition results of the single word portions against the corresponding standard station logos; and use the matched standard station logo as the recognition result of the station logo to be identified.
  • the second identifying unit is further configured to: perform refinement processing on the single word portion; calculate, from the edge of the refined single word portion, the edge matching rate between the single word portion and each standard word; determine whether the difference between the highest edge matching rate and the second highest edge matching rate is higher than the second preset difference; when the matching rate difference is higher than the second preset difference, use the standard word corresponding to the highest edge matching rate as the word recognition result of the single word area; when the matching rate difference is not higher than the second preset difference, match the area features of the refined single word portion against the standard words, update the edge matching rates, and use the standard word corresponding to the highest edge matching rate as the word recognition result of the single word area.
  • The device further includes:
  • the area obtaining unit, configured to obtain a video frame image sequence from a preset area of the video that contains the station logo to be identified, perform edge extraction on each video frame image, synthesize the edges of the video frame images, obtain the minimum circumscribed matrix of the synthesized edges, segment each video frame image according to the minimum circumscribed matrix, and synthesize the segmented images by weighted averaging to obtain the station logo area containing the station logo to be identified.
  • FIG. 7 is a block diagram showing the structure of a server according to another embodiment of the present application.
  • the server includes:
  • a processor 701, a memory 702, a communication interface 703, and a bus 704;
  • the processor 701, the memory 702, and the communication interface 703 complete communication with each other through the bus 704;
  • the communication interface 703 is used for information transmission between the server and the user equipment;
  • the processor 701 is configured to invoke logic instructions in the memory 702 to perform the following methods;
  • Referring to FIG. 1, another embodiment of the present invention discloses a computer program, including program code for performing the following operations:
  • when the station logo to be identified is a CCTV station logo, the station logo to be identified is identified according to a first preset policy;
  • when the station logo to be identified is a non-CCTV station logo, the station logo to be identified is identified according to a second preset policy.
  • Another embodiment of the present invention discloses a storage medium for storing a computer program as described in the foregoing embodiments.
  • The foregoing program may be stored in a computer-readable storage medium; when executed, the program performs the steps of the foregoing method embodiments.
  • The foregoing storage medium includes any medium that can store program code, such as a ROM, a RAM, a magnetic disk, or an optical disk.
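Two of the decision rules listed above are compact enough to sketch in code: the single word decision of steps A4 to A6 (accept the best match when it leads by more than 0.25, otherwise update every Ps_i with (pi_ds + 2*pi_sd)/3 and take the new maximum), and the digit-count discrimination by column projection. This is a minimal sketch under stated assumptions, not the patented implementation: the function names, the list-based inputs, the rule that the update applies to every candidate, and the reading that a gap between adjacent marked columns larger than the preset distance means two digits are all hypothetical.

```python
def pick_character(ps, p_ds, p_sd, min_lead=0.25):
    """Return the index of the recognized standard word.

    ps   -- edge matching rates, one per standard word (the array Ps)
    p_ds, p_sd -- per-candidate area-feature scores (pi_ds and pi_sd);
                  how they are computed is not covered by this sketch
    """
    ranked = sorted(ps, reverse=True)
    # A4: a clear lead of the best candidate is accepted directly.
    if len(ps) == 1 or ranked[0] - ranked[1] > min_lead:
        return ps.index(ranked[0])
    # A5: otherwise update every rate with (pi_ds + 2*pi_sd)/3.
    updated = [p + (ds + 2 * sd) / 3 for p, ds, sd in zip(ps, p_ds, p_sd)]
    # A6: the highest updated rate wins.
    return updated.index(max(updated))


def count_digits(digit_area, min_pixels, min_gap):
    """Return 1 or 2 for a binarized digit area (1 = digit pixel, 0 = background)."""
    # Column projection: number of digit pixels in each column.
    column_sums = [sum(column) for column in zip(*digit_area)]
    # Mark columns holding more than the preset number of digit pixels.
    marked = [x for x, total in enumerate(column_sums) if total > min_pixels]
    # A gap between adjacent marked columns wider than min_gap separates two digits.
    for left, right in zip(marked, marked[1:]):
        if right - left > min_gap:
            return 2
    return 1
```

The thresholds `min_pixels` and `min_gap` stand in for the "preset number" and "preset distance" of the text, whose values are not given.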

Abstract

The invention relates to the technical field of information identification, and specifically to a method and device for identifying a digital on-screen graphic. The method comprises: determining whether a digital on-screen graphic to be identified in a digital on-screen graphic field is a central television digital on-screen graphic (S101); if so, identifying the digital on-screen graphic to be identified according to a first predefined strategy (S102); and if not, identifying the digital on-screen graphic to be identified according to a second predefined strategy (S103). The method and device do not identify all digital on-screen graphics in the same way. Instead, they first determine whether the digital on-screen graphic to be identified is a central television digital on-screen graphic, and then adopt different strategies for central television and non-central television digital on-screen graphics, so that the graphic to be identified is identified according to the characteristics of each kind, which increases identification efficiency and identification accuracy.

Description

Station logo identification method and device
Cross reference
The present application claims priority to Chinese Patent Application No. 201510824167.4, entitled "Station logo identification method and device", filed with the Chinese Patent Office on November 24, 2015, the entire contents of which are incorporated herein by reference.
Technical field
The embodiments of the present invention relate to the field of information recognition technologies, and in particular to a station logo identification method and device.
Background
Smart TVs are developing rapidly in line with the trends toward high definition, networking, and intelligence. They can obtain program content from multiple channels such as the Internet, video devices, and computers and, through information fusion, clearly present the content consumers most need on a large screen. Compared with traditional TVs, smart TVs provide users with faster, smarter, and more user-friendly application services.
Smart TVs carry a large number of on-demand videos and live programs, and most TV programs retain the station logo of the program producer. The station logo is an important mark for distinguishing TV stations. It carries important semantic information such as the station name, program source, and program orientation, and is an important semantic source for video analysis, understanding, and retrieval. Station logo recognition technology will effectively enable the program guide function and is of great significance for understanding user preferences and delivering value-added services.
In the prior art, the following two schemes are usually adopted to recognize station logos:
The first scheme: station logo recognition based on a single frame image. This approach uses the edge distance transformation matrix as the feature and relies on template matching, including global edge matching (a sliding window traverses the entire logo area), block matching (non-edges are filtered manually), and color-shape matching (color and shape features are matched simultaneously to classify and sort for program search).
The second scheme: station logo recognition based on multi-frame images. This scheme generally uses one of the following three methods. Method 1: segment the image using changes in the feature values of pixels in the sensitive area (that is, the logo area) of a sequence of consecutive video frames, perform edge detection, and perform matching recognition using sliding-window and block methods. Method 2: use color and shape features together, segment the logo in the frame sequence according to spatio-temporal invariance, describe the features using a spatial distribution histogram combined with an HSV color histogram, and finally complete logo recognition using SUV. Method 3: compute the changes between adjacent frame images, extract the station logo and its Hu invariant moments, and obtain the recognition result according to a candidate set and preset criteria.
However, the prior art recognizes CCTV station logos, satellite TV station logos, and local station logos in the same manner, which results in low recognition efficiency and low recognition accuracy.
Summary of the invention
The embodiments of the present invention provide a station logo identification method and device to overcome the low recognition efficiency and low recognition accuracy of the prior art.
An embodiment of the present invention provides a station logo identification method, where the method includes:
determining whether the station logo to be identified in the station logo area is a CCTV station logo;
when the station logo to be identified is a CCTV station logo, identifying the station logo to be identified according to a first preset policy;
when the station logo to be identified is a non-CCTV station logo, identifying the station logo to be identified according to a second preset policy.
An embodiment of the present invention provides a station logo identification device, where the device includes:
a station logo determining unit, configured to determine whether the station logo to be identified in the station logo area is a CCTV station logo;
a first identifying unit, configured to identify the station logo to be identified according to a first preset policy when the station logo to be identified is a CCTV station logo;
a second identifying unit, configured to identify the station logo to be identified according to a second preset policy when the station logo to be identified is a non-CCTV station logo.
An embodiment of the present invention provides a server, including:
a processor, a memory, a communication interface, and a bus; wherein
the communication interface is used for information transmission between the user equipment and the server;
the processor is configured to invoke logic instructions in the memory to perform the following method:
determining whether the station logo to be identified in the station logo area is a CCTV station logo; when the station logo to be identified is a CCTV station logo, identifying the station logo to be identified according to a first preset policy; when the station logo to be identified is a non-CCTV station logo, identifying the station logo to be identified according to a second preset policy.
An embodiment of the present invention provides a computer program, including program code, where the program code is used to perform the following operations:
determining whether the station logo to be identified in the station logo area is a CCTV station logo;
when the station logo to be identified is a CCTV station logo, identifying the station logo to be identified according to a first preset policy;
when the station logo to be identified is a non-CCTV station logo, identifying the station logo to be identified according to a second preset policy.
An embodiment of the present invention provides a storage medium for storing the above computer program.
The station logo identification method and device provided by the embodiments of the present invention no longer recognize every station logo in the same manner. Instead, they first determine whether the station logo to be identified in the station logo area is a CCTV station logo, and then apply different strategies to CCTV and non-CCTV station logos. The station logo to be identified can thus be recognized according to the characteristics of CCTV and non-CCTV station logos, which improves recognition efficiency and recognition accuracy.
Brief description of the drawings
FIG. 1 is a flowchart of a station logo identification method according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a video frame image before area segmentation in an embodiment of the present invention;
FIG. 3 is a schematic diagram of a video frame image after area segmentation in an embodiment of the present invention;
FIG. 4 shows the overall effect of synthesizing the edges of the video frame images, taking CCTV5 as an example;
FIG. 5 is a flowchart of a station logo identification method according to an embodiment of the present invention;
FIG. 6 is a structural block diagram of a station logo identification device according to an embodiment of the present invention;
FIG. 7 is a schematic structural diagram of a server according to an embodiment of the present invention.
Detailed description
To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
It should be understood that although the background focuses on station logo recognition for TV programs on smart TVs, the embodiments of the present disclosure are not limited thereto and also apply to station logo recognition for TV programs on other devices (such as tablets, mobile phones, and PCs). That is, any scenario involving station logo recognition for TV programs falls within the scope of application of this embodiment.
FIG. 1 is a flowchart of a station logo identification method according to an embodiment of the present invention. Referring to FIG. 1, the method includes:
S101: determining whether the station logo to be identified in the station logo area is a CCTV station logo;
It should be noted that the station logo area is an area that includes only the station logo to be identified.
It can be understood that the station logo area can be extracted in many ways. To limit the influence of noise such as random noise and picture noise on station logo recognition, this embodiment obtains the station logo area containing the station logo to be identified through the following steps:
(1) acquiring a video frame image sequence from a preset area of the video that contains the station logo to be identified;
Based on prior knowledge, almost all TV station logos are located in the upper left corner of the video frame image (if a logo is located elsewhere, the area can be adjusted accordingly). Therefore, for logo detection it suffices to extract a fixed upper-left area (that is, the preset area) as the logo detection area. Existing station logo recognition methods generally obtain the logo area according to the optimal area rule (GSR). This embodiment differs in that it (1) calculates the proportional position at which each station logo is effectively recognized in the video frame images, and (2) takes the maximum range over all proportional positions as the area for logo segmentation. Taking a 1920*1080 video as an example, the logo segmentation area is: row start position 80 (1/24), column start position 40 (1/27), row width 450 (15/64), column width 180 (1/6). The segmentation effect is shown in FIG. 2 and FIG. 3. The proportional positions can be adjusted as needed, which this embodiment does not limit.
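The proportional positions in the 1920*1080 example can be turned into pixel coordinates for any frame size. The sketch below is illustrative only: the function and constant names are hypothetical, and it assumes that the values 80 and 450 are fractions of the frame width (80/1920 = 1/24, 450/1920 = 15/64) while 40 and 180 are fractions of the frame height (40/1080 = 1/27, 180/1080 = 1/6), which is what the quoted ratios imply.

```python
from fractions import Fraction

# Proportions quoted in the text for the logo segmentation area.
X_START = Fraction(1, 24)   # horizontal offset as a fraction of frame width
Y_START = Fraction(1, 27)   # vertical offset as a fraction of frame height
REGION_W = Fraction(15, 64) # region width as a fraction of frame width
REGION_H = Fraction(1, 6)   # region height as a fraction of frame height


def logo_search_region(frame_width, frame_height):
    """Return (x, y, width, height) of the preset logo area in pixels."""
    x = int(frame_width * X_START)
    y = int(frame_height * Y_START)
    width = int(frame_width * REGION_W)
    height = int(frame_height * REGION_H)
    return x, y, width, height
```

For a 1920*1080 frame this reproduces the figures in the text; fractional results for other sizes are truncated toward zero, which is a choice of this sketch.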
To eliminate irrelevant information in the image, restore or enhance useful information, improve the detectability of features, and simplify the data as much as possible so that recognition is reliable, this embodiment may preprocess each video frame image. The preprocessing includes at least one of area segmentation, grayscale conversion, and image enhancement, and may include other processing, which this embodiment does not limit.
The preprocessing may perform grayscale conversion using the formula Gray=0.33R+0.59G+0.11B, where Gray is the gray value of the pixel, R is its red component, G is its green component, and B is its blue component. The three-channel average method or the three-channel maximum method may be used instead.
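The three grayscale conversions named above can be written as follows. This is a minimal per-pixel sketch with hypothetical function names, assuming 8-bit channel values. The coefficients are used exactly as printed; since they sum to 1.03, the weighted result is clamped to 255 here, which is a choice of this sketch rather than something the text specifies.

```python
def gray_weighted(r, g, b):
    """Gray = 0.33R + 0.59G + 0.11B, rounded and clamped to the 8-bit range."""
    value = 0.33 * r + 0.59 * g + 0.11 * b
    return min(255, int(round(value)))


def gray_average(r, g, b):
    """Three-channel average method."""
    return int(round((r + g + b) / 3))


def gray_maximum(r, g, b):
    """Three-channel maximum method."""
    return max(r, g, b)
```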
The purpose of image enhancement is to highlight the useful information in the station logo area, such as icons, text, and digits. Image enhancement uses gray stretching over the 0~255 gray levels; histogram transformation may be used instead.
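The text does not say which form of gray stretching is used. One common form is a linear min-max stretch, sketched below as an assumption: the darkest pixel maps to 0, the brightest to 255, and everything in between is scaled linearly. The function name and the list-of-rows image representation are hypothetical.

```python
def stretch_gray(image, out_min=0, out_max=255):
    """Linearly stretch a grayscale image (list of rows) to [out_min, out_max]."""
    flat = [value for row in image for value in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        # A flat image has no contrast to stretch; return an unchanged copy.
        return [row[:] for row in image]
    scale = (out_max - out_min) / (hi - lo)
    return [[int(round(out_min + (value - lo) * scale)) for value in row]
            for row in image]
```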
(2) performing edge extraction on each video frame image;
It can be understood that edges are where the image gray level changes sharply. Edge extraction is key to station logo recognition, and the completeness of the edges directly affects the recognition result. Many edge extraction methods exist, such as Canny, LOG, Sobel, and the Laplacian operator. Considering the requirements on denoising, edge completeness, and edge localization accuracy, this embodiment uses the Canny edge detection method.
In a specific implementation, the parameters of the Canny edge detection method are set to a weak edge threshold of 50 and a strong edge threshold of 200. These may be adjusted as needed, for example, by letting each threshold vary within ±10.
(3) synthesizing the edges of the video frame images;
In a specific implementation, the corresponding preset image threshold is determined according to the number of video frame images, and an edge point is retained or discarded according to whether the number of video frame images in which it appears is below that preset image threshold.
That is, a correspondence between the number of video frame images and the preset image threshold is established in advance and looked up according to the number of video frame images to determine the corresponding preset image threshold. An edge point that appears in fewer video frame images than the preset image threshold is discarded; an edge point that appears in at least as many video frame images as the preset image threshold is retained.
The synthesis of the edges of the video frame images is illustrated below with a specific example, which does not limit the protection scope of the present invention. Let N be the number of video frame images and X the preset image threshold.
When N=6, X=4: an edge point is retained only if it is present in 4 or more video frame images, and is discarded if it is present in 3 or fewer;
When 6>N>3, X=3: an edge point is retained only if it is present in 3 or more video frame images, and is discarded if it is present in 2 or fewer;
When N≤3, X=N: an edge point is retained only if it is present in all video frame images, and is discarded otherwise.
The parameters in the correspondence can be adjusted according to the image resolution, which this embodiment does not limit.
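The voting rule in the example above can be sketched as follows. The function names and the representation of each edge map as a list of rows of 0/1 values are assumptions for illustration. Because the text gives the mapping from N to X only for N up to 6, the sketch rejects larger N rather than guessing a value.

```python
def vote_threshold(n_frames):
    """Preset image threshold X for N frames, following the example in the text."""
    if not 1 <= n_frames <= 6:
        # The example only defines the mapping for N up to 6.
        raise ValueError("mapping is only given for 1 <= N <= 6")
    if n_frames == 6:
        return 4
    if n_frames > 3:
        return 3
    return n_frames


def merge_edges(edge_maps):
    """Keep a pixel as an edge point if it is an edge in at least X of the maps."""
    x = vote_threshold(len(edge_maps))
    height, width = len(edge_maps[0]), len(edge_maps[0][0])
    merged = []
    for r in range(height):
        row = []
        for c in range(width):
            votes = sum(1 for edge_map in edge_maps if edge_map[r][c])
            row.append(1 if votes >= x else 0)
        merged.append(row)
    return merged
```

With six single-row maps in which the three pixels are edges in 6, 4, and 3 maps respectively, the first two pixels are kept and the third is dropped.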
Taking CCTV5 as an example, FIG. 4 shows the overall effect of the synthesis.
Edge noise, black borders, and unnecessary text all affect recognition accuracy. To further improve recognition accuracy, the synthesized edges may be optimized. In this embodiment, the optimization includes at least one of edge noise removal, black border removal, and unnecessary text removal.
(4) obtaining the minimum circumscribed matrix of the synthesized edges;
(5) segmenting each video frame image according to the minimum circumscribed matrix, and synthesizing the segmented images by weighted averaging to obtain the station logo area containing the station logo to be identified.
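Steps (4) and (5) can be sketched as below. The sketch reads the "minimum circumscribed matrix" as the smallest axis-aligned rectangle enclosing all synthesized edge points, which is an interpretation, and it defaults to equal weights because the text does not state the weights used in the weighted average. All names and the list-of-rows image representation are hypothetical.

```python
def bounding_box(edge_map):
    """Smallest rectangle (top, left, bottom, right) enclosing all edge points."""
    rows = [r for r, row in enumerate(edge_map) if any(row)]
    cols = [c for c in range(len(edge_map[0]))
            if any(row[c] for row in edge_map)]
    if not rows:
        raise ValueError("edge map contains no edge points")
    return rows[0], cols[0], rows[-1], cols[-1]


def crop(image, box):
    """Cut the rectangle (inclusive bounds) out of a grayscale image."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in image[top:bottom + 1]]


def logo_area(frames, edge_map, weights=None):
    """Crop every frame to the bounding box and blend the crops by weighted average."""
    box = bounding_box(edge_map)
    crops = [crop(frame, box) for frame in frames]
    if weights is None:
        weights = [1.0] * len(crops)  # assumption: equal weights
    total = sum(weights)
    height, width = len(crops[0]), len(crops[0][0])
    return [[sum(w * img[r][c] for img, w in zip(crops, weights)) / total
             for c in range(width)]
            for r in range(height)]
```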
S102: when the station logo to be identified is a CCTV station logo, identifying the station logo to be identified according to a first preset policy;
In a specific implementation, CCTV station logos differ from one another only in their digits and text, so the station logo to be identified can be recognized according to the first preset policy, which targets the characteristics of CCTV station logos.
S103: when the station logo to be identified is a non-CCTV station logo, identifying the station logo to be identified according to a second preset policy.
It can be understood that the station logo to be identified can be recognized according to the second preset policy, which targets the characteristics of non-CCTV station logos.
This embodiment no longer recognizes every station logo in the same manner. Instead, it first determines whether the station logo to be identified in the station logo area is a CCTV station logo, and then applies different strategies to CCTV and non-CCTV station logos. The station logo to be identified can thus be recognized according to the characteristics of CCTV and non-CCTV station logos, which improves recognition efficiency and recognition accuracy.
FIG. 5 is a flowchart of a station logo identification method according to an embodiment of the present invention. Referring to FIG. 5, the method includes:
S501: determining whether the station logo to be identified in the station logo area is a CCTV station logo;
It can be understood that there are many ways to determine whether the station logo to be identified in the station logo area is a CCTV station logo. To keep this determination accurate, this embodiment makes it according to the aspect ratio, gray level, and color of the station logo area.
Comparing CCTV station logos with satellite TV and local station logos reveals the following characteristics and differences: (1) aspect ratio: the aspect ratio of CCTV station logos (length: extent in the vertical direction; width: extent in the horizontal direction) is generally markedly smaller than that of other station logos; (2) the color image of a CCTV station logo has extensive white pixels, especially in the left 2/3; (3) after the grayscale image of a CCTV station logo is divided into blocks, the sub-blocks satisfy constraints of similar gray level, such as in mean and variance.
Therefore, the station logo area can be classified by differences in its aspect ratio, gray level, and color.
Aspect ratio is one of the most direct features of a station logo. Therefore, the station logo area can first be screened by aspect ratio: compute the length H and width W of the station logo area, and the aspect ratio is ratio=W/H.
The ratio of a CCTV station logo is usually below 0.3, so the preliminary condition is ratio<0.3. However, the ratios of satellite TV station logos such as Inner Mongolia Satellite TV, Chongqing Satellite TV, and Beijing Satellite TV (including the local stations of these broadcasters) are also below 0.3, so after screening by aspect ratio, a second screening by gray level and color is performed.
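The preliminary aspect-ratio screen is a single comparison. The sketch below uses W and H exactly as the text defines them and a hypothetical function name; passing it only marks a candidate, since the text notes that some satellite TV station logos also fall below 0.3 and need the later gray-level and color screening.

```python
def passes_aspect_ratio(width_w, length_h, limit=0.3):
    """Preliminary screen from the text: ratio = W / H must be below the limit."""
    if length_h <= 0:
        raise ValueError("H must be positive")
    return width_w / length_h < limit
```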
When screening again by gray level and color, classification can follow the conditions below, which are set for further judgment:
(1) The mean of the red component within a first preset range in the upper left corner of the station logo area and of the red component within a second preset range in the lower right corner of the station logo area is less than a preset red component.
That is, the station logo area can be divided into 5*3 sub-blocks by rows*columns (6*3 or 4*3 blocks may also be used), and the first sub-block in the upper left corner, area1 (the first preset range), and the first sub-block in the lower right corner, area2 (the second preset range), are extracted. The red distributions of CCTV station logos in these two areas differ sharply from those of station logos such as Chongqing Satellite TV and Beijing Satellite TV.
Combining the color characteristics of the interfering station logos and the CCTV station logos, condition 1 (Condition1) is constructed: the red means of area1 and area2 are less than 150.
(2)所述台标区域左侧第三预设范围内的灰度均值小于预设灰度值。(2) The gray mean value in the third preset range on the left side of the logo area is smaller than the preset gray value.
考虑Condition1的容错性和央视台标半透明的特点,取子块area3。可以台标区域的宽度方向50个像素为标准,取台标区域最左8列像素(即宽的4/25)构成area3(即第三预设范围)。Considering the fault tolerance of the Condition1 and the semi-transparent characteristics of the CCTV logo, the sub-area area3 is taken. 50 pixels in the width direction of the logo area can be used as the standard, and the leftmost 8 columns of pixels (that is, 4/25 in width) of the logo area are formed into area3 (ie, the third preset range).
分析干扰台标和CCTV台标在area3的灰度差异,构建条件2(Condition2)——area3的灰度均值小于100。Analyze the difference between the gray level of the interference station and the CCTV station in area3, and build condition 2 (Condition2)—the average gray level of area3 is less than 100.
可理解的是,可通过三通道经典合成法Gray=0.33R+0.59G+0.11B来获得灰度图像,也可通过三通道最大值法、三通道平均值法等,本实施方式对此不加以限制。 It can be understood that the grayscale image can be obtained by the three-channel classical synthesis method Gray=0.33R+0.59G+0.11B, or the three-channel maximum value method, the three-channel average method, etc., and the present embodiment does not Limit it.
(3) The logo area is divided into at least 4 parts, and the absolute difference between the pixel means of the preset parts after division does not exceed a preset absolute difference;
Condition1 and Condition2 use only information from individual sub-blocks, so the constraints need to be extended to the relationships between sub-blocks. Analysis shows that the text and digit pixels in the logo area lie mainly in the last third of its columns. The logo area is therefore divided into 2*3 sub-blocks (rows*columns; 3*3 or other divisions are also possible), and the first 2*2 sub-blocks (the preset parts after division) are denoted area4, area5, area6, and area7.
Rigorous computation over many samples shows that CCTV logos essentially satisfy the following condition on these 4 sub-blocks:
Condition3: the absolute differences between the means of area4, area5, area6, and area7 do not exceed 100.
(4) The variance of the pixel means of the preset parts after division does not exceed a preset variance.
That is, the variance of the pixel means of these 4 sub-blocks also differs between logos, so CCTV logos satisfy the following condition on these 4 sub-blocks:
Condition4: the variance of the sequence of means of area4, area5, area6, and area7 does not exceed 1600.
A logo sample that satisfies Condition1 to Condition4 simultaneously is judged to be a CCTV logo; otherwise it is judged to be a non-CCTV logo. Accuracy is highest when all the conditions are applied together; experiments show that omitting any one condition raises the error rate of multi-sample classification.
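The screening described above can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the function name and its arguments are hypothetical, the region statistics are assumed to have been computed beforehand, Condition3 is read as "the largest pairwise difference between the four block means is at most 100", and Condition4 is read as a population variance.

```python
from statistics import pvariance


def is_cctv_logo(ratio, red_mean_area1, red_mean_area2, gray_mean_area3, block_means):
    """Return True if a sample passes the ratio check and Condition1 to Condition4.

    All arguments are precomputed statistics of the logo area (hypothetical
    interface): the aspect ratio, the red means of area1 and area2, the gray
    mean of area3, and the four means of area4 to area7.
    """
    if len(block_means) != 4:
        raise ValueError("expected the means of area4, area5, area6 and area7")
    preliminary = ratio < 0.3
    condition1 = red_mean_area1 < 150 and red_mean_area2 < 150
    condition2 = gray_mean_area3 < 100
    # Assumption: "absolute difference" is taken as the largest pairwise difference.
    condition3 = max(block_means) - min(block_means) <= 100
    # Assumption: population variance of the sequence of four means.
    condition4 = pvariance(block_means) <= 1600
    return preliminary and condition1 and condition2 and condition3 and condition4
```

With these readings, block means such as 50, 150, 50, 150 pass Condition3 but fail Condition4, which illustrates why the two conditions are not redundant.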
S502: When the logo to be identified is a CCTV logo, extract the digit area from the logo area, determine the number of digits in the extracted digit area, recognise the digits in the digit area according to the digit-count result, and take the combination of the CCTV mark and the recognised digits as the recognition result for the logo to be identified;
Note that a CCTV logo contains a mark (CCTV), text, and digits, and common CCTV logos differ in their text and digits. In pattern recognition, digit recognition is easier, more stable, and faster than text recognition. The digits of a CCTV logo alone also identify the specific channel, so this embodiment removes the text in the logo area (such as "综合" (General) or "财经" (Finance)) and extracts the digits (1, 2, and so on). The text lies below the logo and is separated from the mark by a clear pixel gap. A preset pixel gap can be set: if the gap between an extracted edge and the mark exceeds the preset pixel gap, the edge is treated as text and deleted. The logo area referred to below therefore contains only the digits and the mark.
The digit area lies within the logo area in a fixed positional relationship. The positional relationship between the logo area and the digit area can therefore be established in advance, and the image is then segmented according to the position information of the digit area to obtain the digit area.
Within the logo area, the digit area and the mark area (the area containing "CCTV") are related as follows:
(1) the digit area lies to the right of the mark area, and its width is approximately 1/4 of that of the mark area;
(2) the digit area is the same height as the letters in the mark area, about 0.8 of the overall height of the CCTV logo.
The digit area can therefore be extracted from the logo area according to the positional relationship between the logo area and the digit area.
In addition, to make the digits easier to recognise, the digit area can be binarised into a digit part and a background part, with the digit part set to white and the background part set to black.
White pixel blocks or points tend to appear at the four corners of the digit area, and the digit area also contains noise points. Because this interference affects digit recognition, in this embodiment it can be removed from the binarised digit area.
In a specific implementation, the white pixel blocks or points at the four corners of the digit area can be deleted as follows: let the horizontal width of the digit area be W (equal to 0.25 W_A, where W_A is the width of the logo area) and its vertical length be H (equal to H_A, where H_A is the height of the logo area), each
Figure PCTCN2016088466-appb-000001
Noise points in the digit area can be handled by noise filtering, which further weakens and reduces their influence.
To further improve the accuracy of digit recognition, in this embodiment the number of digits in the extracted digit area is determined first, and the digits are then recognised according to that result.
The number of digits can be determined in several ways. In this embodiment, the gray values of the pixels in the digit area are projected column by column to form a projection vector whose length equals the horizontal width of the digit area. A column whose projection contains more than a preset number of pixels belonging to the digit part is marked. If two adjacent marked columns are separated by more than a preset distance, the digit-count result is set to two digits; otherwise it is set to one digit.
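A minimal Python sketch of this column-projection test is given below. It assumes the digit area has already been binarised (1 for a digit pixel, 0 for background); the function name and the two threshold parameters are hypothetical stand-ins for the "preset number" and "preset distance" of the text.

```python
def count_digits(binary, min_pixels, min_gap):
    """Return 1 or 2 depending on the column projection of a binarised digit area.

    binary: list of rows, each a list of 0/1 values (1 = digit pixel).
    min_pixels: a column is marked when it holds more than this many digit pixels.
    min_gap: two adjacent marked columns further apart than this imply two digits.
    """
    if not binary:
        return 1
    width = len(binary[0])
    # Column-wise projection: number of digit pixels in each column.
    projection = [sum(row[x] for row in binary) for x in range(width)]
    marked = [x for x, count in enumerate(projection) if count > min_pixels]
    # A wide gap between neighbouring marked columns separates two digits.
    for left, right in zip(marked, marked[1:]):
        if right - left > min_gap:
            return 2
    return 1
```

For instance, a four-row image whose rows are all `[1, 1, 0, 0, 0, 1, 1]` has marked columns 0, 1, 5, and 6, so with `min_gap=2` it is classified as two digits.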
To further improve the efficiency of digit recognition, in this embodiment the digits in the digit area can be recognised according to the digit-count result in the following 3 steps:
(1) Obtain the white pixel region A in the digit area, and compute its horizontal width w and vertical height h. If h/w > 2, the logo is the CCTV-1 logo; otherwise go to (2).
(2) Perform edge extraction on the digit area and divide it into m*n sub-blocks (rows*columns), as shown in Table 1. Build the spatial distribution histogram of edge-point probability and compute the matching probability against the corresponding histograms of the standard digit images 0 and 2 to 9.
Table 1: Block parameters
Figure PCTCN2016088466-appb-000002
(3) Recognise the digits in the digit area according to the digit-count result. If the result is one digit, the probability that the logo is CCTV-1 or CCTV-10 to CCTV-15 is 0; if the result is two digits, the probability that the logo is any of CCTV-1 to CCTV-9 is 0. This completes the recognition of the digits in the digit area.
To guard against inaccurate recognition, in this embodiment the digit area is matched against the standard digits according to the digit-count result, and when the highest and second-highest matching rates are not equal, the standard digit with the highest matching rate is taken as the digit in the digit area.
When the highest and second-highest matching rates are equal, no unique identification is made.
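The masking in step (3) and the tie rule can be sketched as follows. The sketch assumes that a matching rate per channel number is already available (how those rates are obtained from the digit histograms is not modelled here), and the function name and dictionary interface are hypothetical.

```python
def pick_channel(match_rates, digit_count):
    """Pick a CCTV channel number from per-channel matching rates.

    match_rates: dict mapping channel number (1 to 15) to a matching rate.
    digit_count: 1 or 2, the result of the digit-count step.
    Returns the channel number, or None when no unique best match exists.
    """
    # One digit: CCTV-1 and CCTV-10 to CCTV-15 get probability 0.
    # Two digits: CCTV-1 to CCTV-9 get probability 0.
    candidates = range(2, 10) if digit_count == 1 else range(10, 16)
    masked = {ch: (rate if ch in candidates else 0.0) for ch, rate in match_rates.items()}
    ranked = sorted(masked.items(), key=lambda item: item[1], reverse=True)
    if not ranked:
        return None
    # Equal highest and second-highest rates: no unique identification.
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return None
    return ranked[0][0]
```

CCTV-1 is excluded in the one-digit branch because, in the text, it has already been decided by the h/w > 2 test of step (1).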
S503: When the logo to be identified is a non-CCTV logo, compute the matching rate between the edges of the logo area and each standard logo;
Non-CCTV logos usually comprise satellite-channel logos and local-channel logos. In this embodiment, in view of their features and differences, satellite-channel logos are divided into logos without text (such as Dragon TV), logos with separable text (such as Southeast TV; the "non-essential text" mentioned above), and logos with text that cannot be fully separated (such as Hebei TV; the "essential text" mentioned above). The standard logos in the standard library are therefore built per type for these three types of satellite-channel logo.
To simplify computing the matching rate between the edges of the logo area and each standard logo, in this embodiment the edges in the logo area are matched against the standard logos through spatial distribution histograms.
Logos such as those of Jiangsu TV and Dragon TV consist only of a circular or elliptical mark (defined as short logos) and have a small aspect ratio, whereas logos such as those of Inner Mongolia TV and Xinjiang TV (defined as long logos) have a markedly larger aspect ratio. The aspect ratio can therefore serve as a classification condition. The ratio of a long logo is 1.5 or more, so the aspect-ratio condition at the recognition level is ratio ≥ 1.5. After classification, matching is performed only against standard logos of the same aspect-ratio type rather than against all standard logos, which further shortens the matching time.
In this embodiment, the edge image Edge of the logo area is divided into sub-blocks A1, A2, A3, ..., Am, where m is the number of sub-blocks. The edge-point probability is computed for each sub-block, which gives the spatial distribution histogram. For example, the Edge of a short logo is divided into 5*5 = 25 sub-blocks, and the Edge of a long logo into 5*10 (rows*columns) = 50 sub-blocks. Each standard logo in the standard library has its spatial distribution histogram computed in the same short or long form, and the histogram of the logo area is matched against that of each standard logo block by block. Let n be the number of matched sub-blocks (initially 0) and N the total number of sub-blocks. If the probabilities of a sub-block are very close (for example, the probability difference is < 0.05), then n = n + 1. When matching ends, the matching rate is p = n/N. Traversing all standard logos gives the matching-rate array P.
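The block-wise matching can be illustrated with the Python sketch below. The text does not define "edge-point probability" precisely; the sketch assumes it is the share of pixels in a sub-block that are edge points. Function names are hypothetical, and the edge image is a list of rows of 0/1 values.

```python
def block_edge_histogram(edge, rows, cols):
    """Return the per-block edge-point share, scanned row by row.

    edge: list of rows of 0/1 values (1 = edge point).
    Assumption: "edge-point probability" is the fraction of a block's pixels
    that are edge points.
    """
    height, width = len(edge), len(edge[0])
    hist = []
    for r in range(rows):
        y0, y1 = r * height // rows, (r + 1) * height // rows
        for c in range(cols):
            x0, x1 = c * width // cols, (c + 1) * width // cols
            pixels = [edge[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            hist.append(sum(pixels) / len(pixels) if pixels else 0.0)
    return hist


def match_rate(hist_a, hist_b, tolerance=0.05):
    """Return p = n/N, where n counts blocks whose values differ by less than tolerance."""
    if not hist_a or len(hist_a) != len(hist_b):
        raise ValueError("histograms must be non-empty and of equal length")
    n = sum(1 for a, b in zip(hist_a, hist_b) if abs(a - b) < tolerance)
    return n / len(hist_a)
```

Calling `match_rate` once per standard logo of the same short or long type yields the array P described above.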
The parameter Δp is used to decide whether pixel-information matching is needed. It is defined as Δp = p_max - p_second, where p_max is the maximum of the matching-rate array P (the highest matching rate) and p_second is the largest value of P other than the maximum (the second-highest matching rate).
S504: When the difference between the highest and second-highest matching rates is greater than a first preset difference, take the standard logo with the highest matching rate as the recognition result for the logo to be identified;
Because matching-based recognition has some fault tolerance, the condition Δp ≥ 0.25 is set (0.25 being the first preset difference; other values may also be used). If Δp ≥ 0.25 holds, the standard logo corresponding to p_max is taken as the recognition result for the logo to be identified.
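As a small illustration of this margin test, the sketch below computes Δp from a set of matching rates and reports whether the best match can be accepted directly. The function name and the dictionary interface are hypothetical; the comparison follows the Δp ≥ 0.25 wording.

```python
def decide_by_margin(rates, threshold=0.25):
    """Return (best_name, delta_p, accepted) for a dict of matching rates.

    rates: dict mapping a standard logo name to its matching rate p.
    accepted is True when delta_p = p_max - p_second reaches the threshold.
    """
    if len(rates) < 2:
        raise ValueError("at least two standard logos are needed")
    ranked = sorted(rates.items(), key=lambda item: item[1], reverse=True)
    (best, p_max), (_, p_second) = ranked[0], ranked[1]
    delta_p = p_max - p_second
    return best, delta_p, delta_p >= threshold
```

When `accepted` is False, the later steps (local-channel check, color matching, or character recognition) are needed to settle the result.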
S505: When the difference between the highest and second-highest matching rates is less than or equal to the first preset difference, determine whether the standard logo corresponding to the highest or second-highest matching rate is a local-channel logo;
When Δp ≥ 0.25 does not hold, it must be determined whether the logo to be identified is a local-channel logo. In this embodiment, this is done by determining whether the standard logo corresponding to the highest or second-highest matching rate is a local-channel logo.
S506: If neither the standard logo with the highest matching rate nor the one with the second-highest matching rate is a local-channel logo, take these two standard logos as the standard logos to be matched, match the color information of the logo area against them to update their matching rates, and take the one with the higher matching rate as the recognition result for the logo to be identified.
Since neither of these standard logos is a local-channel logo, the logo to be identified is not a local-channel logo either, and is very likely a satellite-channel logo.
In this embodiment, the color information of the logo area can be matched against the standard logos to be matched, and their matching rates updated, in the following steps:
(1) Extract the grayscale matrices M_R, M_G, and M_B of the three RGB color channels of the logo area.
(2) Extract the three-channel gray region matrices M_0.5R, M_0.5G, and M_0.5B from the logo area. Their width and height are half those of M_R, M_G, and M_B, and their pixel origin coordinates are given by equation (1), where gray stands for R, G, or B in turn, x is the row coordinate, y is the column coordinate, and H is the matrix
Figure PCTCN2016088466-appb-000003
to facilitate subsequent matching.
(3) Matrix blocking. M_0.5R, M_0.5G, and M_0.5B are each divided into blocks. Because the region is smaller, the blocking scheme is revised: the Edge of a short logo is divided into 3*3 = 9 sub-blocks, and that of a long logo into 3*6 (rows*columns) = 18 sub-blocks.
(4) Construction of the RGB three-channel spatial distribution histogram. Using the blocking scheme of step (3), compute the mean red gray value m_Ri, mean green gray value m_Gi, and mean blue gray value m_Bi of each sub-block region. The block means of the standard logos in the standard library are computed in the same way, and the three channel means are matched. Let n_rgb be the number of matched regions and N_rgb the total number of regions. If the matching differences of all three channels (the difference between the sample gray mean and the standard gray mean in the same sub-block region) are less than 50, then n_rgb = n_rgb + 1. When matching ends, the matching rate is p_rgb = n_rgb/N_rgb. This gives the RGB matching-rate array P_rgb over all standard logos, in which the matching rate of a similar logo is p_rgb and that of a non-similar logo is 0.
(5) Matching-rate update. Combine the matching-rate arrays P and P_rgb into the updated array P_new = P + P_rgb.
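Steps (4) and (5) can be sketched in Python as follows. The sketch assumes the per-block channel means have already been computed and are passed in as (R, G, B) tuples; the function names and the dictionary representation of P and P_rgb are hypothetical.

```python
def rgb_match_rate(sample_means, standard_means, tolerance=50):
    """Return p_rgb = n_rgb / N_rgb for two lists of per-block (R, G, B) means.

    A block counts as matched only when all three channel differences
    are below the tolerance.
    """
    if not sample_means or len(sample_means) != len(standard_means):
        raise ValueError("block lists must be non-empty and of equal length")
    n_rgb = sum(
        1
        for sample, standard in zip(sample_means, standard_means)
        if all(abs(s - t) < tolerance for s, t in zip(sample, standard))
    )
    return n_rgb / len(sample_means)


def update_rates(p, p_rgb):
    """Return P_new = P + P_rgb; logos absent from p_rgb contribute 0."""
    return {name: rate + p_rgb.get(name, 0.0) for name, rate in p.items()}
```

Only the two candidate logos carry a non-zero entry in `p_rgb`, so the update can change their relative order while leaving the remaining rates untouched.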
S507: If the standard logo corresponding to the highest or second-highest matching rate is a local-channel logo, determine the region to which the logo to be identified belongs from the matching rates between the edges of the logo area and the standard logos, segment the text area out of the logo area, separate the single-character part of the text area, recognise the separated single-character part, match the recognition result by text against the standard logos of that region, and take the matching standard logo as the recognition result for the logo to be identified.
The standard logos corresponding to the region are all the standard logos of that region.
Since the standard logo corresponding to the highest or second-highest matching rate is a local-channel logo, the logo to be identified is very likely a local-channel logo and will usually have a high matching rate with several standard logos of the same region. The region to which the logo to be identified belongs can then be determined from these matching rates.
In this embodiment, the region to which the logo to be identified belongs is determined from the number of obtained standard logos belonging to each region and the total number of standard logos of each region, using the following criterion:
Obtain the standard logos whose matching rate differs from the highest matching rate by less than the first preset difference (including the standard logo with the highest matching rate). From the regions to which the obtained standard logos belong, determine the target region, that is, the region to which most of them belong. Determine whether the ratio of the number of obtained standard logos belonging to the target region to the total number of standard logos of the target region exceeds a preset ratio; if so, take the target region as the region to which the logo to be identified belongs.
For example, suppose 10 standard logos have a matching rate that differs from the highest matching rate by less than the first preset difference, of which 8 belong to Beijing TV, 1 to Sichuan TV, and 1 to Xinjiang TV; Beijing TV has 12 standard logos in total, Sichuan TV 6, and Xinjiang TV 5; and the preset ratio is 50%. The target region is then Beijing. The ratio of the number of obtained Beijing standard logos (8) to the total number of Beijing standard logos (12) is 66.7%, which exceeds the preset ratio (50%), so Beijing is taken as the region to which the logo to be identified belongs.
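The region criterion can be expressed compactly in Python. The sketch below is illustrative only: the function name and its inputs (a list with the region of each obtained standard logo, and a dictionary of per-region totals) are hypothetical.

```python
from collections import Counter


def determine_region(candidate_regions, region_totals, preset_ratio=0.5):
    """Return the region of the logo to be identified, or None if undecided.

    candidate_regions: region of each standard logo whose matching rate is
        within the first preset difference of the highest matching rate.
    region_totals: total number of standard logos per region.
    """
    if not candidate_regions:
        return None
    # Target region: the region to which most of the obtained logos belong.
    target, count = Counter(candidate_regions).most_common(1)[0]
    if count / region_totals[target] > preset_ratio:
        return target
    return None
```

Applied to the Beijing example (8 of the obtained logos out of 12 Beijing standard logos, preset ratio 50%), the function returns Beijing.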
The text area can be extracted from the logo area in several ways. To improve the efficiency and accuracy of extraction, this embodiment determines the position information of the text area from the positional relationship between the logo area and the text area together with the position information of the logo area, and extracts the text area from the logo area accordingly. Other approaches may also be used; this embodiment places no restriction on this.
Taking the Beijing TV logo as an example, the text area is the same height as the mark "BTV" in the Beijing TV logo. The area to the right of the mark "BTV" whose length is twice the height of the logo area can therefore be cut out as the text area: the height of the text area equals the height of the mark "BTV", and its length is twice the height of the logo area;
To further improve recognition accuracy, in this embodiment the text area is binarised with Otsu's method, and noise is removed using connected components.
For some local-channel logos, such as those of Beijing TV, the difference lies in the text, for example "北京卫视" (Beijing Satellite TV), "体育" (Sports), or "生活" (Life). This difference is uniquely described by the first character of the text area. To improve recognition efficiency, in this embodiment the separated single-character part is the one containing the first character of the text area, and it can be separated as follows:
(1) remove horizontal and vertical interfering lines (segments) from the text area;
(2) obtain the pixel gaps between the single-character parts of the text area;
(3) extract the complete single-character part according to the pixel gaps, for example 文 (from 文艺, Arts) or 影 (from 影视, Film and TV);
(4) apply threshold segmentation to the single-character part: take the 5*5 sub-region at its upper-left corner and compute the proportion r of black pixels; if r is less than 0.5, swap the black and white pixels of the single-character part.
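Step (4) amounts to a polarity check: the upper-left corner is assumed to be background, so if it is mostly white the image is inverted to give a black background. A minimal Python sketch, with a hypothetical function name and a 0 (black) / 1 (white) pixel encoding chosen for illustration:

```python
def normalise_polarity(char_img, corner=5):
    """Return a copy of a binarised character image with a black background.

    char_img: list of rows of 0 (black) / 1 (white) values.
    The corner*corner patch at the upper-left is inspected; if fewer than half
    of its pixels are black, black and white are swapped.
    """
    patch = [px for row in char_img[:corner] for px in row[:corner]]
    if not patch:
        return [row[:] for row in char_img]
    black_ratio = patch.count(0) / len(patch)
    if black_ratio < 0.5:
        return [[1 - px for px in row] for row in char_img]
    return [row[:] for row in char_img]
```

After this step every character image has the same polarity, which keeps the later thinning and matching steps consistent across logos with light and dark backgrounds.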
All the single-character parts may also be separated and recognised individually; this embodiment places no restriction on this.
The prior art offers many methods for recognising the separated single-character part. To improve recognition accuracy and stability, referring to FIG. 2, recognising the separated single-character part includes:
A1: Thin the single-character part;
In this embodiment, the single-character part is thinned to obtain its thinned edge map. For the thinning procedure, see "一种实用并行细化算法及其实现" (A practical parallel thinning algorithm and its implementation) by 吕岳 (Lü Yue) and 施鹏飞 (Shi Pengfei); it is not described further here.
To simplify subsequent processing, the standard characters in the character set can also be thinned to obtain the thinned edge map of each standard character;
To keep the thinned edge map of the single-character part and those of the standard characters the same size, they can, for example, all be rescaled to 50*50 pixels.
A2: Compute the edge matching rate between the single-character part and each standard character from the thinned edges of the thinned single-character part;
The edge features of a thinned character capture its strokes and narrow the range of standard data to match, while the region features describe the content of the thinned character, enrich the region description, and can correct and confirm the matched character. Because the edge features involve less data than the region features, this embodiment matches by edge features first and then corrects by region features, which reduces the amount of computation.
In this embodiment, the edge matching rate between the single-character part and each standard character can be computed from the thinned edges as follows:
(1) traverse the thinned edge maps si of the standard characters, and divide each si into 10*10 = 100 sub-images;
(2) compute the thinned-edge proportion pi_j of each sub-image to form the thinned-edge proportion matrix Mi, and compute the thinned-edge proportion matrix M of the single-character part in the same way;
(3) compare how close the elements at the same positions of M and Mi are, and from this compute the edge-matrix matching degree Pe_i;
(4) update Ps_i += Pe_i (Ps_i being the i-th element of Ps) to form the new probability array Ps;
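Steps (1) to (3) can be sketched in Python as shown below. The text does not say how the closeness of matrix elements is turned into Pe_i; the sketch assumes Pe_i is the share of positions whose proportions differ by at most a tolerance, and the tolerance value of 0.1 is likewise an assumption. Function names are hypothetical.

```python
def edge_ratio_matrix(thinned, grid=10):
    """Return the grid*grid matrix of thinned-edge proportions.

    thinned: list of rows of 0/1 values (1 = thinned edge pixel),
    for example a 50*50 map split into 10*10 sub-images of 5*5 pixels.
    """
    height, width = len(thinned), len(thinned[0])
    matrix = []
    for r in range(grid):
        rows = thinned[r * height // grid:(r + 1) * height // grid]
        line = []
        for c in range(grid):
            cells = [px for row in rows for px in row[c * width // grid:(c + 1) * width // grid]]
            line.append(sum(cells) / len(cells) if cells else 0.0)
        matrix.append(line)
    return matrix


def edge_matching_degree(m, mi, tolerance=0.1):
    """Return the share of positions where m and mi are within the tolerance.

    Assumption: this share is used as the edge-matrix matching degree Pe_i.
    """
    pairs = [(a, b) for row_a, row_b in zip(m, mi) for a, b in zip(row_a, row_b)]
    if not pairs:
        raise ValueError("matrices must be non-empty")
    return sum(1 for a, b in pairs if abs(a - b) <= tolerance) / len(pairs)
```

Step (4) then adds each Pe_i to the corresponding entry of Ps, one entry per standard character.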
A3: Determine whether the difference between the highest edge matching rate and the second-highest edge matching rate is higher than a second preset difference;
That is, determine whether the highest value Ps_m1 and the second-highest value Ps_m2 of Ps satisfy the condition:
Ps_m1-Ps_m2≧0.25 (0.25 may be set to another value),
A4: When the matching rate difference is higher than the second preset difference, take the standard character corresponding to the highest edge matching rate as the recognition result for the single-character region;
That is, when Ps_m1-Ps_m2≧0.25 is satisfied, the character is recognized as the one corresponding to Ps_m1 and the matching rate is updated.
A5: When the matching rate difference is not higher than the second preset difference, match the region features of the thinned single-character portion against the standard characters and update the edge matching rate;
That is, when Ps_m1-Ps_m2≧0.25 is not satisfied, a correction may be made from the region features by the following procedure:
For each region pixel of the thinned single-character portion, determine whether the pixel at the same position in the thinned standard character is also a region pixel, and calculate the whole-image matching rate pi_ds;
For each region pixel of the thinned standard character, determine whether the pixel at the same position in the thinned single-character portion is also a region pixel, and calculate the whole-image matching rate pi_sd;
Update Ps_i+=(pi_ds+2*pi_sd)/3 (Ps_i being the i-th element of Ps) to form a new probability array Ps.
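The two directed matching rates and the weighted correction term can be sketched as below. The function names are hypothetical, both images are assumed to be same-sized binary 2D lists in which 1 marks a region pixel, and a rate of 0 is assumed when the source image has no region pixels.

```python
def directed_match(src, dst):
    """Fraction of region pixels in src whose same-position pixel in dst
    is also a region pixel."""
    total = hits = 0
    for row_s, row_d in zip(src, dst):
        for s, d in zip(row_s, row_d):
            if s:
                total += 1
                if d:
                    hits += 1
    return hits / total if total else 0.0


def region_correction(candidate, standard):
    """Correction term (pi_ds + 2*pi_sd) / 3 to be added to Ps_i."""
    pi_ds = directed_match(candidate, standard)  # candidate -> standard
    pi_sd = directed_match(standard, candidate)  # standard -> candidate
    return (pi_ds + 2 * pi_sd) / 3
```

The formula weights pi_sd twice as heavily as pi_ds, so covering the strokes of the standard character counts for more than staying inside them.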
A6: Take the standard character corresponding to the highest edge matching rate as the recognition result for the single-character region.
That is, the character corresponding to the maximum value Ps_m of Ps is taken as the recognized character, and the matching rate is corrected.
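The decision flow of steps A3 to A6 can be sketched as follows. The function and parameter names are hypothetical; `edge_scores` is assumed to map each standard character to its edge matching degree, and `region_score_fn` is an assumed callback returning the region correction term for a given character. The comparison uses ≧, following the stated condition Ps_m1-Ps_m2≧0.25.

```python
def recognize_char(edge_scores, region_score_fn, margin=0.25):
    """Return (character, score). Accept the edge-based winner when it
    leads by at least `margin`; otherwise add the region correction to
    every candidate and take the new maximum."""
    ps = dict(edge_scores)
    ranked = sorted(ps, key=ps.get, reverse=True)
    if len(ranked) < 2 or ps[ranked[0]] - ps[ranked[1]] >= margin:
        return ranked[0], ps[ranked[0]]          # step A4
    for label in ps:                             # step A5
        ps[label] += region_score_fn(label)
    best = max(ps, key=ps.get)                   # step A6
    return best, ps[best]
```

The region features are computed only when the edge scores are ambiguous, which is how the embodiment keeps the amount of computation down.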
Table 1 gives the average recognition time for several typical station logos using the method of this embodiment. The traversal recognition times are short and differ little from one another, all being around 2 s.
Table 1 Average recognition time of typical TV station logos
TV station                Logo recognition time (s)
CCTV1                     1.44
CCTV5                     1.29
Phoenix Television        1.76
Southeast Satellite TV    1.93
Beijing Life              2.17
Beijing Sports            2.25
Table 2 gives the average matching rate and recognition rate for several typical station logos using the method of this embodiment. Table 2 demonstrates the effectiveness, stability, and reliability of the method of this embodiment.
Table 2 Average matching rate and recognition rate of typical TV stations
[Table 2 appears as images in the original: Figure PCTCN2016088466-appb-000004 and Figure PCTCN2016088466-appb-000005]
The method of this embodiment is suitable for matching and recognizing the logos of CCTV channels, satellite TV stations, and local stations. It achieves a correct recognition rate above 95% and keeps the traversal recognition time within 3 s.
For simplicity of description, the method embodiments are presented as a series of combined actions. Those skilled in the art should understand, however, that the embodiments of the present invention are not limited by the described order of actions, because according to the embodiments of the present invention some steps may be performed in another order or simultaneously. Those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and that the actions involved are not necessarily required by the embodiments of the present invention.
FIG. 6 is a structural block diagram of a station logo recognition device according to an embodiment of the present invention. Referring to FIG. 6, the device includes:
a logo determining unit 601, configured to determine whether the station logo to be recognized in the logo region is a CCTV logo;
a first recognition unit 602, configured to recognize the station logo to be recognized according to a first preset strategy when it is a CCTV logo;
a second recognition unit 603, configured to recognize the station logo to be recognized according to a second preset strategy when it is a non-CCTV logo.
In an optional embodiment of the present invention, the logo determining unit is further configured to determine whether the station logo to be recognized in the logo region is a CCTV logo according to the aspect ratio, gray level, and color of the logo region.
In an optional embodiment of the present invention, the first recognition unit is further configured to extract the digit region from the logo region, determine the number of digits in the extracted digit region, recognize the digits in the digit region according to the digit-count result, and take the combination of the CCTV logo mark and the recognized digits as the recognition result for the station logo to be recognized.
In an optional embodiment of the present invention, the first recognition unit is further configured to extract the digit region from the logo region according to the positional relationship between the logo region and the digit region, and to binarize the digit portion and the background portion of the digit region.
In an optional embodiment of the present invention, the first recognition unit is further configured to project the gray values of the pixels in the digit region column by column to form a projection vector whose length is the horizontal width of the digit region; to mark a column of the projection vector when that column contains more than a preset number of pixels belonging to the digit portion; and, if there are two adjacent marked columns whose minimum distance is greater than a preset distance, to set the digit-count result to two digits, and otherwise to set it to one digit.
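The digit-count determination can be sketched as below. This is an illustration under stated assumptions: the function name and the default thresholds `min_pixels` and `min_gap` are hypothetical, the digit region is assumed to be already binarized with 1 marking a digit pixel, and "distance between adjacent marked columns" is read as the difference between their column indices.

```python
def count_digits(binary, min_pixels=1, min_gap=2):
    """Return 2 if the column projection shows a gap wider than min_gap
    between neighbouring marked columns, otherwise 1."""
    width = len(binary[0])
    # Column projection: number of digit pixels in each column.
    projection = [sum(row[x] for row in binary) for x in range(width)]
    # Mark columns holding more than the preset number of digit pixels.
    marked = [x for x, count in enumerate(projection) if count > min_pixels]
    # Distances between consecutive marked columns.
    gaps = [b - a for a, b in zip(marked, marked[1:])]
    return 2 if any(gap > min_gap for gap in gaps) else 1
```

Within a single digit the marked columns are contiguous, so a wide gap indicates the blank space between two digits, as in a two-digit channel number.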
In an optional embodiment of the present invention, the second recognition unit is further configured to: calculate the matching rate between the edges of the logo region and each standard logo; when the difference between the highest and second-highest matching rates is greater than a first preset difference, take the standard logo corresponding to the highest matching rate as the recognition result for the station logo to be recognized; when the difference between the highest and second-highest matching rates is less than or equal to the first preset difference, determine whether the standard logo corresponding to the highest or second-highest matching rate is a local station logo; and, when neither of those standard logos is a local station logo, take the standard logos corresponding to the highest and second-highest matching rates as the candidate standard logos, match the color information of the logo region against the candidate standard logos to update their matching rates, and take the candidate standard logo with the higher matching rate as the recognition result for the station logo to be recognized.
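The branching of the second preset strategy can be sketched as follows. All names are hypothetical, and several details are assumptions because the text does not fix them: the default value of `first_diff`, the additive way the color matching rate updates the edge matching rate, and the `match_by_text` callback that stands in for the character-based path used for local station logos.

```python
def identify_non_cctv(edge_rates, local_labels, color_rate, match_by_text,
                      first_diff=0.1):
    """edge_rates maps each standard logo to its edge matching rate and
    must contain at least two entries."""
    ranked = sorted(edge_rates, key=edge_rates.get, reverse=True)
    top, second = ranked[0], ranked[1]
    # Clear winner on edge matching alone.
    if edge_rates[top] - edge_rates[second] > first_diff:
        return top
    # Ambiguous and a local station logo is involved: use character matching.
    if top in local_labels or second in local_labels:
        return match_by_text(edge_rates)
    # Ambiguous among non-local logos: refine the two candidates with color.
    updated = {label: edge_rates[label] + color_rate(label)
               for label in (top, second)}
    return max(updated, key=updated.get)
```

Only the two leading candidates are re-scored with color information, which keeps the more expensive matching off the common path.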
In an optional embodiment of the present invention, the second recognition unit is further configured to match the edges in the logo region against the standard logos by means of a spatial distribution histogram, so as to calculate the matching rate between the edges of the logo region and each standard logo.
In an optional embodiment of the present invention, the second recognition unit is further configured to, when the standard logo corresponding to the highest or second-highest matching rate is a local station logo, determine the region to which the station logo to be recognized belongs according to the matching rates between the edges of the logo region and the standard logos, segment the text region from the logo region, separate the single-character portions of the text region, recognize the separated single-character portions, match the single-character recognition results against the text of the standard logos of that region, and take the matching standard logo as the recognition result for the station logo to be recognized.
In an optional embodiment of the present invention, the second recognition unit is further configured to: thin the single-character portion; calculate the edge matching rate between the single-character portion and each standard character from the thinned edges of the thinned single-character portion; determine whether the difference between the highest and second-highest edge matching rates is higher than a second preset difference; when the matching rate difference is higher than the second preset difference, take the standard character corresponding to the highest edge matching rate as the recognition result for the single-character region; when the matching rate difference is not higher than the second preset difference, match the region features of the thinned single-character portion against the standard characters and update the edge matching rate; and take the standard character corresponding to the highest edge matching rate as the recognition result for the single-character region.
In an optional embodiment of the present invention, the device further includes:
a region acquisition unit, configured to acquire a sequence of video frame images from a preset area of the video containing the station logo to be recognized, perform edge extraction on each video frame image, combine the edges of the video frame images, obtain the minimum bounding rectangle of the combined edges, crop each video frame image according to the minimum bounding rectangle, and combine the cropped images by weighted averaging to obtain the logo region containing the station logo to be recognized.
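The cropping and fusion performed by the region acquisition unit can be sketched as below. The function names are hypothetical, edge extraction is assumed to have been done already (each edge map is a binary 2D list), the edges are assumed to be combined by union because the text does not say how they are combined, and equal weights are assumed when none are supplied.

```python
def bounding_box(edge_maps):
    """Smallest rectangle (top, left, bottom, right; bottom and right are
    exclusive) enclosing every edge pixel of the combined edge maps."""
    ys, xs = [], []
    for edge_map in edge_maps:
        for y, row in enumerate(edge_map):
            for x, value in enumerate(row):
                if value:
                    ys.append(y)
                    xs.append(x)
    if not ys:
        raise ValueError("no edge pixels found")
    return min(ys), min(xs), max(ys) + 1, max(xs) + 1


def fuse_frames(frames, box, weights=None):
    """Crop every grayscale frame to `box` and return their weighted average."""
    top, left, bottom, right = box
    weights = weights or [1.0] * len(frames)
    total = sum(weights)
    return [[sum(w * f[y][x] for w, f in zip(weights, frames)) / total
             for x in range(left, right)]
            for y in range(top, bottom)]
```

Averaging over several frames suppresses the changing program content behind the logo while the static logo pixels are preserved.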
FIG. 7 is a structural block diagram of a server according to another embodiment of the present application.
Referring to FIG. 7, the server includes:
a processor 701, a memory 702, a communications interface 703, and a bus 704; wherein
the processor 701, the memory 702, and the communications interface 703 communicate with one another through the bus 704;
the communications interface 703 is used for information transmission between the server and user equipment;
the processor 701 is configured to invoke logic instructions in the memory 702 to perform the following method:
determining whether the station logo to be recognized in the logo region is a CCTV logo; when the station logo to be recognized is a CCTV logo, recognizing it according to a first preset strategy; when the station logo to be recognized is a non-CCTV logo, recognizing it according to a second preset strategy.
Referring to FIG. 1, another embodiment of the present invention discloses a computer program including program code, the program code being used to perform the following operations:
determining whether the station logo to be recognized in the logo region is a CCTV logo;
when the station logo to be recognized is a CCTV logo, recognizing it according to a first preset strategy;
when the station logo to be recognized is a non-CCTV logo, recognizing it according to a second preset strategy.
Another embodiment of the present invention discloses a storage medium for storing the computer program described in the foregoing embodiment.
Those of ordinary skill in the art will understand that all or part of the steps of the above method embodiments may be carried out by hardware under the control of program instructions. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes any medium that can store program code, such as a ROM, a RAM, a magnetic disk, or an optical disc.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the embodiments of the present invention, not to limit them. Although the embodiments of the present invention have been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some or all of their technical features may be replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (23)

  1. A station logo recognition method, characterized in that the method comprises:
    determining whether the station logo to be recognized in the logo region is a CCTV logo;
    when the station logo to be recognized is a CCTV logo, recognizing the station logo to be recognized according to a first preset strategy;
    when the station logo to be recognized is a non-CCTV logo, recognizing the station logo to be recognized according to a second preset strategy.
  2. The method according to claim 1, characterized in that determining whether the station logo to be recognized in the logo region is a CCTV logo further comprises:
    determining whether the station logo to be recognized in the logo region is a CCTV logo according to the aspect ratio, gray level, and color of the logo region.
  3. The method according to claim 1, characterized in that recognizing the station logo to be recognized according to the first preset strategy further comprises:
    extracting the digit region from the logo region, determining the number of digits in the extracted digit region, recognizing the digits in the digit region according to the digit-count result, and taking the combination of the CCTV logo mark and the recognized digits as the recognition result for the station logo to be recognized.
  4. The method according to claim 3, characterized in that extracting the digit region from the logo region further comprises:
    extracting the digit region from the logo region according to the positional relationship between the logo region and the digit region, and binarizing the digit portion and the background portion of the digit region.
  5. The method according to claim 4, characterized in that determining the number of digits in the extracted digit region further comprises:
    projecting the gray values of the pixels in the digit region column by column to form a projection vector whose length is the horizontal width of the digit region; marking a column of the projection vector when that column contains more than a preset number of pixels belonging to the digit portion; and, if there are two adjacent marked columns whose minimum distance is greater than a preset distance, setting the digit-count result to two digits, and otherwise setting it to one digit.
  6. The method according to claim 1, characterized in that recognizing the station logo to be recognized according to the second preset strategy further comprises:
    calculating the matching rate between the edges of the logo region and each standard logo;
    when the difference between the highest and second-highest matching rates is greater than a first preset difference, taking the standard logo corresponding to the highest matching rate as the recognition result for the station logo to be recognized;
    when the difference between the highest and second-highest matching rates is less than or equal to the first preset difference, determining whether the standard logo corresponding to the highest or second-highest matching rate is a local station logo;
    if neither the standard logo corresponding to the highest matching rate nor the one corresponding to the second-highest matching rate is a local station logo, taking those two standard logos as the candidate standard logos, matching the color information of the logo region against the candidate standard logos to update their matching rates, and taking the candidate standard logo with the higher matching rate as the recognition result for the station logo to be recognized.
  7. The method according to claim 6, characterized in that calculating the matching rate between the edges of the logo region and each standard logo further comprises:
    matching the edges in the logo region against the standard logos by means of a spatial distribution histogram, so as to calculate the matching rate between the edges of the logo region and each standard logo.
  8. The method according to claim 6, characterized in that, after determining whether the standard logo corresponding to the highest or second-highest matching rate is a local station logo, the method further comprises:
    if the standard logo corresponding to the highest or second-highest matching rate is a local station logo, determining the region to which the station logo to be recognized belongs according to the matching rates between the edges of the logo region and the standard logos, segmenting the text region from the logo region, separating the single-character portions of the text region, recognizing the separated single-character portions, matching the single-character recognition results against the text of the standard logos of that region, and taking the matching standard logo as the recognition result for the station logo to be recognized.
  9. The method according to claim 8, characterized in that recognizing the separated single-character portions further comprises:
    thinning the single-character portion;
    calculating the edge matching rate between the single-character portion and each standard character from the thinned edges of the thinned single-character portion;
    determining whether the difference between the highest and second-highest edge matching rates is higher than a second preset difference;
    when the edge matching rate difference is higher than the second preset difference, taking the standard character corresponding to the highest edge matching rate as the recognition result for the single-character region;
    when the edge matching rate difference is not higher than the second preset difference, matching the region features of the thinned single-character portion against the standard characters and updating the edge matching rate;
    taking the standard character corresponding to the highest edge matching rate as the recognition result for the single-character region.
  10. The method according to any one of claims 1 to 9, characterized in that, before determining whether the station logo to be recognized in the logo region is a CCTV logo, the method further comprises:
    acquiring a sequence of video frame images from a preset area of the video containing the station logo to be recognized, performing edge extraction on each video frame image, combining the edges of the video frame images, obtaining the minimum bounding rectangle of the combined edges, cropping each video frame image according to the minimum bounding rectangle, and combining the cropped images by weighted averaging to obtain the logo region containing the station logo to be recognized.
  11. A station logo recognition device, characterized in that the device comprises:
    a logo determining unit, configured to determine whether the station logo to be recognized in the logo region is a CCTV logo;
    a first recognition unit, configured to recognize the station logo to be recognized according to a first preset strategy when it is a CCTV logo;
    a second recognition unit, configured to recognize the station logo to be recognized according to a second preset strategy when it is a non-CCTV logo.
  12. The device according to claim 11, characterized in that the logo determining unit is further configured to determine whether the station logo to be recognized in the logo region is a CCTV logo according to the aspect ratio, gray level, and color of the logo region.
  13. The device according to claim 11, characterized in that the first recognition unit is further configured to extract the digit region from the logo region, determine the number of digits in the extracted digit region, recognize the digits in the digit region according to the digit-count result, and take the combination of the CCTV logo mark and the recognized digits as the recognition result for the station logo to be recognized.
  14. The device according to claim 13, characterized in that the first recognition unit is further configured to extract the digit region from the logo region according to the positional relationship between the logo region and the digit region, and to binarize the digit portion and the background portion of the digit region.
  15. The device according to claim 14, characterized in that the first recognition unit is further configured to project the gray values of the pixels in the digit region column by column to form a projection vector whose length is the horizontal width of the digit region; to mark a column of the projection vector when that column contains more than a preset number of pixels belonging to the digit portion; and, if there are two adjacent marked columns whose minimum distance is greater than a preset distance, to set the digit-count result to two digits, and otherwise to set it to one digit.
  16. The device according to claim 11, characterized in that the second recognition unit is further configured to: calculate the matching rate between the edges of the logo region and each standard logo; when the difference between the highest and second-highest matching rates is greater than a first preset difference, take the standard logo corresponding to the highest matching rate as the recognition result for the station logo to be recognized; when the difference between the highest and second-highest matching rates is less than or equal to the first preset difference, determine whether the standard logo corresponding to the highest or second-highest matching rate is a local station logo; and, when neither of those standard logos is a local station logo, take the standard logos corresponding to the highest and second-highest matching rates as the candidate standard logos, match the color information of the logo region against the candidate standard logos to update their matching rates, and take the candidate standard logo with the higher matching rate as the recognition result for the station logo to be recognized.
  17. The device according to claim 16, characterized in that the second recognition unit is further configured to match the edges in the logo region against the standard logos by means of a spatial distribution histogram, so as to calculate the matching rate between the edges of the logo region and each standard logo.
  18. The device according to claim 16, characterized in that the second recognition unit is further configured to, when the standard logo corresponding to the highest or second-highest matching rate is a local station logo, determine the region to which the station logo to be recognized belongs according to the matching rates between the edges of the logo region and the standard logos, segment the text region from the logo region, separate the single-character portions of the text region, recognize the separated single-character portions, match the single-character recognition results against the text of the standard logos of that region, and take the matching standard logo as the recognition result for the station logo to be recognized.
  19. The device according to claim 18, characterized in that the second recognition unit is further configured to: thin the single-character portion; calculate the edge matching rate between the single-character portion and each standard character from the thinned edges of the thinned single-character portion; determine whether the difference between the highest and second-highest edge matching rates is higher than a second preset difference; when the matching rate difference is higher than the second preset difference, take the standard character corresponding to the highest edge matching rate as the recognition result for the single-character region; when the matching rate difference is not higher than the second preset difference, match the region features of the thinned single-character portion against the standard characters and update the edge matching rate; and take the standard character corresponding to the highest edge matching rate as the recognition result for the single-character region.
  20. The device according to any one of claims 11 to 19, further comprising:
    an area acquiring unit, configured to acquire a sequence of video frame images from a preset area of a video containing the station logo to be identified, perform edge extraction on each video frame image, combine the edges of the video frame images, obtain the minimum bounding matrix of the combined edges, segment each video frame image according to the minimum bounding matrix, and combine the segmented images by weighted averaging to obtain the station logo area containing the station logo to be identified.
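The area acquisition steps of claim 20 can be sketched in plain Python on small grayscale frames. Several choices here are assumptions rather than anything the claim states: a forward-difference gradient stands in for the unspecified edge extractor, edges are combined by keeping only pixels that are edges in every frame (on the reasoning that a station logo is static while programme content moves), and the minimum bounding region is taken to be an axis-aligned rectangle. All function names are hypothetical.

```python
from typing import List, Sequence, Tuple

Image = List[List[float]]  # grayscale image, row-major


def edge_mask(img: Image, threshold: float) -> List[List[bool]]:
    """Flag pixels whose forward-difference gradient exceeds threshold."""
    h, w = len(img), len(img[0])
    mask = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx = img[y][min(x + 1, w - 1)] - img[y][x]
            dy = img[min(y + 1, h - 1)][x] - img[y][x]
            mask[y][x] = abs(dx) + abs(dy) > threshold
    return mask


def combine_masks(masks: Sequence[List[List[bool]]]) -> List[List[bool]]:
    """ASSUMPTION: keep a pixel only if it is an edge in every frame."""
    h, w = len(masks[0]), len(masks[0][0])
    return [[all(m[y][x] for m in masks) for x in range(w)] for y in range(h)]


def bounding_box(mask: List[List[bool]]) -> Tuple[int, int, int, int]:
    """Return (top, left, bottom, right), inclusive, of all True pixels."""
    ys = [y for y, row in enumerate(mask) if any(row)]
    xs = [x for row in mask for x, flag in enumerate(row) if flag]
    if not ys:
        raise ValueError("no edge pixels found")
    return min(ys), min(xs), max(ys), max(xs)


def logo_region(
    frames: Sequence[Image], weights: Sequence[float], threshold: float
) -> Image:
    """Crop every frame to the shared edge box and blend by weighted average."""
    masks = [edge_mask(f, threshold) for f in frames]
    top, left, bottom, right = bounding_box(combine_masks(masks))
    total = sum(weights)
    return [
        [
            sum(w * f[y][x] for f, w in zip(frames, weights)) / total
            for x in range(left, right + 1)
        ]
        for y in range(top, bottom + 1)
    ]
```

Because the forward difference marks the pixel just before an intensity step, the resulting box includes a one-pixel margin above and to the left of the logo. Averaging the cropped frames suppresses the changing background behind the logo while leaving the static logo pixels unchanged.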
  21. A server, comprising:
    a processor, a memory, a communication interface, and a bus, wherein
    the communication interface is used for information transmission between user equipment and the server;
    the processor is configured to invoke logic instructions in the memory to perform the following method:
    determining whether the station logo to be identified in a station logo area is a CCTV station logo; when the station logo to be identified is a CCTV station logo, identifying the station logo to be identified according to a first preset strategy; and when the station logo to be identified is not a CCTV station logo, identifying the station logo to be identified according to a second preset strategy.
  22. A computer program, comprising program code, the program code being used to perform the following operations:
    determining whether the station logo to be identified in a station logo area is a CCTV station logo;
    when the station logo to be identified is a CCTV station logo, identifying the station logo to be identified according to a first preset strategy;
    when the station logo to be identified is not a CCTV station logo, identifying the station logo to be identified according to a second preset strategy.
  23. A storage medium, configured to store the computer program according to claim 22.
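The top-level flow recited in claims 21 and 22 is a two-way dispatch, which can be sketched as below. The function `identify_logo` and its three callbacks are hypothetical placeholders; the claims name the CCTV test and the two preset strategies but do not define their interfaces.

```python
from typing import Callable, TypeVar

Region = TypeVar("Region")


def identify_logo(
    logo_region: Region,
    is_cctv: Callable[[Region], bool],
    first_strategy: Callable[[Region], str],
    second_strategy: Callable[[Region], str],
) -> str:
    """Route the logo area to the strategy that matches its type."""
    # CCTV logos go to the first preset strategy.
    if is_cctv(logo_region):
        return first_strategy(logo_region)
    # Every other logo goes to the second preset strategy.
    return second_strategy(logo_region)
```

Passing the test and the strategies as parameters keeps the dispatch independent of how each strategy is actually implemented.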
PCT/CN2016/088466 2015-11-24 2016-07-04 Method of identifying digital on-screen graphic and device WO2017088479A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510824167.4 2015-11-24
CN201510824167.4A CN105868683A (en) 2015-11-24 2015-11-24 Channel logo identification method and apparatus

Publications (1)

Publication Number Publication Date
WO2017088479A1 (en) 2017-06-01

Family

ID=56623826

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/088466 WO2017088479A1 (en) 2015-11-24 2016-07-04 Method of identifying digital on-screen graphic and device

Country Status (2)

Country Link
CN (1) CN105868683A (en)
WO (1) WO2017088479A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107516097B (en) * 2017-08-10 2020-03-24 青岛海信电器股份有限公司 Station caption identification method and device
US10650072B2 (en) * 2017-10-30 2020-05-12 Facebook, Inc. System and method for determination of a digital destination based on a multi-part identifier
CN108764154B (en) * 2018-05-30 2020-09-08 重庆邮电大学 Water surface garbage identification method based on multi-feature machine learning
CN109117768A (en) * 2018-07-30 2019-01-01 上海科江电子信息技术有限公司 A kind of TV station symbol recognition method based on deep learning
CN112257768B (en) * 2020-10-19 2023-01-31 广州金融科技股份有限公司 Method and device for identifying illegal financial pictures and computer storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040255249A1 (en) * 2001-12-06 2004-12-16 Shih-Fu Chang System and method for extracting text captions from video and generating video summaries
CN102436575A (en) * 2011-09-22 2012-05-02 Tcl集团股份有限公司 Method for automatically detecting and classifying station captions
CN102446272A (en) * 2011-09-05 2012-05-09 Tcl集团股份有限公司 Method and device for segmenting and recognizing station caption as well as television comprising device
CN103020650A (en) * 2012-11-23 2013-04-03 Tcl集团股份有限公司 Station caption identifying method and device
CN103077384A (en) * 2013-01-10 2013-05-01 北京万集科技股份有限公司 Method and system for positioning and recognizing vehicle logo
CN103336954A (en) * 2013-07-08 2013-10-02 北京捷成世纪科技股份有限公司 Identification method and device of station caption in video
CN104023249A (en) * 2014-06-12 2014-09-03 腾讯科技(深圳)有限公司 Method and device of identifying television channel
CN104200238A (en) * 2014-09-22 2014-12-10 北京酷云互动科技有限公司 Station caption recognition method and station caption recognition device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101546424B (en) * 2008-03-24 2012-07-25 富士通株式会社 Method and device for processing image and watermark detection system
TWI493382B (en) * 2013-01-31 2015-07-21 Pixart Imaging Inc Hand posture detection device for detecting hovering and click
CN103955713B (en) * 2014-04-24 2017-08-18 海信集团有限公司 A kind of icon-based programming method and apparatus

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111444915A (en) * 2020-03-26 2020-07-24 山东云缦智能科技有限公司 Television station logo detection method based on edge detection
CN111652925A (en) * 2020-06-29 2020-09-11 中国科学院合肥物质科学研究院 Method for extracting target global feature Hu invariant moment by using single-pixel imaging
CN111652925B (en) * 2020-06-29 2023-04-07 合肥中科迪宏自动化有限公司 Method for extracting target global feature Hu invariant moment by using single-pixel imaging

Also Published As

Publication number Publication date
CN105868683A (en) 2016-08-17

Similar Documents

Publication Publication Date Title
WO2017088479A1 (en) Method of identifying digital on-screen graphic and device
US10896349B2 (en) Text detection method and apparatus, and storage medium
CN106254933B (en) Subtitle extraction method and device
CN104751142B (en) A kind of natural scene Method for text detection based on stroke feature
CN109389121B (en) Nameplate identification method and system based on deep learning
US8965127B2 (en) Method for segmenting text words in document images
Shivakumara et al. A new gradient based character segmentation method for video text recognition
WO2019085971A1 (en) Method and apparatus for positioning text over image, electronic device, and storage medium
US9275030B1 (en) Horizontal and vertical line detection and removal for document images
US8300937B2 (en) Image segmentation method and system based on region features of pixels
Wei et al. A robust video text detection approach using SVM
AU2018207032B2 (en) Logo detection video analytics
CN111626145B (en) Simple and effective incomplete form identification and page-crossing splicing method
CN113436222A (en) Image processing method, image processing apparatus, electronic device, and storage medium
CN113435438B (en) Image and subtitle fused video screen plate extraction and video segmentation method
Zhang et al. A novel approach for binarization of overlay text
CN114241490A (en) Method for improving handwriting recognition model performance based on stroke disturbance and post-processing
JP5347793B2 (en) Character recognition device, character recognition program, and character recognition method
CN116030472A (en) Text coordinate determining method and device
WO2017088478A1 (en) Number separating method and device
Rajendran et al. A new Fourier-moments based video word and character extraction method for recognition
Jiang et al. The fast recognition of vehicle license plate based on the improved template matching
CN108304825B (en) Text detection method and device
CN114648751A (en) Method, device, terminal and storage medium for processing video subtitles
CN109409370B (en) Remote desktop character recognition method and device

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 16867706; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 EP: PCT application non-entry in European phase (Ref document number: 16867706; Country of ref document: EP; Kind code of ref document: A1)