CN117010325A

CN117010325A - Media preview method, device, computer equipment and storage medium

Info

Publication number: CN117010325A
Application number: CN202211018035.9A
Authority: CN
Inventors: 余自强
Original assignee: Tencent Technology Shenzhen Co Ltd
Current assignee: Tencent Technology Shenzhen Co Ltd
Priority date: 2022-08-24
Filing date: 2022-08-24
Publication date: 2023-11-07

Abstract

The present application relates to a media preview method, apparatus, computer device, storage medium and computer program product. The method comprises the following steps: in response to a preview trigger event for the media library, displaying a media preview area of the media library; when the media library comprises a text type image, displaying an image thumbnail pointing to the text type image in a media preview area; the image thumbnail has a preset size, and the text display area ratio in the text type image reaches a text theme ratio threshold; displaying a scaled image obtained by scaling a partial region taken from the text type image to a preset size in an image thumbnail pointing to the text type image; the resolution of the text in the scaled image is not less than the visual resolution threshold of the text and meets the text recognition condition; in the text type image, the degree of importance of the text in the partial region is higher than that of the text outside the partial region. The method can increase the information quantity presented during media preview.

Description

Media preview method, device, computer equipment and storage medium

Technical Field

The present application relates to the field of computer technology, and in particular, to a media preview method, apparatus, computer device, storage medium, and computer program product.

Background

Currently, aiming at a media library comprising media resources such as videos, pictures and the like, a preview function is provided for a user to preview so as to select media to be viewed. When the media is previewed, the user previews the thumbnail through the corresponding compressed thumbnail of the media, and the user can preview the whole content of the media through the thumbnail. However, for pictures containing a large amount of characters, such as bulletins, character screenshots and the like, the characters in the compressed thumbnail are often difficult to recognize, so that the user is difficult to distinguish the pictures during previewing, the pictures can be accurately distinguished only after clicking to view the original pictures, and the information presented during media previewing is limited.

Disclosure of Invention

In view of the foregoing, it is desirable to provide a media preview method, apparatus, computer device, computer readable storage medium, and computer program product that can increase the amount of information presented during media preview.

In a first aspect, the present application provides a media preview method. The method comprises the following steps:

in response to a preview trigger event for the media library, displaying a media preview area of the media library;

when the media library comprises a text type image, displaying an image thumbnail pointing to the text type image in a media preview area; the image thumbnail has a preset size, and the text display area ratio in the text type image reaches a text theme ratio threshold;

Displaying a scaled image obtained by scaling a partial region taken from the text type image to a preset size in an image thumbnail pointing to the text type image; the resolution of the text in the scaled image is not less than the text visual resolution threshold; in the text type image, the degree of importance of the text in the partial region is higher than that of the text outside the partial region.

In a second aspect, the application further provides a media preview device. The device comprises:

the preview area display module is used for responding to a preview trigger event aiming at the media library and displaying a media preview area of the media library;

the thumbnail display module is used for displaying an image thumbnail pointing to the text type image in the media preview area when the media library comprises the text type image; the image thumbnail has a preset size, and the text display area ratio in the text type image reaches a text theme ratio threshold;

the text region display module is used for displaying a zoomed image which zooms a partial region intercepted from the text type image to a preset size in the image thumbnail pointing to the text type image; the resolution of the text in the scaled image is not less than the text visual resolution threshold; in the text type image, the degree of importance of the text in the partial region is higher than that of the text outside the partial region.

In a third aspect, the present application also provides a computer device. The computer device comprises a memory storing a computer program and a processor which when executing the computer program performs the steps of:

In a fourth aspect, the present application also provides a computer-readable storage medium. The computer readable storage medium having stored thereon a computer program which when executed by a processor performs the steps of:

In a fifth aspect, the present application also provides a computer program product. The computer program product comprises a computer program which, when executed by a processor, implements the steps of:

According to the media preview method, the device, the computer equipment, the storage medium and the computer program product, for the text type image with the text display area proportion reaching the text theme proportion threshold, the zoom image with the text cut from the partial area of the text type image scaled to the preset size is displayed in the previewed image thumbnail, the resolution of the text in the zoom image is not smaller than the text visual resolution threshold, and the text importance degree in the partial area is higher than the text importance degree outside the partial area, so that the text content with the high text importance degree can be directly displayed through the previewed zoom image, the information quantity displayed during media preview is increased, and the user can select the media resource to be checked according to the previewed thumbnail.

Drawings

FIG. 1 is an application environment diagram of a media preview method in one embodiment;

FIG. 2 is a flow diagram of a media preview method in one embodiment;

FIG. 3 is an interface diagram of a media preview interface displaying an image thumbnail in one embodiment;

FIG. 4 is a schematic diagram of an interface for displaying keywords in an image thumbnail, in one embodiment;

FIG. 5 is a flow diagram of text type image determination in one embodiment;

FIG. 6 is a schematic diagram of an interface for browsing albums in one embodiment;

FIG. 7 is a schematic diagram of an interface for browsing media in an application in one embodiment;

FIG. 8 is a schematic diagram of a text type image in one embodiment;

FIG. 9 is a thumbnail image corresponding to the text type image in the embodiment of FIG. 8;

FIG. 10 is a schematic diagram of a chapter screenshot in one embodiment;

FIG. 11 is a thumbnail image corresponding to a chapter shot in the embodiment of FIG. 10;

FIG. 12 is a diagram of an interface displaying text keywords associated with text type images in one embodiment;

FIG. 13 is a flowchart of a media preview method according to another embodiment;

FIG. 14 is a schematic diagram of a chapter sectional view of the embodiment of FIG. 10 that identifies text areas;

FIG. 15 is a schematic illustration of centering cuts with high priority in one embodiment;

FIG. 16 is a schematic illustration of a width-first centered cut in one embodiment;

FIG. 17 is a schematic diagram of a width-first centered emphasis cut in one embodiment;

FIG. 18 is a block diagram of a media preview device in one embodiment;

fig. 19 is an internal structural view of the computer device in one embodiment.

Detailed Description

The present application will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present application more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the application.

The media preview method provided by the embodiment of the application can be applied to an application environment shown in fig. 1. Wherein the terminal 102 communicates with the server 104 via a network. The data storage system may store data that the server 104 needs to process. The data storage system may be integrated on the server 104 or may be located on the cloud or other servers. The media library may be a media resource library local to the terminal 102, or may be a media resource library of the server 104 accessed by the terminal 102 through a network; the media library may include media resources obtained locally by the terminal 102, such as pictures or videos obtained by local shooting, and may also include media resources obtained by the terminal 102 from the server 104 through a network. The user may trigger a preview trigger event for the media library at the terminal 102, the terminal 102 displays a media preview area of the media library, for a text type image conforming to a text theme condition, that is, for a text type image whose text display area duty ratio reaches a text theme duty ratio threshold, the terminal 102 displays a scaled image in which a partial area cut from the text type image is scaled to a preset size in a previewed image thumbnail, the resolution of text in the scaled image is not less than a text visual resolution threshold, and the importance of text in the partial area is higher than that of text outside the partial area.

The terminal 102 may be, but not limited to, various desktop computers, notebook computers, smart phones, tablet computers, internet of things devices, and portable wearable devices, where the internet of things devices may be smart speakers, smart televisions, smart air conditioners, smart vehicle devices, and the like. The portable wearable device may be a smart watch, smart bracelet, headset, or the like. The server 104 may be implemented as a stand-alone server or as a server cluster of multiple servers.

In one embodiment, as shown in fig. 2, a media preview method is provided, where the method is executed by a computer device, specifically, may be executed by a computer device such as a terminal or a server, or may be executed by the terminal and the server together, and in an embodiment of the present application, the method is applied to the terminal in fig. 1, and is described by taking the example as an example, including the following steps:

step 202, in response to a preview trigger event for a media library, displaying a media preview area of the media library.

The media can comprise various resources such as images, videos and the like, the media library is a resource database of the media, and various media which can be accessed and checked are included in the media library. Previewing refers to an interactive manner in which media is viewed in advance without directly accessing the media. The preview trigger event refers to an event that triggers previewing of media in the media library. The preview trigger event may be generated by an operation trigger by the user, such as the user may trigger a preview operation for the media library, thereby generating the preview trigger event. The preview trigger event may also be automatically generated when a preset trigger condition is satisfied, for example, the preview trigger event may be automatically generated when a preset time is reached, a preset place is reached, a preset number of media is reached, etc., so as to preview the media library. The media preview area is an area for previewing each media in the media library, and thumbnail images of each media in the media library can be displayed in the media preview area, so that a user can preview each media in the media library by browsing each thumbnail image.

Specifically, the user may trigger the terminal or the media library of the server to generate a preview trigger event, which indicates that previewing is required for each media in the media library, and the terminal displays a media preview area of the media library in response to the preview trigger event, so that each media in the media library is previewed in the media preview area by the user.

Step 204, when the media library includes text type images, displaying image thumbnails pointing to the text type images in the media preview area; the image thumbnail has a preset size, and the text display area ratio in the text type image reaches the text theme ratio threshold.

The text display area ratio refers to the ratio of text in the text type image in the image, and can be specifically calculated according to the area sizes of the text display area and the text type image area. The text topic duty cycle threshold is used to determine whether the image is topic by text content. The text topic duty ratio threshold can be set according to actual needs, for example, different text topic duty ratio thresholds can be correspondingly set for images of different sources. Further, the text display area ratio in the text type image reaches a text theme ratio threshold, namely the relation of the text in the text type image relative to the text type image accords with the text theme condition, the text type image takes the text content as the theme, and the theme of the text type image is the text content included in the text type image. For example, for a screenshot of an article in the internet, the media form of the screenshot is an image, but the text in the article intercepted in the image is the subject of the image, and the relation of the text in the screenshot relative to the screenshot meets the text subject condition. The media form of the text type image is an image, but the subject content of the text type image is the text in the text type image, namely the text in the text type image is the core information carried by the image. In particular applications, images in a media library may be analyzed to determine whether they belong to text-type images, such as whether the relationship of text in the images to the images meets text subject matter conditions. In a specific implementation, based on the area ratio of the text display area of the text in the image to the area ratio in the image, the area ratio is compared with a preset text theme ratio threshold value to determine whether the text theme condition is met, that is, whether the image takes the included text as a theme, that is, whether the image belongs to the text type image is determined.

The image thumbnail has a preset size, and the preset size can be flexibly set according to actual needs. For example, the length and width may be M pixels. The image thumbnails point to media in the media library, and each image thumbnail may point to one of the media in the media library to reveal preview information of the corresponding media via the image thumbnail. Compared with the original size of the media, the image thumbnail is obtained by scaling the media, so that the corresponding media is indicated by the image thumbnail suitable for preview display, and a user can trigger operations on the image thumbnail, such as clicking the image thumbnail, to trigger access to the corresponding media.

Specifically, in the media preview area of the media library, the terminal displays image thumbnails directed to each media, the image thumbnails having a preset size, the preset size being set according to actual needs. The user can also perform self-defined adjustment on the preset size of the image thumbnail so as to meet the corresponding preview requirement. For a text type image included in the media library, the terminal displays an image thumbnail pointing to the text type image in the media preview area. The relation of the text in the text type image relative to the text type image meets the text theme condition, namely the text in the text type image is taken as the theme content of the text type image.

Step 206, displaying a zoom image obtained by zooming the partial region intercepted from the text type image to a preset size in the image thumbnail pointing to the text type image; the resolution of the text in the scaled image is not less than the text visual resolution threshold; in the text type image, the degree of importance of the text in the partial region is higher than that of the text outside the partial region.

Wherein the text is included in the partial region, the text importance level in the partial region is higher than the text importance level outside the partial region, and the text importance level is used for representing the importance of the text in the text type image. In a text type image, the higher the importance of text is in the region of higher importance in the text type image. If the importance degree of the text in the partial region is higher than that of the text outside the partial region, the partial region may be a region in which the text with the highest importance degree of the text is distributed in the text type image. The resolution of the text in the scaled image is not smaller than the visual resolution threshold of the text, and meets the text recognition condition, namely, when the scaled image is displayed in the image thumbnail, the text in the scaled image can support user recognition, namely, the user can still accurately recognize the content of the text in the scaled image. The text recognition condition may be set according to actual needs, for example, may be set such that the font size satisfies the font size threshold. The visual resolution threshold may be a minimum resolution of text capable of accurately identifying text content, and the resolution of text may be determined according to a font size of text, e.g., a height of a font of text may be directly used as the resolution of text. The font height of the text in the scaled image is greater than or equal to the visual resolution threshold, which may ensure that the text in the scaled image is accurately recognizable by the user. The size of the zoom image is the same as that of the image thumbnail, the zoom image is obtained by performing zoom processing on a partial area intercepted by the text type image according to the preset size of the image thumbnail, and the zoom image is used for representing the corresponding text type image.

Specifically, in the image thumbnail, the terminal displays a scaled image, and the resolution of the text in the scaled image is not less than the text visual resolution threshold, i.e., the text in the scaled image supports accurate recognition by the user. The scaled image is obtained by scaling a partial region of the text type image to a preset size, wherein the partial region is a region with highest text importance degree in the text type image, namely, in the text type image, the text importance degree in the partial region is higher than the text importance degree outside the partial region. The text topic content of the text type image can be accurately characterized through the text in the partial region.

For example, for a text type image, which is specifically a screenshot of a complete article, the partial area intercepted from the text type image may be an area including a title in the text type image, after the area including the title is scaled to a preset size, the obtained scaled image with resolution not less than the threshold of visual resolution of the text is displayed in an image thumbnail, and the user can directly identify the title of the article in the text type image according to the image thumbnail during previewing, thereby obtaining the subject content of the article in the text type image, being beneficial to the user to select the media resource to be checked according to the previewed thumbnail, avoiding the user from selecting after viewing the original image of the text type image, simplifying the operation of media selection, and improving the user experience.

In one particular application, as shown in FIG. 3, an image thumbnail is displayed in a media preview interface of a media library that points to media in the media library, either to images in the media library or to video in the media library. Wherein, the image 1, the image 2 and the image 4 belong to a text type image, and the relation of the included text relative to the text type image accords with the text theme condition, namely, the image 1, the image 2 and the image 4 take the text content as the theme; image 3 is a photograph of a person. Each image thumbnail displayed in the media preview interface points to a corresponding media, respectively. For image 1, image 2, and image 4, which belong to the text type image, a scaled image is displayed in which a partial area truncated from the respective images is scaled to a preset size. The resolution of the text in the scaled image is not less than the text visual resolution threshold, and the text importance level in the partial region is higher than the text importance level outside the partial region. In the image thumbnail 1, titles of articles in the screenshot are displayed, and a user can directly and rapidly acquire key information in the image 1 according to the titles of the articles, namely topic contents related to daily life health maintenance. In the image thumbnail 2 and the image thumbnail 4, the contents of the group advertisement are respectively displayed, and the user can directly obtain key information in the image 2 and the image 4, namely the contents related to the group advertisement, on the media preview interface.

In the media preview method, for the text type image with the text display area proportion reaching the text theme proportion threshold, the zoom image obtained by zooming the partial area of the text type image to the preset size is displayed in the previewed image thumbnail, the resolution of the text in the zoom image is not smaller than the text visual resolution threshold, and the text importance degree in the partial area is higher than the text importance degree outside the partial area, so that the text content with high text importance degree can be directly displayed through the previewed zoom image, the information quantity displayed during media preview is increased, and the user can select the media resource to be watched according to the previewed thumbnail.

In one embodiment, the media preview method further comprises: displaying at least one text keyword associated with the text type image in the image thumbnail; at least one text keyword for describing a content subject of text in the text-type image.

The text keywords are associated with the text type images, and can be specifically obtained based on text recognition in the text type images and used for describing the content subjects of the text in the text type images. The text keywords can be directly extracted from the text type images, and can also be obtained by carrying out semantic recognition on the texts in the text type images. The text keywords comprise at least one text keyword, and the number of the text keywords can be set according to actual needs, and particularly can be set to be a fixed number, such as 3 text keywords; a number of ranges, for example, 3 to 5, may be set, and 3 to 5 may be displayed correspondingly according to text keywords recognized from texts in the text type image.

Specifically, the terminal displays at least one text keyword associated with the text type image in the image thumbnail, and the text keyword may be displayed overlaid on the scaled image. The resolution of the text keywords is not smaller than the text visual resolution threshold, namely at least one text keyword displayed in the image thumbnail, so that the user can be supported to accurately identify when previewing, and the text keywords of each text type image can be accurately distinguished according to the text keywords of each text type image. The text keywords describe the content subjects of the texts in the corresponding text type images, when the texts in the text type images are different, the text keywords associated with the text type images are different, and the key information of the text type images can be displayed through the text keywords, so that a user can preview and select each thumbnail based on the key information of the text type images reflected by the text keywords.

In one specific application, as shown in fig. 4, in the media preview interface of the media library, image thumbnails of 4 images are displayed from left to right, respectively, and for the first image thumbnail and the second image thumbnail, text keywords of each of the first image and the second image are displayed in the image thumbnails. Wherein the text keywords of the first image comprise "life, health, sports, diet", and the text keywords of the second image comprise "group bulletins". The user can quickly learn important content information of the image pointed by the image thumbnail by looking at the text keywords which are displayed in the image thumbnail and are associated with the corresponding image. And for the third image thumbnail, the third image thumbnail is a person photo, does not belong to the text type image, and the zoomed person image is directly displayed in the media preview interface.

In this embodiment, the terminal displays, in the image thumbnail, at least one text keyword describing a content theme of the text in the text type image, so as to support the user to accurately identify based on the text keyword in the preview process, further increase the information content presented in the media preview process through the text keyword, and facilitate the user to select a media resource to be viewed according to the preview thumbnail.

In one embodiment, displaying at least one text keyword associated with a text type image in an image thumbnail includes: in response to a keyword presentation triggering operation in the media preview area, at least one text keyword associated with the text type image from which the scaled image originated is displayed in the scaled image.

The keyword display triggering operation is an operation of displaying text keywords associated with the text type images in a triggering mode. The keyword presentation triggering may be triggered by a user, e.g., the user may trigger an operation on a keyword presentation control of the media library to implement the keyword presentation triggering operation. Text keywords associated with the text type image are displayed in the scaled image, and may specifically be displayed overlaid over the scaled image, for example, may be displayed in a floating manner over the scaled image.

Specifically, the user may trigger a keyword presentation triggering operation in the media preview area, such as the user clicking a keyword presentation control in the media preview area, and the terminal displays at least one text keyword in the scaled image in response to the keyword presentation triggering operation of the user, where the displayed text keyword is associated with the text type image from which the scaled image originated, and may specifically be obtained by text recognition from the text type image from which the scaled image originated.

In this embodiment, the terminal responds to the keyword display triggering operation of the user in the media preview area, and displays at least one text keyword associated with the text type image in the zoom image, so that the text type image pointed by the image thumbnail is described by using the displayed text keyword, and the information content presented during media preview can be increased according to the user's needs through the text keyword, which is beneficial to the user to select the media resource to be viewed according to the preview thumbnail.

In one embodiment, the media preview method further comprises: displaying a keyword display activation entry identifying a state to be activated in a media preview area;

the keyword display activation entry is an operation entry for triggering and activating the display text keywords by a user, and can also identify the activation state of the display text keywords, specifically, the activation state of the display text keywords can be identified through different display modes, such as different colors, marks and the like. When the keyword display activation entry identifies a state to be activated, the keyword display activation entry indicates that the text keyword is not activated currently, i.e. the text keyword associated with the text type image is not displayed in the image thumbnail. When the keyword display activation entry identifies an activation state, it indicates that a text keyword has been currently activated, i.e., a text keyword associated with a text type image has been displayed in the image thumbnail. The activation entry is displayed through the keywords, so that the user is supported to switch and control whether the text keywords associated with the text type images are displayed or not, and the state of the activation entry identification can be displayed through the keywords, so that the user is prompted whether the text keywords associated with the text type images are activated or not currently.

Specifically, the terminal may display a keyword display activation entry in the media preview area, where a specific form of the keyword display activation entry may be set according to actual needs, for example, may be a switch control, a slider control, or the like. The keyword presentation activation portal may identify a status of a current keyword presentation, specifically identify a status to be activated, to indicate that text keywords associated with presentation text type images have not been currently activated.

Further, in response to a keyword presentation triggering operation in the media preview area, displaying in the scaled image at least one text keyword associated with the text type image from which the scaled image originated, comprising: in response to a triggering operation of the keyword presentation activation portal, the keyword presentation activation portal is switched to be displayed in an identified activation state, and at least one text keyword associated with the text-type image from which the scaled image originated is displayed in the scaled image.

The triggering operation is an operation triggered by the user aiming at the keyword display activation portal, for example, the triggering operation can be a clicking operation of the user aiming at the keyword display activation portal. The identification activation state is used to identify text keywords associated with the currently activated presentation text type image.

Specifically, the user can trigger operation for the keyword display activation portal, for example, the user can click on the keyword display activation portal, and the terminal responds to the trigger operation of the user on the keyword display activation portal to switch and display the keyword display activation portal to be in an identification activation state so as to identify text keywords associated with the currently activated display text type image. The terminal displays at least one text keyword associated with the text type image in the scaled image. The text type image is an image from which a partial region corresponding to the scaled image is derived.

Further, the media preview method further comprises: and in response to a triggering operation of the keyword presentation activation portal identifying the activation state, displaying the keyword presentation activation portal in a switching manner to identify the state to be activated, and hiding at least one text keyword in the scaled image.

The keyword display activation entry is displayed to identify a state to be activated, indicating that the cancellation of the text keyword associated with the display text type image has been triggered. Specifically, the user may trigger the keyword presentation activation entry again, indicating that the user needs to cancel presentation of the text keywords associated with the text-type image. And the terminal responds to the triggering operation of the user on the keyword display activation entry for identifying the activation state, hides at least one text keyword in the zoom image, and switches and displays the keyword display activation entry as the identification state to be activated. The terminal conceals the displayed text keywords, so that only the zoom image is displayed in the media preview area according to the requirements of the user.

In one particular application, as shown in fig. 4, a keyword presentation activation entry, in particular a switch control, is displayed in a media preview area of a media library, for which a user may trigger an operation to toggle activation or deactivation of a displayed keyword. The switch control may identify the status of an activated display keyword or an inactivated display keyword by different display statuses. When the switch control is in the identification to-be-activated state, a user can trigger the switch control, such as clicking the switch control, the switch control is switched and displayed to be in the identification to-be-activated state, and at least one text keyword associated with the corresponding text type image is displayed in the scaled image of the image thumbnail. The user can trigger the operation for the switch control again to cancel the display keywords, and then the switch control switches to display to identify the state to be activated and hide the displayed text keywords.

In this embodiment, the terminal displays a keyword display activation entry in the media preview area, and prompts whether to activate the text keywords associated with the display text type images currently or not by displaying the state of the activation entry identifier through the keyword, and supports the user to activate or cancel the text keywords associated with the display text type images according to actual needs, so that the user can conveniently select whether to increase the information content presented when media preview is performed through the display text keywords according to actual needs, and the user can conveniently select media resources to be viewed according to the previews of the previews.

In one embodiment, displaying at least one text keyword associated with a text type image in an image thumbnail includes: displaying a preset number of text keywords associated with the text type images in the image thumbnail; and sequencing and displaying the preset number of text keywords according to the respective text importance degrees.

The preset number can be set according to actual needs, and can be a fixed number or a number range. For example, the preset number may be 3, and then 3 text keywords are displayed in the image thumbnail. The preset number may be 3-6, and the number of text keywords displayed in the image thumbnail may be 3-6. The degree of importance of the text is used to characterize the importance of the text in the text-type image, the higher the importance of the text in the text-type image is in the region of higher importance of the text. Different areas in the text type image can have corresponding text importance degrees, and the importance degrees are used for representing the importance of the different areas in the text type image; and different text keywords associated with the text type image can also have corresponding text importance levels for characterizing the importance of the text keywords in the text type image.

Specifically, the terminal displays a preset number of text keywords associated with the text type image in the image thumbnail, such as 3 text keywords or 5 text keywords, which may be displayed. Each text keyword can be displayed in a sequence according to the respective text importance degree, for example, the text keywords with high text importance degree can be displayed in a sequence from high text importance degree to low text importance degree, so that the text keywords with high text importance degree are displayed in priority.

In this embodiment, the terminal sorts and displays a preset number of text keywords according to respective text importance degrees, so that each text keyword can be orderly arranged and displayed according to the text importance degrees, and a plurality of text keywords can be orderly displayed, so that the information quantity presented during media preview is increased, and the user can select media resources to be checked according to the thumbnail of the preview.

In one embodiment, displaying at least one text keyword associated with a text type image in an image thumbnail includes: in the case where the resolution of the text in the scaled image is less than the text visual resolution threshold, at least one text keyword associated with the text type image is displayed in the image thumbnail.

The resolution of the text in the scaled image is smaller than the visual resolution threshold of the text, which indicates that the text in the scaled image meets the text recognition condition, namely, when the text in the scaled image is displayed in the image thumbnail, the text in the scaled image can support user recognition, namely, the user can still accurately recognize the content of the text in the scaled image. The text visual resolution threshold may be set according to actual needs, for example, may be set to a font height of N pixels.

Specifically, when the resolution of the text in the scaled image is smaller than the text visual resolution threshold, it is indicated that when the terminal displays the scaled image, the user cannot accurately identify the text in the scaled image, and then the terminal displays at least one text keyword associated with the text type image in the image thumbnail, and the resolution of the displayed text keyword is greater than or equal to the text visual resolution threshold, so that the user can effectively distinguish each image thumbnail through the text keyword even though the user cannot identify the text in the scaled image.

In one specific application, as shown in fig. 4, in the media preview area of the media library, when the resolution of the text in the scaled image is smaller than the text visual resolution threshold, that is, the text cannot be accurately recognized by the user, such as the scaled images in the image thumbnail 1 and the image thumbnail 2, the user cannot directly recognize the accurate content of the text therein, at least one text keyword associated with the corresponding text type image is displayed in a floating manner on the scaled image. For the image thumbnail 4, the resolution of the text therein is greater than or equal to the text visual resolution threshold, and the text keywords thereof may not be displayed.

In this embodiment, when the resolution of the text in the scaled image is smaller than the text visual resolution threshold, that is, when the user cannot directly identify the text in the scaled image to effectively distinguish each text type image, the terminal displays at least one text keyword associated with the text type image in the image thumbnail, so as to support the user to accurately identify based on the text keyword in the preview process, further increase the information content presented in the media preview process through the text keyword, and facilitate the user to select the media resource to be viewed according to the thumbnail of the preview.

In one embodiment, displaying at least one text keyword associated with a text type image in an image thumbnail includes: in the image thumbnail, at least one text keyword associated with the text type image is displayed in a highlighted manner with respect to the scaled image.

The highlighting mode can be flexibly set according to actual needs, for example, text keywords can be highlighted relative to the scaled image through different font colors, font styles, font distribution and the like. Specifically, when at least one text keyword associated with the text type image is displayed in the image thumbnail, the text keyword is displayed in a highlighting mode relative to the zoom image, so that the text keyword can be highlighted, and the recognition of a user is facilitated.

In one specific application, as shown in fig. 4, in the image thumbnail 2, text keywords are highlighted with respect to the scaled image by bold, underlined, and italic display so that the user can accurately recognize the text keywords.

In this embodiment, the terminal highlights the text keywords associated with the text type image according to the highlighting mode relative to the zoom image, which is beneficial for the user to recognize the text keywords associated with the text type image.

In one embodiment, displaying at least one text keyword associated with a text type image in a highlighted manner relative to a scaled image in an image thumbnail includes: at least one text display area in the image thumbnail displays at least one text keyword associated with the text type image respectively; the font color of the at least one text keyword is color highlighted relative to the background color of the text display area in which the at least one text keyword is located.

The text display area is an area for displaying text keywords by a user in the image thumbnail, and may be specifically a text box. At least one text keyword may be displayed in the text display area, i.e. all text keywords or part of text keywords associated with the text type image may be displayed in the text display area. Font color refers to the color of the font when the text keyword is displayed. The background color refers to the color of the background of the text display area.

Specifically, the terminal divides at least one text display area in the image thumbnail, and at least one text keyword associated with the text type image is respectively displayed in the text display area. For example, a text display area may be defined in the image thumbnail, and text keywords associated with the text type image may be displayed in the text display area. For another example, the text display areas divided in the image thumbnail are the same as the text keywords, and each text display area is used for displaying one text keyword. And the terminal displays at least one text keyword associated with the text type image in a text display area. For the displayed text keywords, the font color of the text keywords is highlighted relative to the background color of the text display area, for example, the font color of the text keywords can form complementary colors with the background color of the text display area, so that the text keywords are highlighted relative to the zoom image. For example, when the background color of the text display area is white, the font color of the text keyword may be black; while the background color of the text display area is black, the font color of the text keywords may be white, thereby ensuring that the text keywords are highlighted relative to the scaled image.

In this embodiment, the font color of the text keyword displayed by the terminal is highlighted by color with respect to the background color of the text display area where the text keyword is located, so as to ensure the recognizable degree of the text keyword, and facilitate the user to recognize the text keyword associated with the text type image.

In one embodiment, the media preview method further comprises: responding to the keyword editing triggering operation of the text type image, and displaying a keyword operation area aiming at the text type image; in response to an editing operation for at least one text keyword triggered in the keyword operation area, displaying at least one set keyword associated with the text type image set through the editing operation;

the keyword editing triggering operation is a triggering operation for editing the text keywords associated with the text type images by a user. The keyword operation area is an area in which an editing operation is performed with respect to a text keyword associated with a text type image. Editing operations include user operations for changing, adding, or deleting text keywords associated with text type images. The set keyword is a text keyword set for the text type image by the user through the editing operation.

Specifically, the user may actively edit the text keywords associated with the text type image, e.g., the user may trigger a keyword editing operation for the text keywords of the text type image. For example, a user may access attribute information of a text type image and trigger a keyword editing operation for text keywords in the attribute information. As another example, for text keywords that may be used to trigger a keyword editing operation on a text type image, a particular user may trigger a keyword editing operation by pressing text keywords of the text type image long. The terminal responds to the keyword editing triggering operation of the user on the text type image, a keyword operation area aiming at the text type image is displayed, and the user can edit the text keywords of the text type image in the keyword operation area.

The terminal responds to the editing operation of at least one text keyword triggered by the user in the keyword operation area, and displays at least one set keyword which is set through the editing operation and is associated with the text type image, so that the result set by the user through the editing operation is displayed, and the user can confirm the editing operation. In a specific implementation, a user can perform operations such as changing, adding, deleting or sorting on text keywords of the text type image to obtain at least one set keyword associated with the text type image.

Further, displaying at least one text keyword associated with the text type image in the image thumbnail, including: in the image thumbnail, at least one set keyword associated with the text type image is displayed.

Specifically, the terminal displays setting keywords associated with the text type images set by the user in the image thumbnail, so that the user can accurately distinguish the text type images through the setting keywords.

In this embodiment, the user may trigger an editing operation on a text keyword associated with a text type image, so as to perform personalized setting on the text keyword associated with the text type image according to actual needs, and the terminal displays the set keyword associated with the text type image set by the user, so that the user is supported to accurately identify based on the set keyword during previewing, and the information amount presented during media previewing is further increased by the set keyword, which is beneficial to the user to select media resources to be viewed according to the previewed thumbnail.

In one embodiment, the media preview method further comprises: responding to triggering operation of target keywords in at least one text keyword, and displaying a complete media display area; in the complete media display area, positioning a target text area in a text type image for focusing display; the target text region is a text region associated with a content subject described by the target keyword in the text type image.

The target keywords are text keywords which are selected by a user from the displayed text keywords and aimed at triggering. The media full display area is an area for displaying full media in which media of an original size can be completely displayed. The target text region is a text region in the text type image associated with the content subject described by the target keyword, i.e., the target keyword is identified from text in the target text region. Focusing and displaying the target text area refers to focusing and displaying the focus of the media display to the target text area, namely, displaying by taking the target text area as the display focus. For example, the target text region may be presented in a central location.

Specifically, the user can trigger an operation on the displayed text keywords, for example, the user can select and click on a target keyword in the text keywords, the terminal responds to the trigger operation of the user on the target keyword to display a complete media display area, and the terminal completely displays the text type image in the complete media display area. In the complete media display area, the terminal positions to a target text area in the text type image for focusing display, namely, the target text area in the text type image is used as a display focus for displaying. For example, the terminal may present a target text region in the text type image as a center. The target text region is a text region associated with the content subject described by the target keyword, so that the terminal can quickly locate the text region associated with the target keyword for focusing display.

In this embodiment, the terminal responds to the triggering operation of the user on the target keyword in the text keyword, and in the media complete display area, the terminal locates the target text area in the text type image to perform focusing display so as to perform focusing display on the text area associated with the content subject described by the target keyword, so that the focus of display can be quickly located according to the user's needs, simplifying the operation of media browsing, and being beneficial to improving the media browsing efficiency.

In one embodiment, in response to a preview trigger event for a media library, displaying a media preview area of the media library includes: displaying an access entry to a media library; in response to a triggering operation for accessing the portal, a media preview area of the media library is displayed.

The access portal is a portal for accessing the media library, and a user can access the media library by triggering operation on the access portal to browse all media in the media library. The access entrance can be set according to actual needs, for example, the access entrance can be set in an album interface and a local media cache interface. Specifically, the terminal displays an access entry of the media library, and the user can trigger an operation on the access entry displayed by the terminal, for example, the user can click on the access entry, and the terminal responds to the triggering operation on the access entry by the user to display a media preview area of the media library, wherein the user can preview each media in the media library in the media preview area.

Further, the media preview method further comprises: and in response to the triggering operation of the image thumbnail, displaying the complete image content of the text type image in the media complete display area of the text type image.

The media complete display area is an area for displaying complete media, and in the media complete display area, the media with the original size can be completely displayed. The complete image content refers to the original image in which the text type image of the original size is displayed, i.e. the original image in which the text type image is displayed.

Specifically, the user may perform a triggering operation on the image thumbnail, for example, the user may click on the image thumbnail, and the terminal responds to the triggering operation on the image thumbnail by the user to display a media complete display area of the text type image, and in the media complete display area, display complete image content of the text type image, that is, the terminal displays the text type image with the original size.

In this embodiment, a user may trigger a preview trigger event for the media library through an access entry of the media library, and through a trigger operation on an image thumbnail, the terminal displays complete image content of a text type image in a complete media display area, thereby implementing access of the user to complete media content in the media library.

In one embodiment, the media preview method further comprises: displaying at least one text keyword associated with the text type image in the media complete display area; responding to triggering operation of a target keyword in at least one text keyword, and positioning a target text area in a text type image for focusing display; the target text region is a text region associated with a content subject described by the target keyword in the text type image.

The target keywords are text keywords which are selected by a user from text keywords associated with the text type images and are used for triggering. The target text region is a text region in the text type image associated with the content subject described by the target keyword, i.e., the target keyword is identified from text in the target text region. Focusing and displaying the target text area refers to focusing and displaying the focus of the media display to the target text area, namely, displaying by taking the target text area as the display focus. For example, the target text region may be presented in a central location.

Specifically, after the terminal displays the complete image content of the text type image in the media complete display area of the text type image, the user can trigger operation for the text keywords associated with the text type image, for example, the user can trigger the text keywords associated with the text type image to be displayed, and select and click on the target keywords in the text keywords. And the terminal responds to the triggering operation of the user on the target keyword, and positions the target text region in the text type image for focusing display, namely, the target text region in the text type image is used as a display focus for displaying. For example, the terminal may present a target text region in the text type image as a center. The target text region is a text region associated with the content subject described by the target keyword, so that the terminal can quickly locate the text region associated with the target keyword for focusing display.

In this embodiment, when the terminal displays the complete image content of the text type image in the complete media display area, the terminal may also locate the target text area in the text type image for focus display in response to the triggering operation of the user on the target keyword in the text keyword associated with the text type image, so as to focus display the text area associated with the content subject described by the target keyword, and may quickly locate the focus of the display according to the user's needs, thereby simplifying the operation of media browsing and being beneficial to improving the media browsing efficiency.

In one embodiment, the partial region includes at least one of a text header region, a text body core region, or a text body highlighting region in the text-type image. Displaying an image thumbnail pointing to a text-type image in a media preview area, comprising: in the media preview area, image thumbnails pointing to text type images are displayed in order of the text type images in the media library.

The text title area refers to an area including a title of text in the text type image; the text core area refers to an area including text core content in the text type image; the text body highlighting area refers to an area including text body highlighting in the text type image. The partial region includes at least one of a text header region, a text body core region, or a text body highlighting region in the text-type image. In practical applications, the text header area, the text body core area, or the text body highlighting area may be provided with a priority, for example, the text header area has the highest priority, the text body core area has the second highest priority, and the text body highlighting area has the lowest priority, i.e., is selected as a partial area to be displayed in the image thumbnail after zooming according to the priority.

Specifically, the partial area taken from the text type image includes at least one of a text header area, a text body core area, or a text body highlighting area in the text type image. In a specific application, a partial area can be determined from a text title area, a text core area or a text highlighting area according to the priority, and the partial area can be obtained by cutting out from a text type image, so that a zoom image displayed in an image thumbnail can be obtained according to the partial area. The terminal determines the ordering of the text type images in the media library, and in the media preview area, the terminal displays the image thumbnails pointing to the text type images according to the ordering of the text type images in the media library, so that the ordering of each media in the media library is the same as the ordering of the image thumbnails pointing to the media in the media preview area. For example, the terminal may sequentially display the image thumbnails directed to the text-type images in the media preview area according to the chronological order of the text-type images in the media library.

In this embodiment, the partial area includes at least one of a text header area, a text core area, and a text highlighting area, so that the text importance level in the partial area is ensured to be higher than the text importance level outside the partial area, and thus, the area with high text importance level in the text type image is displayed in the image thumbnail, and the text content with high text importance level can be directly displayed through the previewed zoom image, so that the information amount displayed during media preview is increased, and the user can select the media resource to be viewed according to the previewed thumbnail. And the terminal displays corresponding image thumbnails according to the ordering of the text type images in the media library, so as to ensure that the ordering of each image thumbnail in the media preview area is matched with the ordering in the media library, and the terminal is beneficial to the user to select the media resources to be checked according to the previewed thumbnail.

In one embodiment, as shown in fig. 5, the media preview method further includes a text type image determination process, specifically including:

step 502, a target image in a media library is determined.

Wherein the target image is an image in the media library that needs to be determined whether it is a text type image. Specifically, the terminal determines a target image from the media library to determine for the type of the target image, and determines whether the target image is a text type image. If the target image is a text type image, a scaled image obtained by scaling a partial region cut from the target image to a predetermined size may be displayed in the image thumbnail.

In step 504, character recognition is performed on the target image, and a text display area including text in the target image is determined.

The text display area refers to an area including text in the target image, and the text display area can be determined by performing character recognition on the target image. Specifically, the terminal may perform character recognition on the target image, for example, may perform character recognition on the target image by using a template matching character recognition algorithm, a neural network character recognition algorithm or a support vector machine character recognition algorithm, so as to recognize a text display region in the target image.

In step 506, when the area ratio of the text display area in the target image reaches the text theme ratio threshold, it is determined that the target image belongs to the text type image.

The text theme ratio threshold is used for judging whether the target image belongs to the text type image, when the area ratio of the text display area in the target image exceeds the text theme ratio threshold, the relation of the text in the target image relative to the target image can be considered to accord with the text theme condition, namely the target image takes the text as the theme content, and the terminal can determine that the target image belongs to the text type image.

Specifically, the terminal may determine an area ratio of the text display area in the target image, and in specific implementation, the terminal may perform duty ratio calculation according to the text display area and the image area of the target image, to obtain the area ratio of the text display area in the target image, that is, obtain the text display area ratio. In addition, the terminal can also calculate according to the area ratio of the text display area and the non-background area in the target image to obtain the area ratio of the text display area in the target image. And comparing the area ratio of the text display area with a text theme ratio threshold value by the terminal, if the text display area ratio exceeds the text theme ratio threshold value, indicating that the target image comprises a large number of characters, and determining that the target image belongs to a text type image by the terminal by taking the text content as a main body of the target image. After determining that the target image belongs to the text type image, when the target image is subjected to preview display, the target image is used as the text type image in the image preview method, namely, a zoom image obtained by zooming a partial area intercepted from the target image to a preset size is displayed in an image thumbnail.

In this embodiment, the text display area of the target image in the media library is determined through character recognition, when the text display area duty ratio reaches the text theme duty ratio threshold, the terminal determines the target image as belonging to the text type image, so that when the target image is subjected to preview display, the target image is used as the text type image in the image preview method to perform preview display processing, text content with high text importance degree can be directly displayed through the preview zoom image, the information amount presented during media preview is increased, and the user can select the media resource to be checked according to the preview thumbnail of the preview.

In one embodiment, when the area ratio of the text display area in the target image reaches the text topic ratio threshold, determining that the target image belongs to the text type image includes: acquiring image source information of a target image; determining a text theme duty ratio threshold according to the image source information; and when the area ratio of the text display area in the target image reaches the text theme ratio threshold value, determining that the target image belongs to the text type image.

The image source information refers to information related to a target image source, and specifically may include, but is not limited to, an image obtaining mode, an image source platform, and the like. Different sources of the image can correspondingly set different text theme duty ratio thresholds so as to accurately judge whether the image belongs to the text type image or not through the text theme duty ratio thresholds. For example, for a screenshot of a chat interface or history session derived from an instant messaging application, which typically has text in the chat recorded as the subject matter, the text subject matter occupancy threshold may be set lower; for images shot by a camera, the images are generally taken as main contents, and the text theme duty ratio threshold value can be set higher at the moment, so that the type of the images can be accurately judged according to the source scene of the images. In a specific implementation, the text theme duty ratio threshold value of the images from different scenes can be preset according to actual needs.

Specifically, the terminal acquires image source information of the target image, where the image source information may include a mode of acquiring the target image, a source platform, and a scene from which the image is derived. The image source information can be obtained by inquiring the attribute information of the target image, and specifically, the terminal can inquire the attribute information of the target image, wherein various information related to the target image, such as image size, format, shooting equipment, update time and the like, is recorded in the attribute information. The terminal inquires the image source information of the target image from the attribute information of the target image. The terminal determines a text topic duty ratio threshold based on image source information of the target image. The mapping relation between the image source information and the text theme duty ratio threshold value can be preset according to actual needs. The terminal can query the text topic duty ratio threshold value associated with the image source information of the target image based on the preset mapping relation between the image source information and the text topic duty ratio threshold value. And comparing the area ratio of the text display area in the target image with a text theme ratio threshold value by the terminal, if the area ratio of the text display area in the target image reaches the text theme ratio threshold value, considering that the target image contains a large amount of text content, taking the text as the theme content, and determining the target image as belonging to the text type image by the terminal.

In this embodiment, the terminal determines the text theme duty ratio threshold according to the image source information of the target image, and determines the area duty ratio of the text display area in the target image based on the determined text theme duty ratio threshold, so as to determine whether the target image belongs to the text type image, thereby determining the type of the target image in combination with the source scene of the target image, and improving the accuracy of determining the type of the target image.

In one embodiment, the media preview method further comprises: determining a text body area comprising a text body from a text area comprising text in the text type image; intercepting a partial area from a text body area; a scaled image is obtained based on the partial region.

The text region refers to a region of text in the text type image, and the text body region refers to a region including text body in the text type image. The text in the text-type image includes text in various scenes such as seal text, terminal status bar text, terminal notification bar text, or control text, etc., while the text body refers to text related to the subject matter of the text-type image. For example, the text type image is a historical conversation screenshot, and the text body can be a conversation message in a chat interface; the text type image is an article screenshot, namely, a text body can be the text part content of the article; when the text type image is a bulletin screenshot, then the text body can be the text part content of the bulletin. The text body in the text type image is text content that the user needs to obtain, and can be used to distinguish between the individual text type images. The partial area is obtained by cutting from the text body area, and specifically can comprise at least one of a text title area, a text body core area or a text body highlighting area in the text type image.

Specifically, the terminal may perform character recognition with respect to the text type image, determine a text region including text in the text type image, and determine a text body region including text body from the text region. And the terminal intercepts a partial area in the text area, and obtains a zoom image displayed in the image thumbnail according to the intercepted partial area. In specific implementation, the terminal may perform character recognition on the text type image, determine each text region in the text type image, and further determine for the text in each text region, for example, determine whether the text belongs to a text body according to content characteristics of the text, so as to determine a text body region including the text body from each text region. The terminal can intercept and obtain a partial area with high text importance from the text area based on the text importance degree of each text in the text area, and obtain a scaled image according to the partial area, for example, the partial area can be scaled to obtain the scaled image.

In this embodiment, the terminal intercepts a partial region with high text importance from a text region including a text in a text type image to obtain a zoom image, so that a part of content in the text can be displayed in an image thumbnail through the zoom image, text content with high text importance can be directly displayed through the previewed zoom image, the information content presented during media preview is increased, and a user can select media resources to be viewed according to the previewed thumbnail.

In one embodiment, intercepting a partial region from a text body region includes: and intercepting a text title area with the text font size meeting title judgment conditions from the text area, and obtaining a partial area according to the text title area.

The text font size refers to the font size of the text in the text body area. The title determination condition is used for determining whether the text in the text body area belongs to the title text, and specifically may include a font size threshold, and when the font size of the text exceeds the font size threshold, the corresponding text may be considered to belong to the title text. The font size threshold may be preset to a specific value, or may be adaptively set according to the font size of each text in the text body area, so that the title text can be accurately identified for the text body area. The text header area refers to an area including header text in the text body area.

Specifically, the terminal may determine the text font size of each text body in the text body region, and compare the text font size of each text body with the title determination condition, respectively, to determine a text title region including a text title from the text body region according to the text font size of each text body, and the terminal obtains a partial region based on the text title region. For example, the terminal may intercept a text header area from the text body area, and take the text header area obtained by the interception as a text area that needs to be preview-presented in the media preview area.

In this embodiment, according to the text title area in the text area, where the text font size meets the title determination condition, a partial area that needs to be previewed in the media preview area is determined, so that the title of the text can be displayed in the image thumbnail, text content with high text importance is displayed through the title, the information content displayed during media preview is increased, and the user can select the media resource that needs to be viewed according to the thumbnail of the preview.

In one embodiment, intercepting a partial region from a text body region includes: and intercepting a text body core area of which the text importance quantization parameter meets the text body core judgment condition from the text body area, and obtaining a partial area according to the text body core area.

The text importance quantization parameter is a quantization parameter for representing the importance of the text, and specifically may be a text importance score obtained by estimating the importance of the text, and based on the text importance quantization parameter, the importance of each text in the text body area may be quantized, so that the text core content in the text body area may be determined, and the text body core area may be a text area including the text core content. The text core judging condition is used for judging whether the text in the text area belongs to text core content, and specifically can comprise a quantization parameter threshold value and a quantization parameter ordering result. For example, text whose importance quantization parameter reaches a quantization parameter threshold may be determined as text core content; and ordering the texts according to the respective text importance quantization parameters, and determining the text with the largest text importance quantization parameter value as the text core content.

Specifically, the terminal may determine a text importance quantization parameter of each text in the text region, where the text importance quantization parameter may be obtained by analyzing each text, for example, may count word frequencies of each text, and obtain the text importance quantization parameter based on a result of the statistics. In a specific implementation, the terminal may calculate and obtain a text importance quantization parameter, specifically a TF-IDF score, of each text body in the text body region based on a TF-IDF (Term Frequency-inverse text Frequency index) algorithm. The terminal compares the text importance degree quantization parameter of each text body with the text core judgment condition respectively, so as to determine a text body core area comprising text core content from the text body area according to the text importance degree quantization parameter of each text body, and the terminal obtains a partial area based on the text body core area. For example, the terminal may intercept a text body core region from the text body region, and take the intercepted text body core region as a text region that needs to be previewed in the media preview region.

In this embodiment, according to the text core area in which the text importance quantization parameter in the text area meets the text core determination condition, a partial area in which preview display is required in the media preview area is determined, so that text core content can be displayed in an image thumbnail, text content with high text importance is displayed through the text core content, the information content displayed during media preview is increased, and a user can select media resources to be viewed according to the previewed thumbnail.

In one embodiment, intercepting a partial region from a text body region includes: and intercepting a text body highlighting area of the text highlighting from the text body area, and obtaining a partial area according to the text body highlighting area.

Text highlighting refers to displaying text in a highlighting manner, for example, using different font types, different font sizes, different font colors, underlining, diagonal fonts, thickening, and the like. The text body highlighting area is a text area that includes highlighted text.

Specifically, the terminal may determine a display manner of each text body in the text body region, thereby determining a highlighted text in each text body, and determine a text body highlighting region based on the highlighted text, and the terminal obtains a partial region based on the text body highlighting region. For example, the terminal may intercept the text body highlighting region from the text body region and take the intercepted text body highlighting region as the text region that needs to be preview-presented in the media preview region.

In this embodiment, according to the text body highlighting area in the text body area, a partial area where preview display is required in the media preview area is determined, so that the highlighted text content can be displayed in the image thumbnail, and the text content with high text importance is displayed through the highlighted text content, so that the information content displayed during media preview is increased, and the user can select the media resource to be viewed according to the previewed thumbnail.

In one embodiment, intercepting a partial region from a text body region includes: determining a zoom mode for a text body area; according to the scaling mode and the preset size, scaling the text area to obtain a scaled text area; cutting the scaled text body area according to a preset size to obtain a partial area with the size matched with the preset size.

The scaling mode includes a mode of scaling according to the image height and a mode of scaling according to the image width. When the height of the terminal display area is larger than the width, the scaling process may be performed in a manner of scaling according to the image width. The preset size can be flexibly set according to actual needs and can be set to be a square of M pixels.

Specifically, the terminal determines a scaling mode of the text body region, and specifically, scaling according to the height of the image or scaling according to the width of the image can be determined according to the size relationship between the height and the width of the display region of the terminal. The terminal zooms the text area according to the determined zooming mode and the preset size of the image thumbnail, and scaling the height or width of the text body area to be the same as the height or width in the preset size to obtain the scaled text body area. In a specific implementation, if the scaling is performed according to the image width, the text body region may be scaled to have the same width as the image thumbnail. And cutting the scaled text body area according to the preset size of the image thumbnail by the terminal so as to intercept and obtain a partial area. For example, after scaling according to the image width and scaling the text region to have the same width as that of the image thumbnail, the terminal may crop the scaled text region according to the height of the image thumbnail and crop a partial region having the same height as that of the image thumbnail therefrom. The width of the obtained partial region is the same as the width of the image thumbnail, and the height of the partial region is also the same as the height of the image thumbnail, namely the size of the partial region is the same as the preset size of the image thumbnail.

In this embodiment, the terminal sequentially performs scaling and cutting on the text region according to the determined scaling manner and the preset size to obtain a partial region with a size matched with the preset size, and the text importance degree in the partial region is higher than the text importance degree outside the partial region, so that text content with high text importance degree can be directly displayed through the previewed scaling image, the information amount presented during media preview is increased, and the user can select media resources to be checked according to the previewed thumbnail.

In one embodiment, the media preview method further comprises: word segmentation processing is carried out on texts in the text type images, and each text word segmentation is obtained; estimating the text importance degree of each text word to obtain the respective text importance degree quantization parameter of each text word; and determining at least one text keyword from each text word according to the text importance quantization parameter.

The text word segmentation is each word obtained by word segmentation of texts in the text type images, and the number of characters of the text word segmentation is not fixed and can be one, two or more. The text importance quantization parameter is a quantization parameter for representing the importance of the text, and specifically may be a text importance score obtained by estimating the importance of the text, and based on the text importance quantization parameter, the importance of each text in the text region may be quantized, so that text keywords in the text region may be determined, where the text keywords are used to describe the content subject of the text in the text type image.

Specifically, the terminal may perform word segmentation processing on the text in the text type image to obtain each text word. In specific implementation, the terminal can perform word segmentation processing on texts in the text type images based on named entity recognition to obtain each text word. The terminal can estimate the text importance degree of each text word, particularly can estimate by adopting a frequency statistical method, and can also estimate by a pre-trained artificial neural network model or a deep learning model to obtain the respective text importance degree quantization parameter of each text word. And the terminal determines at least one text keyword from each text word according to the text importance degree quantization parameter. For example, the terminal may determine at least one text word having the largest numerical value of the text importance quantization parameter as a text keyword associated with the text type image.

In this embodiment, after performing word segmentation on a text in a text type image, the terminal estimates the text importance degree of each text word, and determines at least one text keyword from each text word based on the respective text importance degree quantization parameter of each text word, so that the text keyword is determined from the text of the text type image based on the text importance degree quantization parameter, and the accuracy of the text keyword can be ensured.

In one embodiment, the media preview method further comprises: and establishing an association relationship between at least one text keyword and the text type image.

The association relationship may be a mapping between the text keyword and the text type image, for example, the association between the text keyword and the text type image may be recorded through a mapping table. Specifically, after determining the text keywords of the text type image, the terminal may establish an association relationship between at least one text keyword and the text type image, and store the association relationship, so that the associated text keywords may be quickly queried through the text type image based on the association relationship.

Further, displaying at least one text keyword associated with the text type image in the image thumbnail, including: and inquiring at least one text keyword associated with the text type image according to the association relation, and displaying the at least one text keyword in the image thumbnail.

Specifically, when the terminal displays text keywords of the text type image, the association relationship of the text type image can be queried, at least one text keyword associated with the text type image is determined based on the association relationship, and the queried at least one text keyword is displayed in the image thumbnail. The association relation can be stored in advance, so that when the text keywords of the text type images need to be displayed, the terminal can quickly inquire the associated text keywords based on the association relation, and the text keywords can be displayed efficiently.

In this embodiment, the terminal establishes an association relationship between the text keywords and the text type images, so that the text keywords associated with the text type images can be rapidly queried based on the association relationship, which is beneficial to improving the processing efficiency of text keyword display.

In one embodiment, the media preview method further comprises: determining a text size of text in the scaled image; obtaining the resolution of the text in the scaled image according to the font size; a size relationship between the resolution of text in the scaled image and a text visual resolution threshold is determined.

The resolution of the text may be determined according to the font size of the text, and in particular, may be determined according to the font height, for example, the font height may be directly used as the resolution of the text, and the text visual resolution threshold includes the font height threshold. When the font height of the text is greater than or equal to the font height threshold, the resolution of the text may be considered to be greater than or equal to the text visual resolution threshold, i.e., the text may be accurately recognized. The text visual resolution threshold can be preset according to actual needs, and can also support a user to carry out custom setting. For example, the user may set a text visual resolution threshold based on his own recognizability.

Specifically, after obtaining the scaled image, the terminal determines the font size of the text in the scaled image, and determines the resolution of the text in the scaled image according to the font size, for example, the font height included in the font size may be determined as the resolution of the text. The terminal inquires a preset text visual resolution threshold value, and compares the resolution of the text in the scaled image with the text visual resolution threshold value, so that the size relation between the resolution of the text in the scaled image and the text visual resolution threshold value is determined. If the resolution of the text in the scaled image is not less than the text visual resolution threshold, indicating that the font size of the text in the scaled image is sufficient for user recognition, it may be determined that the text in the scaled image meets the text recognition condition. Further, for a scaled image including text conforming to a text recognition condition, the terminal may not display text keywords associated with the text type image; and for the zoom image containing the text which does not accord with the text recognition condition, namely the zoom image containing the text with the resolution smaller than the text visual resolution threshold, the terminal can display text keywords associated with the text type image, so that the text content with high text importance in the text type image can be ensured to be effectively displayed directly.

In this embodiment, the terminal determines, according to the resolution determined by the font size of the text in the scaled image and a preset text visual resolution threshold, a size relationship between the resolution of the text in the scaled image and the text visual resolution threshold, so that the recognition degree of the text in the scaled image can be accurately determined based on the font size and the text visual resolution threshold.

The application also provides an application scene, which applies the media preview method. Specifically, the application of the media preview method in the application scene is as follows:

currently, in applications supporting browsing of pictures or videos, a thumbnail that reduces the resolution of the picture and cuts out is used for displaying a picture list, and a user can preview the whole content of the picture through the thumbnail. However, for pictures containing a large amount of characters, such as bulletin, text screenshot, and the like, the characters in the thumbnail are too small to be seen due to the compression of resolution, in addition, the head and tail information of the rectangular picture is easy to lose because the picture thumbnail is generally cut and displayed in square, the user is difficult to preview the core content expressed by the picture through the picture thumbnail, and the user is often required to click the expansion map to know the picture information, so that the user experience is poor. Based on this, in the media preview method provided in this embodiment, for the text type image mainly containing text content, the text main body area is extracted, the position of the text key information is confirmed, and then the thumbnail is generated by clipping and scaling, so that the thumbnail displays the text core content preferentially, and invalid information display is reduced. When the thumbnail text is still smaller than the visual resolution, the thumbnail text is further displayed through the picture text keywords, so that the thumbnail text can be identified, a user can know the key information of the text type image through the picture thumbnail, and the information expression capability of the text type image is improved.

The thumbnail is a small-size image after zooming and cropping the picture when the thumbnail is used for picture preview display, and the thumbnail is small and has very high loading speed, so that the thumbnail can be generally used for quick preview of a list. Resolution is a parameter that measures how much data is in a bitmap image, and is generally expressed as pixels per inch (ppi) and dots per inch (dpi). The visual resolution of text refers to the minimum resolution at which text content can be accurately identified. A pixel refers to a minimum unit in an image represented by a sequence of numbers.

The media preview method provided by the embodiment can be applied to thumbnail display of text type images, the non-text area is removed and scaled by extracting the text content and the position of the picture, and then the key text position is taken as the starting position for clipping, so that the key area thumbnail is obtained, the display efficiency of key information of the thumbnail can be effectively improved, and the display of invalid information is reduced. When the thumbnail is still smaller than the visual resolution, text keywords are extracted and the display of text thumbnail contents by using the suspended keywords is supported, so that the problem that the thumbnail cannot be seen clearly due to more text is effectively avoided, a user can conveniently and quickly know the subject contents of the picture, and the user experience is improved.

Specifically, as shown in fig. 6, the media preview method provided in this embodiment may be applied to a scene browsed by a terminal album, in an album preview interface of the terminal, thumbnails tiled with a picture list may be displayed, and the picture may include a picture mainly including characters, especially a screenshot picture including characters often propagated in social applications, where the current thumbnail can not see the text content of the picture thumbnail due to the reduced picture resolution, only the approximate distribution and foreground and background colors of the characters can be roughly seen, and a user cannot intuitively preview the text information of the picture through the thumbnail. As shown in fig. 7, the media preview method provided in this embodiment may also be applied to a scenario where an application program browses acquired media, where a user may browse acquired media such as pictures and videos, and may search for each media. In the browsing interface, the terminal displays the thumbnail of each media, and for the picture with characters as the main, the thumbnail cannot recognize the characters in the thumbnail because of compression, and a user needs to click to view the original picture to acquire detailed image information.

The current thumbnail is centered and compressed into square display by default, so that the non-square area content of the rectangular picture is not displayed in the thumbnail, and invalid information is displayed in the thumbnail due to the fact that the picture possibly has a non-text area, and the default thumbnail generation mode information is not efficient in transmission. As shown in fig. 8, for one image including the falsification notification content, it is a rectangular image. When the thumbnail is displayed, as shown in fig. 9, the text area where the fake notification is located is enlarged and cut into squares to be displayed as the thumbnail, and when the thumbnail is displayed, the characters in the thumbnail meet the text identification conditions, so that the user can directly obtain important information of the original image from the thumbnail. As shown in fig. 10, for a rectangular article screenshot, which is a rectangular image, if the article screenshot is directly cut centrally and compressed into a thumbnail, the user cannot be supported to quickly obtain the key information of the article. When the thumbnail is displayed, as shown in fig. 11, the region where the title of the article is located is cut out and enlarged, and when the thumbnail is displayed, the text in the thumbnail accords with the text recognition condition, and the user can directly obtain the title of the article included in the original image from the thumbnail, so that the key information of the article can be quickly known, and particularly, a nutritional technician guides the subject content with light diet. According to the media preview method provided by the embodiment, the main text region is identified, the positions of the key text regions are confirmed, and the text of the key region is preferentially displayed, so that the thumbnail can more intuitively embody the text subject content.

Further, when the text resolution of the thumbnail after cutting the key region is still smaller than the visual resolution, if the text of the picture is more and not clearly seen, text keywords in the picture can be further extracted for display. As shown in fig. 12, a user can turn on a switch for displaying a picture keyword in a picture manager, switch the display of the picture text keyword, display suspended keyword text by a text type image thumbnail, and the displayed keyword text can be highlighted by a complementary color with a background color at the position, so that the user can conveniently and quickly preview the key information, and the user can trigger full-screen display of the picture content by long-pressing the thumbnail. For example, in the first thumbnail, the displayed keyword words include "index, mei-gang, informative, big-disc and middle-disc", so that the subject content of the image pointed by the thumbnail is described by the keyword, thereby facilitating rapid understanding of the key information of the corresponding image.

Specifically, as shown in fig. 13, the media preview method provided in this embodiment includes steps 1302 to 1320, where: step 1302, triggering preview media; specifically, the user can click to access the media library to trigger the preview of each media in the media library; step 1304, determining whether a thumbnail cache exists; the terminal may determine whether thumbnail images corresponding to respective media in the media library have been cached in advance, and if so, directly jump to step 1320 to display respective thumbnail images for previewing; step 1306, extracting characters in the image if the thumbnail cache does not exist; the terminal can extract characters from the image through a character recognition technology; step 1308, determining whether the image is a text type image; the terminal can determine whether the picture belongs to a text type image according to the text extracted for the picture; if the picture does not belong to the text type image, directly displaying a corresponding thumbnail, wherein the thumbnail can be an image obtained by scaling the image; step 1310, if the picture belongs to the text type image, the terminal eliminates the non-text area content in the picture; the terminal can reject the content which is irrelevant to the text in the picture, such as status bar text, notification bar text, interface element text and the like, and reserve the text area content; step 1312, recognizing key text positions for text region contents; the terminal can identify key contents of the reserved text area content so as to identify the position of key characters in the text area content; step 1314, clipping key areas; after the terminal determines the position of the key text in the text region content, cutting out the key region in which the key text is positioned; step 1316, determining whether the cut key region is smaller than the visual resolution; if not, the terminal can display the key area as a thumbnail after scaling treatment; step 1318, if the cut key area is smaller than the visual resolution, the terminal extracts keywords in the thumbnail; step 1320, displaying the thumbnail of the preview; for non-text type images, the displayed thumbnail can be an image obtained by directly scaling the original image; for images with the resolution of the key areas not smaller than the visual resolution, the displayed thumbnail can be an image obtained by scaling the key areas; for an image with the resolution of the key area smaller than the visual resolution, the displayed thumbnail can be an image obtained by scaling the key area, and keywords obtained through keyword extraction are displayed in a suspending manner above the thumbnail, wherein the displayed keywords are not smaller than the visual resolution.

Further, the media preview method provided in this embodiment may be applied to a picture whose picture content is mainly text, and the determination of the text type image may be determined by identifying the proportion of the text region content to the whole picture. For example, the text content can be extracted by OCR (Optical Character Recognition ), which is a process of analyzing, recognizing and processing an image file of a text material to obtain text and layout information. And calculating the duty ratio of the text region in the whole non-background color region, wherein the background color is the color with the largest occurrence number in the picture, and when the duty ratio of the text region is greater than a certain threshold value, for example, 85%, the picture can be marked as a text type image. As shown in fig. 14, for the article shot of fig. 10, after character content is extracted by OCR, the area covered by the rectangular frame is determined as the character area in the article shot, and the article shot can be determined to belong to the text-based text type image based on the text by the area ratio of the character area.

Further, the picture thumbnail generation mode comprises that the width or the height is preferential, and the picture thumbnail generation mode is obtained by clipping and scaling in a middle position in proportion. As shown in fig. 15, the original is wider than it is high, centered with high priority for cropping, and the final cropped thumbnail range is shown in dashed boxes. As shown in fig. 16, the original is greater in height than in width, is cut with the width first centered, and the final cut thumbnail range is shown in dashed boxes. In the media preview method provided by the embodiment, the text content and the region where the text is located in the picture are extracted through OCR, after the non-text region is removed, the position where the text key information is located is judged, and the position where the text in the key region is located is used as a datum point for cutting. In the process of dynamically clipping the thumbnail of the key text region, as shown in fig. 17, clipping is performed by centering the key text region with priority of width, specifically, determining the picture text region from the original image, determining the starting position of the key text in the picture text region, clipping according to the starting position of the key text, and clipping with the size of the thumbnail, wherein the range of the finally clipped thumbnail is shown as a dashed box.

Since more text type images are obtained through screenshot, some non-text content related areas exist in the images, such as a top status bar of a smart phone screenshot, a head and tail blank area of a notification type image content, and the like. Conventional thumbnail cropping may also generate these regions into the thumbnail preview, resulting in smaller thumbnail text after containing these regions that is difficult to identify. When the thumbnail is cut, all text contents are obtained through OCR recognition, and according to the non-text character characteristics, such as the top status column characters of the screenshot, the line characters are generally time points and numbers; for example, the text of the button in the picture is composed of a few characters, and only the text area content is cut, so that the effective text area in the thumbnail can be improved. Furthermore, in order to avoid that the text cut out by the thumbnail is too close to the edge of the picture, a very small part of non-text area can be reserved, thereby being convenient for attractive appearance.

Further, the method for determining the position of the key text in the picture text recognition obtained by OCR can include, but is not limited to, whether the key text is a text title, importance of a text paragraph, etc. Specifically, for recognition of a text title in a picture, since the title is generally an enlarged font compared with a conventional font, the text height of a text region obtained by OCR recognition can be determined, the width cannot represent the text size, the width of english characters and chinese characters are inconsistent, the number of continuous characters in the same line is greater than a certain threshold, and the text height is maximum. The thumbnail may be preferentially presented with respect to the title position. For the paragraph importance, characters obtained through OCR recognition can be extracted, full-half-angle conversion and simplified and complex conversion are carried out on the characters, punctuation marks, word gases, adverbs, adjectives, numbers and the like which are irrelevant to the text importance are removed, and TF-IDF scores are calculated on all words of the text after word segmentation according to each line. TF-IDF is a weighting technique for information retrieval and data mining, where TF is Term Frequency (Term Frequency) and IDF is the inverse text Frequency index (Inverse Document Frequency). The TF-IDF calculation method may include: by counting the word frequency (TF) = (number of occurrences of a certain word/total number of words in a picture) of each word occurrence, and the Inverse Document Frequency (IDF) = log (total corpus document/(number of documents containing the word+1)), TF-idf=word frequency (TF) × Inverse Document Frequency (IDF) is available. And adding and averaging the TF-IDF scores of all words to obtain the important score of each line of characters, and finally obtaining the core paragraph of the characters corresponding to the picture. In addition, the emphasized characters can be determined by identifying the emphasized characters, and the emphasized characters are generally emphasized through setting colors or italics and bold letters of non-text, so that the important score weighting can be carried out on the characters at the positions by counting the positions and the number of the emphasized characters.

The specific thumbnail clipping process includes: the thumbnail zooming mode is determined to be high priority or width priority, and the width priority is taken as an example because the general screen of the mobile terminal is high and larger than the width and can be adopted in general cases. After the picture is removed from the text content, the picture is scaled, and the thumbnail scaling ratio=the text region width/the thumbnail width. And cutting the reduced picture, wherein the starting height/scaling ratio of the key characters is equal to or smaller than the starting height/scaling ratio of the key characters, namely, the starting abscissa= [0 (the width priority abscissa is 0), the maximum ordinate cannot exceed the height of the scaled picture, namely, the height of the thumbnail, and the cutting width and the cutting height are the fixed size of the current thumbnail.

Because the thumbnail is generated by scaling the picture, the problem that the thumbnail is not clearly seen after the text is compressed due to more text still exists after the key region is cut. At this time, the text height recognized by the original image OCR can be obtained by averaging all the text, and after scaling, the text height of the thumbnail is obtained by dividing the scaling, which characterizes the text resolution. When the text height of the thumbnail is smaller than a certain threshold, namely lower than the general macroscopic resolution, keyword previews can be generated for text contents corresponding to the thumbnail. The keyword generation method can include, but is not limited to, conventional TF-IDF, texttrank algorithm, lda (Latent Dirichlet Allocation, hidden Dirichlet distribution) and other algorithms based on text importance to mine important keywords, and keyword generation algorithm based on deep learning model can be adopted.

Specifically, word segmentation processing is performed on characters of the picture through OCR recognition, the word segmentation is introduced into named entity recognition (NER, named Entity Recognition), and names, place names, organization names, proper nouns and the like in the characters of the picture are extracted, so that all final words are obtained. Named entity recognition is also called special name recognition, and refers to the recognition of entities with specific meaning in text, and mainly comprises personal names, place names, organization names, proper nouns and the like. The TF-IDF based method can adopt the formula of TF-IDF to count the TF-IDF score of each word, and the TF-IDF scores are arranged in descending order, N values at the head can be taken as thumbnail keywords, and N can also be obtained by dynamically calculating whether the total length of the keywords is larger than the visual resolution.

For an algorithm based on a deep learning model and capable of being divided into an unsupervised algorithm and a supervised algorithm, the supervised method is used for obtaining an estimated model by extracting text features such as TF-IDF values, first occurrence positions, whether the text features are in a title, part of speech, context features and the like, and then training the model under different neural network structures by combining training corpus data, keyword probability of each word can be obtained through the estimated model, and importance probability of the keywords can be obtained according to a model estimated value reverse order. The unsupervised method may employ extraction of the vectors of the document (Embedding) and the vectors of all words (Embedding), as based on a large-scale text pre-training model BERT (Bidirectional Encoder Representation from Transformers, bi-directional coded representation based on a converter), which is a pre-training technique for natural language processing (NLP, natural Language Processing). And calculating the similarity between the candidate keywords and the document, and selecting the first N words as final keywords according to the reverse order of the similarity.

After the thumbnail keywords are generated, floating display can be performed above the thumbnail in the media preview interface. The user may select whether to turn on the function, while long-press thumbnails may reveal the full view to which the thumbnail corresponds. Further, in order to avoid repeated calculation and judgment of the pictures which are checked by the user later, the processed results can be cached locally, the pictures which are checked by the user next time are read and rendered to generate the thumbnail preferentially through caching, and the speed of checking the preview thumbnail by the user is improved.

According to the media preview method, the thumbnail is generated by scaling and cutting the text type image by adopting the key areas, so that the effective text information quantity of the thumbnail is improved, the key information of the article is displayed in a high-quality mode, and a user can conveniently know the subject content of the whole image through the thumbnail. When the text resolution is still lower than the visual resolution of the user, the problem that the thumbnail cannot be seen clearly when more text is displayed through keyword suspension is avoided. According to the method and the device for the thumbnail information transmission of the text type image, the thumbnail information transmission efficiency of the text type image can be effectively improved, and a user can rapidly preview the core content of the image through the thumbnail in the image list.

It should be understood that, although the steps in the flowcharts related to the embodiments described above are sequentially shown as indicated by arrows, these steps are not necessarily sequentially performed in the order indicated by the arrows. The steps are not strictly limited to the order of execution unless explicitly recited herein, and the steps may be executed in other orders. Moreover, at least some of the steps in the flowcharts described in the above embodiments may include a plurality of steps or a plurality of stages, which are not necessarily performed at the same time, but may be performed at different times, and the order of the steps or stages is not necessarily performed sequentially, but may be performed alternately or alternately with at least some of the other steps or stages.

Based on the same inventive concept, the embodiment of the application also provides a media preview device for realizing the related media preview method. The implementation of the solution provided by the device is similar to the implementation described in the above method, so the specific limitation in one or more embodiments of the media preview device provided below may be referred to the limitation of the media preview method hereinabove, and will not be repeated here.

In one embodiment, as shown in FIG. 18, a media preview device 1800 is provided, comprising: a preview area display module 1802, a thumbnail display module 1804, and a text area display module 1806, wherein:

a preview area display module 1802 for displaying a media preview area of a media library in response to a preview trigger event for the media library;

a thumbnail display module 1804 for displaying an image thumbnail pointing to a text type image in the media preview area when the media library includes the text type image; the image thumbnail has a preset size, and the text display area ratio in the text type image reaches a text theme ratio threshold;

a text region display module 1806, configured to display, in an image thumbnail pointing to a text type image, a scaled image obtained by scaling a partial region taken from the text type image to a preset size; the resolution of the text in the scaled image is not less than the visual resolution threshold of the text and meets the text recognition condition; in the text type image, the degree of importance of the text in the partial region is higher than that of the text outside the partial region.

In one embodiment, the method further comprises a keyword display module for displaying at least one text keyword associated with the text type image in the image thumbnail; at least one text keyword for describing a content subject of text in the text-type image.

In one embodiment, the keyword display module is further configured to display, in the scaled image, at least one text keyword associated with the text-type image from which the scaled image originated in response to a keyword presentation triggering operation in the media preview area.

In one embodiment, the system further comprises an activation entry display module for displaying a keyword presentation activation entry identifying a state to be activated in the media preview area; the keyword display module is further used for responding to the triggering operation of the keyword display activation entry, displaying the keyword display activation entry in a switching mode to be in an identification activation state, and displaying at least one text keyword associated with the text type image from which the zoom image is derived in the zoom image; and the activation entry display module is also used for responding to the triggering operation of the keyword display activation entry for identifying the activation state, displaying the keyword display activation entry in a switching manner as identifying the state to be activated, and hiding at least one text keyword in the zoom image.

In one embodiment, the keyword display module is further configured to display, in the image thumbnail, a preset number of text keywords associated with the text type image; and sequencing and displaying the preset number of text keywords according to the respective text importance degrees.

In one embodiment, the keyword display module is further configured to display at least one text keyword associated with the text type image in the image thumbnail if the resolution of the text in the scaled image is less than the text visual resolution threshold.

In one embodiment, the keyword display module is further configured to display at least one text keyword associated with the text type image in the image thumbnail in a highlighted manner with respect to the scaled image.

In one embodiment, the keyword display module is further configured to display at least one text keyword associated with the text type image in at least one text display area in the image thumbnail, respectively; the font color of the at least one text keyword is color highlighted relative to the background color of the text display area in which the at least one text keyword is located.

In one embodiment, the method further comprises a keyword editing triggering module, which is used for responding to the keyword editing triggering operation of the text type image and displaying a keyword operation area for the text type image; in response to an editing operation for at least one text keyword triggered in the keyword operation area, displaying at least one set keyword associated with the text type image set through the editing operation; and the keyword display module is also used for displaying at least one set keyword associated with the text type image in the image thumbnail.

In one embodiment, the system further comprises a complete display area module and a focusing display module; wherein: the complete display area module is used for responding to the triggering operation of the target keyword in the at least one text keyword and displaying the complete display area of the media; the focusing display module is used for positioning a target text area in the text type image in the complete media display area to perform focusing display; the target text region is a text region associated with a content subject described by the target keyword in the text type image.

In one embodiment, preview area display module 1802 is further configured to display access entries to a media library; responding to a triggering operation for an access entry, and displaying a media preview area of a media library; the system also comprises a thumbnail triggering response module, which is used for responding to the triggering operation of the image thumbnail and displaying the complete image content of the text type image in the complete media display area of the text type image.

In one embodiment, the thumbnail triggering response module is further configured to display at least one text keyword associated with the text type image in the media full presentation area; responding to triggering operation of a target keyword in at least one text keyword, and positioning a target text area in a text type image for focusing display; the target text region is a text region associated with a content subject described by the target keyword in the text type image.

In one embodiment, the partial region includes at least one of a text header region, a text body core region, or a text body highlighting region in the text-type image; the thumbnail display module 1804 is also configured to display, in the media preview area, image thumbnails that point to text-type images in order of the text-type images in the media library.

In one embodiment, the method further comprises a target image determining module, a character recognition module and an image type determining module; wherein: the target image determining module is used for determining target images in the media library; the character recognition module is used for carrying out character recognition on the target image and determining a text area comprising texts in the target image; and the image type determining module is used for determining that the target image belongs to the text type image when the area ratio of the text display area in the target image reaches the text theme ratio threshold value.

In one embodiment, the image type determination module includes a source information determination module, a duty cycle threshold determination module, and a duty cycle comparison module; wherein: the source information determining module is used for acquiring image source information of the target image; the duty ratio threshold determining module is used for determining a text theme duty ratio threshold according to the image source information; and the duty ratio comparison module is used for determining that the target image belongs to the text type image when the area duty ratio of the text display area in the target image reaches the text theme duty ratio threshold value.

In one embodiment, the method further comprises a scaled image obtaining module for determining a text body area including a text body from a text area including text in the text type image; intercepting a partial area from a text body area; a scaled image is obtained based on the partial region.

In one embodiment, the scaled image acquisition module is further configured to at least one of: intercepting a text title area with the text font size meeting title judgment conditions from the text area, and obtaining a partial area according to the text title area; intercepting a text body core area with text importance quantization parameters meeting text core judgment conditions from the text body area, and obtaining a partial area according to the text body core area; and intercepting a text body highlighting area of the text highlighting from the text body area, and obtaining a partial area according to the text body highlighting area.

In one embodiment, the scaled image obtaining module is further configured to determine a scaling manner for the text body region; according to the scaling mode and the preset size, scaling the text area to obtain a scaled text area; cutting the scaled text body area according to a preset size to obtain a partial area with the size matched with the preset size.

In one embodiment, the method further comprises a separation processing module, an importance degree estimation module and an estimation result processing module; wherein: the segmentation processing module is used for carrying out word segmentation processing on texts in the text type images to obtain text word segmentation; the importance degree estimation module is used for estimating the importance degree of the texts of each text word to obtain the respective quantization parameter of the importance degree of the texts of each text word; and the estimation result processing module is used for determining at least one text keyword from each text word according to the text importance degree quantization parameter.

In one embodiment, the method further comprises an association establishing module, which is used for establishing an association relationship between at least one text keyword and the text type image; and the keyword display module is also used for inquiring at least one text keyword associated with the text type image according to the association relation and displaying the at least one text keyword in the image thumbnail.

In one embodiment, the method further comprises a text recognition determination module for determining a font size of text in the scaled image; obtaining the resolution of the text in the scaled image according to the font size; a size relationship between the resolution of text in the scaled image and a text visual resolution threshold is determined.

The various modules in the media preview device described above may be implemented in whole or in part in software, hardware, and combinations thereof. The above modules may be embedded in hardware or may be independent of a processor in the computer device, or may be stored in software in a memory in the computer device, so that the processor may call and execute operations corresponding to the above modules.

In one embodiment, a computer device is provided, which may be a terminal or a server, and the internal structure of which may be as shown in fig. 19. The computer device includes a processor, a memory, an input/output interface, a communication interface, a display unit, and an input means. The processor, the memory and the input/output interface are connected through a system bus, and the communication interface, the display unit and the input device are connected to the system bus through the input/output interface. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage media. The input/output interface of the computer device is used to exchange information between the processor and the external device. The communication interface of the computer device is used for carrying out wired or wireless communication with an external terminal, and the wireless mode can be realized through WIFI, a mobile cellular network, NFC (near field communication) or other technologies. The computer program is executed by a processor to implement a media preview method. The display unit of the computer equipment is used for forming a visual picture, and can be a display screen, a projection device or a virtual reality imaging device, wherein the display screen can be a liquid crystal display screen or an electronic ink display screen, the input device of the computer equipment can be a touch layer covered on the display screen, can also be a key, a track ball or a touch pad arranged on a shell of the computer equipment, and can also be an external keyboard, a touch pad or a mouse and the like.

It will be appreciated by those skilled in the art that the structure shown in FIG. 19 is merely a block diagram of some of the structures associated with the present inventive arrangements and is not limiting of the computer device to which the present inventive arrangements may be applied, and that a particular computer device may include more or fewer components than shown, or may combine some of the components, or have a different arrangement of components.

In an embodiment, there is also provided a computer device comprising a memory and a processor, the memory having stored therein a computer program, the processor implementing the steps of the method embodiments described above when the computer program is executed.

In one embodiment, a computer-readable storage medium is provided, storing a computer program which, when executed by a processor, implements the steps of the method embodiments described above.

In an embodiment, a computer program product is provided, comprising a computer program which, when executed by a processor, implements the steps of the method embodiments described above.

It should be noted that, the user information (including but not limited to user equipment information, user personal information, etc.) and the data (including but not limited to data for analysis, stored data, presented data, etc.) related to the present application are information and data authorized by the user or sufficiently authorized by each party, and the collection, use and processing of the related data need to comply with the related laws and regulations and standards of the related country and region.

Those skilled in the art will appreciate that implementing all or part of the above described methods may be accomplished by way of a computer program stored on a non-transitory computer readable storage medium, which when executed, may comprise the steps of the embodiments of the methods described above. Any reference to memory, database, or other medium used in embodiments provided herein may include at least one of non-volatile and volatile memory. The nonvolatile Memory may include Read-Only Memory (ROM), magnetic tape, floppy disk, flash Memory, optical Memory, high density embedded nonvolatile Memory, resistive random access Memory (ReRAM), magnetic random access Memory (Magnetoresistive Random Access Memory, MRAM), ferroelectric Memory (Ferroelectric Random Access Memory, FRAM), phase change Memory (Phase Change Memory, PCM), graphene Memory, and the like. Volatile memory can include random access memory (Random Access Memory, RAM) or external cache memory, and the like. By way of illustration, and not limitation, RAM can be in the form of a variety of forms, such as static random access memory (Static Random Access Memory, SRAM) or dynamic random access memory (Dynamic Random Access Memory, DRAM), and the like. The databases referred to in the embodiments provided herein may include at least one of a relational database and a non-relational database. The non-relational database may include, but is not limited to, a blockchain-based distributed database, and the like. The processor referred to in the embodiments provided in the present application may be a general-purpose processor, a central processing unit, a graphics processor, a digital signal processor, a programmable logic unit, a data processing logic unit based on quantum computing, or the like, but is not limited thereto.

The technical features of the above embodiments may be arbitrarily combined, and all possible combinations of the technical features in the above embodiments are not described for brevity of description, however, as long as there is no contradiction between the combinations of the technical features, they should be considered as the scope of the description. The foregoing examples illustrate only a few embodiments of the application and are described in detail herein without thereby limiting the scope of the application. It should be noted that it will be apparent to those skilled in the art that several variations and modifications can be made without departing from the spirit of the application, which are all within the scope of the application. Accordingly, the scope of the application should be assessed as that of the appended claims.

Claims

1. A method of media previewing, the method comprising:

responding to a preview trigger event for a media library, and displaying a media preview area of the media library;

when the media library comprises a text type image, displaying an image thumbnail pointing to the text type image in the media preview area; the image thumbnail has a preset size, and the text display area ratio in the text type image reaches a text theme ratio threshold;

Displaying a scaled image in the image thumbnail pointing to the text type image, the scaled image scaling a partial region cut from the text type image to the preset size; the resolution of the text in the scaled image is not less than a text visual resolution threshold; in the text type image, the text importance degree in the partial area is higher than the text importance degree outside the partial area.

2. The method according to claim 1, wherein the method further comprises:

displaying at least one text keyword associated with the text type image in the image thumbnail;

the at least one text keyword is used for describing the content theme of the text in the text type image.

3. The method of claim 2, wherein displaying at least one text keyword associated with the text-type image in the image thumbnail comprises:

in response to a keyword presentation triggering operation in the media preview area, at least one text keyword associated with a text type image from which the scaled image originated is displayed in the scaled image.

4. A method according to claim 3, characterized in that the method further comprises:

Displaying a keyword display activation entry for identifying a state to be activated in the media preview area;

the displaying, in the scaled image, at least one text keyword associated with a text-type image from which the scaled image originated in response to a keyword presentation triggering operation in the media preview area, includes:

in response to a triggering operation of the keyword presentation activation portal, displaying the keyword presentation activation portal in a switching manner as an identification activation state, and displaying at least one text keyword associated with a text type image from which the scaled image is derived in the scaled image;

the method further comprises the steps of:

and in response to a triggering operation of the keyword presentation activation entry identifying an activation state, displaying the keyword presentation activation entry as identifying a state to be activated in a switching manner, and hiding the at least one text keyword in the scaled image.

5. The method of claim 2, wherein displaying at least one text keyword associated with the text-type image in the image thumbnail comprises:

displaying a preset number of text keywords associated with the text type image in the image thumbnail;

And the preset number of text keywords are displayed in a sequence according to the respective text importance degrees.

6. The method of claim 2, wherein displaying at least one text keyword associated with the text-type image in the image thumbnail comprises:

and displaying at least one text keyword associated with the text type image in the image thumbnail under the condition that the resolution of the text in the scaled image is smaller than the text visual resolution threshold.

7. The method of claim 2, wherein displaying at least one text keyword associated with the text-type image in the image thumbnail comprises:

and displaying at least one text keyword associated with the text type image in the image thumbnail in a highlighting manner relative to the zoom image.

8. The method of claim 7, wherein displaying at least one text keyword associated with the text type image in the image thumbnail in a highlighted manner with respect to the scaled image comprises:

at least one text display area in the image thumbnail respectively displays at least one text keyword associated with the text type image;

And the font color of the at least one text keyword is highlighted relative to the background color of the text display area where the at least one text keyword is located.

9. The method according to claim 2, wherein the method further comprises:

responding to a keyword editing triggering operation on the text type image, and displaying a keyword operation area aiming at the text type image;

responding to the editing operation for the at least one text keyword triggered in the keyword operation area, and displaying at least one set keyword which is set through the editing operation and is associated with the text type image;

the displaying, in the image thumbnail, at least one text keyword associated with the text type image, including:

and displaying the at least one setting keyword associated with the text type image in the image thumbnail.

10. The method according to claim 2, wherein the method further comprises:

responding to the triggering operation of the target keyword in the at least one text keyword, and displaying a complete media display area;

positioning a target text area in the text type image in the media complete display area for focusing display;

The target text region is a text region associated with a content subject described by the target keyword in the text type image.

11. The method of claim 1, wherein the displaying a media preview area of a media library in response to a preview trigger event for the media library comprises:

displaying an access entry to a media library;

responding to the triggering operation for the access entrance, and displaying a media preview area of the media library;

the method further comprises the steps of:

and responding to the triggering operation of the image thumbnail, and displaying the complete image content of the text type image in the media complete display area of the text type image.

12. The method of claim 11, wherein the method further comprises:

displaying the at least one text keyword associated with the text type image in the media complete display area;

responding to the triggering operation of the target keyword in the at least one text keyword, and positioning a target text area in the text type image for focusing display;

13. The method of any one of claims 1 to 12, wherein the partial region comprises at least one of a text header region, a text body core region, or a text body highlighting region in the text-type image;

displaying an image thumbnail pointing to the text type image in the media preview area comprises:

and displaying the image thumbnail pointing to the text type image in the media preview area according to the ordering of the text type image in the media library.

14. The method according to claim 1, wherein the method further comprises:

determining a target image in the media library;

performing character recognition on the target image, and determining a text display area comprising texts in the target image;

and when the area ratio of the text display area in the target image reaches the text theme ratio threshold value, determining that the target image belongs to a text type image.

15. The method of claim 14, wherein the determining that the target image belongs to a text type image when the area ratio of the text display area in the target image reaches the text subject ratio threshold comprises:

Acquiring image source information of the target image;

determining a text theme duty ratio threshold according to the image source information;

16. The method according to claim 1, wherein the method further comprises:

determining a text body area comprising a text body from a text area comprising text in the text type image;

intercepting the partial area from the text body area;

the scaled image is obtained based on the partial region.

17. The method of claim 16, wherein said intercepting said partial region from said text body region comprises at least one of:

intercepting a text title area with the text font size meeting title judgment conditions from the text area, and obtaining the partial area according to the text title area;

intercepting a text body core area with text importance quantization parameters meeting text core judgment conditions from the text body area, and obtaining the partial area according to the text body core area;

And intercepting a text body highlighting area with a text highlighting from the text body area, and obtaining the partial area according to the text body highlighting area.

18. The method of claim 16, wherein said intercepting said partial region from said text body region comprises:

determining a scaling mode for the text body area;

according to the scaling mode and the preset size, scaling the text area to obtain a scaled text area;

and cutting the scaled text body area according to the preset size to obtain the partial area with the size matched with the preset size.

19. The method according to claim 2, wherein the method further comprises:

word segmentation processing is carried out on texts in the text type images, and each text word segmentation is obtained;

estimating the text importance degree of each text word to obtain the respective text importance degree quantization parameter of each text word;

and determining the at least one text keyword from the text segmentation according to the text importance quantization parameter.

20. The method of claim 19, wherein the method further comprises:

establishing an association relationship between the at least one text keyword and the text type image;

and inquiring to obtain the at least one text keyword associated with the text type image according to the association relation, and displaying the at least one text keyword in the image thumbnail.

21. The method according to any one of claims 1 to 20, further comprising:

determining a font size of text in the scaled image;

obtaining the resolution of the text in the scaled image according to the font size;

a size relationship between a resolution of text in the scaled image and the text visual resolution threshold is determined.

22. A media preview device, the device comprising:

a thumbnail display module for displaying an image thumbnail pointing to a text type image in the media preview area when the media library includes the text type image; the image thumbnail has a preset size, and the text display area ratio in the text type image reaches a text theme ratio threshold;

A text region display module for displaying, in the image thumbnail directed to the text type image, a scaled image in which a partial region cut from the text type image is scaled to the preset size; the resolution of the text in the scaled image is not less than a text visual resolution threshold; in the text type image, the text importance degree in the partial area is higher than the text importance degree outside the partial area.

23. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor implements the steps of the method of any one of claims 1 to 21 when the computer program is executed.

24. A computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method of any one of claims 1 to 21.

25. A computer program product comprising a computer program, characterized in that the computer program, when executed by a processor, implements the steps of the method of any one of claims 1 to 21.