WO2020187117A1 - 附图页面的展示方法和装置、文本页面的展示方法和装置 - Google Patents

附图页面的展示方法和装置、文本页面的展示方法和装置 Download PDF

Info

Publication number
WO2020187117A1
WO2020187117A1 PCT/CN2020/078922 CN2020078922W WO2020187117A1 WO 2020187117 A1 WO2020187117 A1 WO 2020187117A1 CN 2020078922 W CN2020078922 W CN 2020078922W WO 2020187117 A1 WO2020187117 A1 WO 2020187117A1
Authority
WO
WIPO (PCT)
Prior art keywords
text
data
page
target
image
Prior art date
Application number
PCT/CN2020/078922
Other languages
English (en)
French (fr)
Inventor
张济徽
黑马
兀提哈
高欣婷
傅学勇
川合滋典
蔡洁
贾忠贤
Original Assignee
智慧芽信息科技(苏州)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 智慧芽信息科技(苏州)有限公司 filed Critical 智慧芽信息科技(苏州)有限公司
Publication of WO2020187117A1 publication Critical patent/WO2020187117A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Definitions

  • This specification belongs to the field of Internet technology, and in particular relates to methods and devices for displaying pages with drawings, and methods and devices for displaying text pages.
  • Website platforms usually display the specific content of files that users follow or want to query through web pages for users to query and use.
  • some documents (such as patent documents, etc.) will contain both the content of the drawings and the text.
  • the content of the above drawings and the text are often related to each other.
  • a certain text sentence may be a specific explanation of the content object indicated by a certain image identifier in the content of the drawing.
  • the content of the drawings and the text in the above-mentioned files are mostly separate and independent of each other.
  • the display of the text content corresponding to the drawings in the above-mentioned files is often separated from the display of the described drawings.
  • FIG. 1 For example, as shown in Figure 1, user A has retrieved an interesting patent document on a patent search website. User A wants to browse the drawings of the specification in the patent document first, so as to have an overall understanding of the technical solutions involved in the patent document. Specifically, on the page containing the drawings displayed to user A on the website, only the content contained in the original drawings of the patent document is displayed: the number of the drawings, the content objects in the drawings, and the instructions in the drawings The icon of the content object. At this time, if user A notices the component whose icon is 101 in FIG. 1, and wants to know more about what the component is and what is its use.
  • the user cannot directly obtain other related information of the component except the drawing from the currently displayed drawing page (such as the component’s Name etc.).
  • the user can only use cumbersome operations to jump from the page that currently displays the content of the image to the page that displays the text content of the file, and then search the pages that display the text content verbatim.
  • the related sentence description of the component 101 in order to obtain and understand other related information of the component.
  • the existing display method of drawing page is often not smart enough when displaying the drawing page, and it is impossible to display the text content associated with the drawing in the document text at the same time, which makes it relatively troublesome for users to read and understand. It is inconvenient and the user experience is relatively poor.
  • the purpose of this specification is to provide a method and device for displaying a drawing page, and a method and device for displaying a text page, so as to display the target drawing selected by the user on the drawing page while responding to the user’s comments on the target drawing.
  • the matching text of the triggered image mark is automatically determined and displayed, and the user interacts with the user to facilitate the user to read and understand the content of the file and improve the user's experience.
  • a method for displaying a drawing page includes: displaying a page containing a target drawing in a target file, wherein the target file includes drawing data and text data, and the target drawing includes an image identifier; receiving And in response to the trigger operation for the image identification in the target drawing, the triggered image identification is determined; according to the image data and the text data, the matching of the triggered image identification is displayed on the page Text, wherein the matching text of the triggered image identification includes at least a sentence or phrase used to describe the triggered image identification.
  • a method for displaying a drawing page includes: displaying a page containing a target drawing in a target file, wherein the target file includes drawing data and text data, the drawing data includes an image identifier, and The text data includes a keyword group, the target drawing includes an image identifier; receiving a first type of trigger operation, wherein the first type of trigger operation is used to control the display of the image identifier contained in the target drawing Matching text; in response to the first type of triggering operation, the matching text of the image identification contained in the target drawing is displayed on the page, wherein the matching text of the image identification includes a description of the image identification Sentence or phrase.
  • a method for displaying a text page includes: displaying a page containing target text in a target file, wherein the target file includes image data and text data, and the target text includes keyword groups; receiving and responding to The trigger operation of the keyword group in the target text determines the triggered keyword group; the matching drawing of the triggered keyword group is displayed in the page, wherein the matching drawing of the triggered keyword group At least include the drawings described by the triggered keyword group or the content objects in the drawings.
  • a display device for page with drawings comprising: a display module for displaying a page containing a target drawing in a target file, wherein the target file includes drawing data and text data, and the target drawing contains There is an image identification; a determination module, used to receive and respond to the trigger operation for the image identification in the target drawing, and determine the triggered image identification; a display module, used to determine the triggered image identification according to the image data and the text data , Displaying the matching text of the triggered image identification on the page, wherein the matching text of the triggered image identification includes at least a sentence or phrase for describing the triggered image identification.
  • a display device for drawing pages including: a display module for displaying a page containing target drawings in a target file, wherein the target file includes drawing data and text data, and the drawing data includes Image identification, the text data contains a keyword group, and the target drawing contains an image identification; a determining module is used to receive a first type of trigger operation, wherein the first type of trigger operation is used to control the display of the The matching text of the image identification contained in the target drawing; a display module for responding to the first type of trigger operation to display the matching text of the image identification contained in the target drawing on the page, wherein The matching text of the image identification includes a sentence or phrase used to describe the image identification.
  • a display device for a text page comprising: a display module for displaying a page containing target text in a target file, wherein the target file includes image data and text data, and the target text includes a keyword group Receiving module, used to receive and respond to the trigger operation for the keyword group in the target text, to determine the triggered keyword group; display module, used to display the matching attachment of the triggered keyword group in the page Figure, wherein the matching drawings of the triggered keyword group include at least the drawings described by the triggered keyword group or the content objects in the drawings.
  • This specification provides a method and device for displaying a picture page, and a method and device for displaying a text page.
  • the client device receives and responds to the user's triggering operation on the image mark in the picture page to determine the triggered image mark; Then, according to the acquired drawing data and text data of the target file, the matching text of the triggered image identifier is displayed on the drawing page, so as to perform the operation on the drawing indicated by the image identifier or the content object in the drawing.
  • Description so that users no longer need to search for text sentences related to the image identification of interest in the target drawing in the text of the target file by themselves, and can accurately obtain other information related to the image identification. It is convenient for users to efficiently read and understand the content of the drawings in the drawings page, thereby simplifying user operations and improving user experience.
  • FIG. 1 is a schematic diagram of an embodiment of a display method based on the existing drawing page
  • FIG. 2 is a schematic diagram of an embodiment of displaying a page with drawings for users based on a business system
  • FIG. 3 is a schematic diagram of an embodiment of the flow of a method for displaying a page of drawings provided by the present application;
  • FIG. 4 is a schematic diagram of an embodiment of applying the method for displaying the page of the drawings provided by this application in a scenario example;
  • FIG. 5 is a schematic diagram of an embodiment in which the method for displaying the page of drawings provided by this application is applied in a scene example;
  • FIG. 6 is a schematic diagram of an embodiment in which the method for displaying the page with drawings provided by this application is applied in a scenario example;
  • FIG. 7 is a schematic diagram of an embodiment in which the method for displaying the page of drawings provided by this application is applied in a scenario example;
  • FIG. 8 is a schematic diagram of an embodiment in which the method for displaying the page of the drawings provided by this application is applied in a scenario example;
  • FIG. 9 is a schematic diagram of an embodiment in which the method for displaying the page with drawings provided by this application is applied in a scene example
  • FIG. 10 is a schematic diagram of an embodiment of applying the display method of the page of the drawings provided by this application in a scene example
  • FIG. 11 is a schematic diagram of an embodiment in which the method for displaying the page with drawings provided by this application is applied in a scene example;
  • FIG. 12 is a schematic diagram of the structural composition of the display device on the page of the drawings provided by this application.
  • FIG. 13 is a schematic diagram of the structural composition of a client device provided by this application.
  • FIG. 14 is a schematic diagram of an embodiment of the flow of a method for displaying a page of drawings provided by the present application.
  • FIG. 15 is a schematic diagram of an embodiment in which the method for displaying the page with drawings provided by this application is applied in a scene example;
  • FIG. 16 is a schematic diagram of an embodiment of the flow of the method for displaying text pages provided by the present application.
  • FIG. 17 is a schematic diagram of an embodiment of applying the text page display method provided by this application in a scenario example
  • Figure 18 is a schematic diagram of the structural composition of the text page display device provided by this application.
  • FIG. 19 is a schematic diagram of an embodiment of the flow of the method for displaying the drawing page of the business system provided by the present application.
  • FIG. 20 is a schematic diagram of an embodiment of applying the page display method provided by this application in a scenario example
  • FIG. 21 is a schematic diagram of an embodiment of the flow of a method for displaying a page provided by this application.
  • FIG. 22 is a schematic diagram of an embodiment of applying the page display method provided by this application in a scenario example
  • FIG. 23 is a schematic diagram of an embodiment of applying the page display method provided by this application in a scene example
  • FIG. 24 is a schematic diagram of an embodiment of applying the page display method provided by this application in a scenario example
  • 25 is a schematic diagram of the structural composition of the page display device provided by this application.
  • FIG. 26 is a schematic diagram of an embodiment of the flow of a method for displaying a page provided by the present application.
  • FIG. 27 is a schematic diagram of an embodiment of applying the page display method provided by this application in a scene example
  • Figure 28 is a schematic diagram of an embodiment provided by the present application.
  • Figure 29 is a schematic diagram of an embodiment provided by the present application.
  • Fig. 30 is a schematic diagram of an embodiment provided by the present application.
  • the method for displaying the drawing page provided in this application can be applied to a business system that includes a server (for example, a background processing server of a platform that provides a target file query service for users) and a client device.
  • a server for example, a background processing server of a platform that provides a target file query service for users
  • the user can send the data acquisition request of the file of interest (ie the target file) to the server through the client device to obtain the drawing data and text data of the file, where the file
  • the attached drawing data and text data can be processed in advance by the server (in another embodiment, it can also be processed by the client device), and include the image identifiers in the drawings of the files, and the key in the text.
  • Phrases and related information (such as the identity number and location information of the image identification, and the identity number and location information of the keyword group, etc.) data.
  • the client device can respond to the user's specific operations on the client device based on the above-mentioned drawing data and text data, and display the drawings that the user is interested in and concerned about (that is, the target drawing) on the drawing page, and can also According to the user's needs, the matching text of the image identification in the drawing can be displayed, so that the entire drawing indicated by the image identification in the drawing, or a certain content object in the drawing, such as parts or modules in the drawing, etc. , And make corresponding descriptions and instructions on the page of the drawings.
  • the above-mentioned server may specifically be a background server that is applied to the side of the website platform and can implement functions such as data transmission and data processing.
  • the server may be an electronic device with data calculation, storage functions, and network interaction functions; it may also be a software program running in the electronic device to provide support for data processing, storage, and network interaction.
  • the number of servers is not specifically limited.
  • the foregoing server may specifically be one server, or may include multiple servers, or a server cluster formed by several servers.
  • the aforementioned client device may specifically be a front-end device that is applied to the user side and can implement functions such as data collection, data transmission, and data processing.
  • the aforementioned client device may be, for example, a desktop computer, a tablet computer, a notebook computer, a smart phone, a digital assistant, a smart wearable device, a shopping guide terminal, a TV with a network access function, and the like.
  • the client device may also be a software application capable of running in the above electronic device. For example, it can be an APP running on a smart phone.
  • This application provides a method for displaying a page with drawings.
  • the method can be applied to the client device side in the above-mentioned business system.
  • the specific implementation may include the following content.
  • S301 Display a page containing a target drawing in a target file, where the target file includes drawing data and text data, and the target drawing includes an image identifier.
  • the above-mentioned target file can be specifically understood as a kind of file data that the user pays attention to and wants to consult.
  • the aforementioned target file may be file data containing two different types of data: text data and picture data.
  • the aforementioned target document may be a patent document.
  • an application file that contains text data such as claims, descriptions, and drawings of the description, abstract drawings, etc. at the same time. It can also be a trademark file. It can also be a thesis file or contract file with drawings.
  • the target file may also be other types of file data containing text data and image data in addition to the file types listed above. This manual does not limit the specific types and contents of the above target files.
  • the above-mentioned target drawing may specifically be multiple drawings included in the drawing data of the target file, or may be one or several of the multiple drawings included in the drawing data of the target file.
  • Pictures selected by the user For example, the user can individually select Figure 2 in the drawings of the specification in the patent document as the target drawing, or make no selection, and default to all drawings included in the drawings of the specification, for example, Figure 1, Figure 2 and the attached drawings.
  • Figure 3 as the target drawing.
  • the aforementioned target drawing usually also contains one or more image marks. You can refer to Figure 1.
  • the "100”, “101” and “102” in the figure can be understood as the image marks in Figure 1.
  • the above-mentioned image identification can be specifically understood as a kind of identification information in the figure data used to indicate the whole figure or a certain content object in the figure.
  • the above-mentioned image identification may be identification information used to indicate the overall serial number of the drawing.
  • a number such as "101”
  • a letter such as "A”
  • a combination of letters and numbers such as "A101" and so on.
  • the above-mentioned image marks are only for better describing the embodiments of this specification.
  • the above-mentioned image identification may also be other types of identification information.
  • it may also be a combination of numbers and words that are not connected by a characteristic line, such as "module 21", etc., located in a box in the drawing. In this regard, this manual is not limited.
  • the client device may obtain the drawing data and text data of the target file in advance.
  • the above-mentioned drawing data and text data may specifically be obtained after pre-processing, including drawing data for which image identifiers have been identified and determined, and text data for which keyword groups have been identified and determined.
  • the above keyword group may specifically refer to the text of the target file used to describe or indicate the entire drawing in the target file, and/or the character combination of the content objects contained in the drawing.
  • each keyword group in the text data matches and corresponds to an image identifier in the drawing data, and the corresponding keyword group matches with each other and the entire drawing indicated by the image identifier or the content object in the drawing is the same.
  • the above keyword group may be a character combination of Chinese numbers and nouns in the text data, such as "101 component” or “component 101". It can also be a combination of letters and nouns, such as "A interface” or “interface A”. It can also be a combination of a number and a character that characterizes the figure, such as " Figure 1" or "fig1" and so on.
  • a text sentence containing multiple phrases can also be used as a keyword group.
  • a sentence describing the entire drawing of a certain drawing in the description of the target file may be used as a keyword group. For example, "FIG.
  • FIG. 1 is a schematic diagram of an embodiment of a display method based on an existing drawing page" can be used as a keyword group corresponding to the image identifier "FIG. 1".
  • the keyword groups listed above are only for better describing the embodiments of this specification.
  • other types of character combinations can also be introduced as keyword groups. In this regard, this manual is not limited.
  • the client device may first receive the user's display operation for the target drawing in the target file; in response to the display operation, determine the drawing to be displayed in the user selection as the target drawing, And according to the drawing data and text data of the target file, a page containing the target drawing is generated and displayed to the user.
  • the image identification of the target drawing will also be displayed.
  • S303 Receive and respond to the trigger operation on the image identifier in the target drawing, and determine the triggered image identifier.
  • the above-mentioned trigger operation for the image mark can also be understood as an operation of selecting the image mark.
  • the user when the user is interested in a certain image mark in the displayed picture page, he can select the image mark he is interested in with his finger or mouse to initiate a trigger operation for the image mark in the target picture. .
  • the foregoing trigger operation for the image identification in the target drawing may specifically include at least one of the following: a pressing operation, a selection operation, a click operation, and so on.
  • a pressing operation a selection operation
  • a click operation a click operation
  • the above-mentioned triggered image identification can be specifically understood as the image identification selected by the user who is relatively concerned and interested and wants to further understand related text information.
  • the foregoing receiving and responding to the trigger operation for the image identification in the target drawing determines the triggered image identification.
  • it may include the following content: receiving a preset trigger operation, where all The preset trigger operation includes at least one of the following: a pressing operation, a selection operation, and a click operation; determining the receiving position where the preset trigger operation is received; and setting the distance from the receiving position to an image that is less than or equal to the preset trigger distance threshold
  • the identification is determined as the triggered image identification.
  • the image identifier whose distance from the receiving position is less than or equal to the preset trigger distance threshold can be specifically understood as the image identifier directly touched by the trigger operation, or the trigger operation corresponding to the above trigger operation The receiving location is closer to the image logo, etc.
  • the method for determining the triggered image identifier listed above is only a schematic illustration. During specific implementation, according to specific circumstances, other suitable methods may also be used to determine the above-mentioned triggered image identifier.
  • the input box of the image identification to be triggered can also be displayed to the user through the client device. The user can input the identification information corresponding to the image identification to be triggered through the input box, thereby initiating a trigger operation for the image identification and so on. In this regard, this manual is not limited.
  • the user is interested in the parts indicated by the image mark "101", and he can click on the image mark "101" in Fig. 1 in the current drawing page to initiate the targeting of the image mark.
  • Trigger action The client device may receive and respond to the trigger operation, and determine the image identifier "101" as the triggered image identifier.
  • S305 Display the matching text of the triggered image identification on the page, where the matching text of the triggered image identification includes at least a sentence or phrase for describing the triggered image identification.
  • the matching text of the above-mentioned triggered image identifier can be specifically understood as a kind of text content used to describe or explain the drawing indicated by the triggered image identifier or the content object in the drawing.
  • the matching text of the aforementioned triggered image identification may at least include sentences or phrases used to describe the triggered image identification.
  • the matching text listed above is only a schematic description.
  • the above-mentioned matched text may also include other text content related to the triggered image identifier. In this regard, this manual is not limited.
  • the client device can first determine the keyword group matching the triggered image identifier according to the drawing data and text data of the target file, as the associated keyword group; then according to the text data, It is determined that the text data contains the text data of the related keyword group, and then the matching text can be determined according to the text data containing the related keyword group.
  • the drawing data of the above target file specifically includes the image identification and the related information of the image identification, for example, the position information used to characterize the specific position of the image identification in the drawing data, and the image identification. The corresponding identity number and so on.
  • the text data of the above-mentioned target file specifically contains keyword groups and related information about the keyword groups, for example, the position information used to characterize the specific positions of the keyword groups in the drawing data, and the identity serial numbers corresponding to the keyword groups and many more.
  • the identity serial number of the aforementioned image identification corresponds to the image identification one-to-one, that is, each image identification corresponds to an identification serial number.
  • the identity serial number of each keyword group corresponds to the keyword group one-to-one, that is, each keyword group corresponds to an identity serial number.
  • the identification numbers used to describe or indicate the same figure, or the keyword group and the image identifier of the same content object in the same figure are exactly the same, or similar, only differ by one or two characters.
  • the identity serial number of the triggered image identification can be determined first according to the drawing data of the target file; and then the identity serial number of the triggered image identification can be used to compare the text data of the target file.
  • the included keyword group is searched, and the keyword group whose identity serial number matches the identity serial number of the triggered image identifier is retrieved as the above-mentioned associated keyword group.
  • a sentence containing the associated keyword group can be determined from the text data of the target file according to the location information of the associated keyword group. Furthermore, the corresponding matching text can be generated based on the above-mentioned sentence including the related keyword group.
  • the above-mentioned associated keyword group itself can be directly used as a matching text. It is also possible to use the entire sentence of the associated keyword group in the text data as the matching text. It is also possible to input more complex sentences containing related keyword groups into a pre-trained semantic recognition model to obtain a more refined and general semantic recognition result, and then the above semantic recognition result can be used as a matching text.
  • the semantic recognition model may specifically obtain a text sentence including a keyword group as sample data in advance; then label the semantics related to the keyword group in the sample data to obtain the labeled sample data; use the labeled sample data The sample data is used to train the neural network to establish the semantic recognition model mentioned above.
  • the client device can display the matching text corresponding to the triggered image identifier in the drawing page currently displayed to the user, so that the user can conveniently Learn about the drawing indicated by the image identifier or the related information of the content object in the drawing in the current page.
  • the matching text of the triggered image identification is displayed on the page.
  • the matching text of the triggered image identification may be displayed at a preset display position on the page. text.
  • the aforementioned preset display position may specifically be an area adjacent to the triggered image mark on the drawing page.
  • the client device may display the matching text corresponding to the triggered image identifier at a preset display position close to the triggered image identifier on the drawing page.
  • the client device may display the corresponding matching content "component 101 is a connector" and so on at the lower left position of the triggered image identifier "101".
  • the matching text of the triggered image identifier is displayed on the page.
  • it may also include the following content:
  • the triggered image is displayed in the form of a floating frame
  • the matching text of the identifier is displayed on the page.
  • the above-mentioned floating frame displaying the matching text of the triggered image identification can be dragged to any position according to the customer's reading habits or preferences. Furthermore, when the matching text of the triggered image mark is displayed in the form of a floating box and the matching text of other image marks or other image marks is blocked, the user can drag the floating box to display the triggering The floating frame of the matching text of the image logo is moved to a position that will not block other image logos or the matching text of other image logos for display.
  • the triggered image logo can also be displayed in the preset display position on your page in the form of a floating frame
  • the matching text Specifically, as shown in FIG. 5, the corresponding matching text: "Part 101 is a connector" can be displayed at the lower left position of the image logo "101" in the form of a floating frame.
  • the position information of the associated keyword group used to generate the matched text in the text data of the target file may also be displayed.
  • the related keyword group will also be displayed.
  • the user wants to further read and research the text content related to the component 101 recorded in the text of the target file, he can use the position information displayed in the above floating box as an index, and accurately in the text content of the target file Find the text content related to the component 101 in the 5th line of page 2 of the manual, which can facilitate the user to make targeted queries and read the text of the target file when reading the drawings of the target file.
  • the above-mentioned location information may also be set as a trigger link associated with the text data of the target file.
  • a trigger link associated with the text data of the target file.
  • the client device can then respond to the above-mentioned trigger instruction, and according to the location information corresponding to the triggered trigger link, and according to the text data of the target file, retrieve and determine the above-mentioned position in the text "the second page of the manual, the fifth line" Text content; then pop up a dialog box or jump to the text page to show the user the specific text content at the location, so that the user can more easily read and understand the content indicated by the image mark in the text data Information about the object.
  • the above-mentioned location information can also be associated with the corresponding image identifier, that is, the image identifier itself can also be set as a trigger link.
  • the location information of the above keyword group may not be displayed on the page.
  • the client device can identify the location information of the keyword group corresponding to the trigger link according to the triggered image, find that the text data of the target file contains the text content of the associated keyword group, and then pop up a dialog box, etc. Show the user the specific text content mentioned above.
  • the target drawing to be displayed by the client device is relatively complicated, for example, the graphic structure of the target drawing itself is relatively complicated, or the number of image identifiers contained in the target drawing is relatively large.
  • the distribution is relatively dense, and the page area displayed to users is often limited.
  • the matching text is displayed directly in the vicinity of the triggered image logo in the target drawing in the above page, the displayed page will be relatively confusing, and there will even be matching text of some image logos in similar positions.
  • the phenomenon of mutual covering affects the user's reading and use experience.
  • the client device when it displays the matching text of the triggered image identifier on the above-mentioned drawing page, it can first detect the position of the triggered image identifier in the drawing and the center point of the drawing. The position relationship can then be used to determine the display area for displaying the matching text of the image mark according to the position relationship between the position of the triggered image mark in the drawing and the center point of the drawing, that is, the preset display position. The corresponding matching text is displayed in the preset display position, so as to avoid the problem that the matching text with other image identifiers or other image identifiers and other image information are mutually obstructed, which affects the user's reading.
  • the client device can determine the proximity of the triggered image identifier (that is, to the triggered image identifier) according to the positional relationship between the position of the triggered image identifier in the drawing and the center point of the drawing.
  • the distance is smaller than the first distance threshold), and the area position along the direction deviating from the center point of the figure is used as the preset display position to display the matched text of the triggered image identification.
  • the position coordinates of the image identifier "100" in the drawing and the position coordinates of the center point of the drawing may be determined first; Then according to the position coordinates of the image mark "100" in the drawing and the position coordinates of the center point of the drawing, the positional relationship between the image mark and the center point of the drawing is determined as: located above the center point of the drawing In the left direction.
  • the preset display position of the matching text of the image mark is close to the image mark, but a position in the direction deviating from the center point of the figure (that is, the upper left of the image mark) is used as the preset The display position, and then the matching text of the image identifier "100" can be displayed in the preset display position.
  • the preset display positions for displaying the matching text of the other image identifiers "102" and "101" in the figure can also be determined respectively, and the corresponding preset display positions are respectively triggered after the image identifiers are triggered.
  • the respective matching text is displayed at the display position.
  • the matching text of different image logos in the same drawing can be dispersed as much as possible when displayed, avoiding mutual occlusion, so that users can read the matching text of image logos more conveniently and clearly, and it will not affect the user's attachment The reading of the figure itself.
  • the matching text in order to further avoid display confusion, for example, because the matching text contains a large number of characters, even if the matching text is displayed at the above-mentioned preset position, it may cause the matching texts of different image identifications to appear between each other. Occlusion, a problem that affects users' reading.
  • the aforementioned preset length specifically refers to the number of characters originally contained in the matched text.
  • the first character in the corresponding matching text may be displayed at the preset display position of the triggered image mark; and then after receiving the user's trigger operation for the first character in the matching text; Then, in response to the trigger operation, the complete matching text of the image logo including the first character is displayed.
  • the image identification displayed on the page of the above drawings and the matching text of the image identification may be in a form relatively independent of the main structure of the above page (for example, a dialog box that can be dragged and separated, etc.)
  • What is displayed on the page, that is, the image logo in the above page, and the display position and display angle of the matching text of the image logo can also be not fixed, not fixed to the main structure of the page or the drawings in the page, but can Drag and adjust accordingly according to the user's specific instructions or operations.
  • the matching text of the triggered image mark will be displayed at the preset display position of the triggered image mark by default.
  • the user can first select the matching text currently displayed at the preset display position, and drag the matching text to other areas on the page where they feel suitable for display.
  • the matching text of the image identification "101" can be dragged from the lower left corner of the page to the lower right corner for display.
  • the user can also select the matching text in a similar manner, and then perform a 360-degree rotation adjustment on the matching text through a specific rotation trigger operation, so that it can finally be rotated to a suitable user Display after rotating the angle.
  • the user wants to rotate the target drawing by 90 degrees according to his own reading habits to read and browse.
  • the user can rotate the figure by 90 degrees to display the figure by rotating the figure, but because the matching text of the triggered image logo in the figure is displayed in a relatively independent form, although it is attached
  • the figure is rotated, but the matching text identified by the triggered image in the figure can remain unrotated, so as to avoid the inconvenience to the user's reading due to the rotation of the accompanying figure of the matching text.
  • the target drawing, the image identifier, and the matching text of the image identifier in the drawing page may be relatively independent.
  • the client device can also receive a drawing rotation instruction from the user; according to the drawing rotation instruction, the target drawing in the drawing page is rotated by a specified angle, so that the user can better read the content in the target drawing. And in the process of rotating the target drawing, the client device may keep the image identification in the drawing page, and/or the matching text of the image identification does not rotate with the target drawing.
  • the client device can also rotate the image identification and/or matching text of the image identification together with the target drawing in the process of rotating the target drawing, so that the image identification and/or image identification
  • the display direction of the matching text is consistent with the rotated target drawing; further, according to the default command or the character direction correction command issued by the user, the image in the drawing page and/or the matching text of the image can be separately identified Rotation correction is performed so that the corrected image logo and/or the matching text of the image logo can be displayed on the drawing page in a display direction suitable for normal reading by the user.
  • the figure page can also display a figure rotation control icon.
  • the client device can receive and rotate the target drawing 90 degrees clockwise according to the above-mentioned figure rotation instruction, while keeping the matching text of the image logo in the figure page unchanged in the original display direction, which is convenient for the user to finish the rotation. After the target drawing, you can continue to read the matching text of the image mark in the drawing page normally.
  • FIG. 28 In the example shown in FIG.
  • the display direction of the target drawing and the original icon in the target drawing (smaller numeric characters, for example, 230) in the target drawing after the rotation is clockwise It is rotated by 90 degrees, but the display direction of the matching text (larger character combination in the figure, for example, 230.legs) displayed on the target drawing does not change with the target drawing.
  • the user can customize the rotation direction and specify the angle when issuing the drawing rotation instruction. Furthermore, the client device can rotate the target drawing by a specified angle along the rotation direction customized by the user according to the user's drawing rotation instruction, so that the target drawing can be displayed on the drawing page to meet the user's display direction.
  • the client device may first count the number of characters contained in the matching text to be displayed. If the matching text contains a large number of characters, For example, if the number of characters is greater than the first threshold of 100, in order to reduce the occurrence of mutual occlusion, a relatively small character size can be automatically selected from a variety of preset character sizes to display the matching text as the default character size. This can also reduce the impact of occlusion on user reading.
  • the size of the characters in the matching text of the triggered image identifier displayed on the page can be adjusted by the user.
  • the client device may first determine a suitable character size as the default size to display the matching text of the triggered image identifier in the above manner. After the matching text is displayed, the user can choose according to his own situation. For example, a character in a matching text displayed on the page is too small and not clear enough. If the user wants to recognize the character, the user can first select the matching text. Then, the size of the characters in the matching text is adjusted by stretching or shrinking the selected matching text, so as to better meet the personalized requirements of different users, and display the matching text of the triggered image identification.
  • the corresponding number of matching texts of the triggered image identifiers displayed may also be relatively large, resulting in a lot of It is easy to cause confusion in the correspondence between the image mark and the matching text when the user reads.
  • the first image identification and the second image identification are relatively close, it may happen that the user mistakes the matching text of the second image identification as the matching text of the first image identification.
  • the first triggered image identification and the matching text of the image identification can be displayed in red, and the second triggered image identification and the image can be displayed in green.
  • the matching text of the identifier can be displayed in red, and different triggered image marks and matching texts of the corresponding image marks are displayed on the page.
  • the client device can also select a suitable layout method from a variety of preset layout methods to display according to the layout characteristics of the drawings in the displayed page, and the distance position relationship between adjacent image marks
  • the matching text of the triggered image identification in this figure.
  • the foregoing preset typesetting methods may specifically include vertical typesetting and horizontal typesetting.
  • the preset typesetting methods listed above are only to better illustrate the embodiments of this specification.
  • other types of typesetting methods may also be introduced according to specific scene requirements. In this regard, this manual is not limited.
  • the user considering that when the user reads the target text and the target drawing, it is for special needs such as comparison with other file content.
  • the user can also compare the target drawing according to actual needs.
  • the matching text is customized and edited to the content desired by the user, which is convenient for the user to remember and understand; however, the matching text in other parts of the target file can be changed or not changed accordingly.
  • the client device detects that the distance between a certain image identifier and the horizontally adjacent image identifier (for example, the image identifier adjacent to the left and the image identifier adjacent to the right) is relatively small, while the image adjacent to the vertical
  • the distance between the logo such as the image logo adjacent to the top and the image logo adjacent to the bottom
  • you can choose vertical typesetting As the default typesetting method, the matching text of the image logo is displayed.
  • the horizontal typesetting method can be selected as the default typesetting method. Display matching text.
  • the matching text can also be received and displayed according to the typesetting method selected by the user, so as to meet the personalized requirements of different users and further improve the user experience.
  • the control icon can also be preset on the displayed page. For example, "Matching Text Display” in the upper right corner of the page in Figure 9.
  • the user can control whether the matching text of the image logo is displayed on the displayed page according to their own needs through the operation settings of the control icon. For example, in Figure 9, if the user currently only wants to read the target drawing, and feels that the displayed page displays too much matching text, which interferes with his own reading of the drawing itself, he can turn the "matched text display” to the right Toggle to the "off” position, then the displayed matching text of the triggered image mark will be hidden on the above page, so that the user can clearly read the picture information of the target picture.
  • the client device may also place a blank area (or other It does not affect the user to read the target drawing area) to generate and display a data list.
  • the data list may specifically include each image identifier contained in the target drawing, and matching text corresponding to each image identifier.
  • the above-mentioned data list may also be associated with the image identifiers in the target drawings. Specifically, when no image identification is triggered, the image identification and the matching text of the image identification in the data list may be displayed in a font with a lighter gray scale or a transparent font.
  • the client device When the user triggers an image identification in the drawing by clicking or other means, the client device will identify the triggered image in the target drawing and the matching text of the triggered image identification in the above data list to distinguish it from the data list
  • the other untriggered image marks in the image mark, as well as the preset display mode of the matching text of the untriggered image mark are displayed in the above data list to remind the user of the target drawing
  • the image that is triggered in identifies the position in the data list. In this way, the user can clearly and accurately identify and determine the triggered image identification and the matching text of the triggered image identification from the data list.
  • the above-mentioned preset display mode may specifically include: displaying in a highlighted manner; displaying in a bold font; displaying in a preset font color; displaying in a floating frame, and so on.
  • display methods other than those listed above, or a combination of multiple display methods can also be used to display the triggered image identifier in the data list, and the triggered image The matching text of the image ID.
  • the triggered image mark 9 in the data list and the matching text of the triggered image mark 9 can be displayed in a highlighted manner.
  • the above-mentioned display modes are only schematic illustrations.
  • the untriggered image identification and the matching text of the untriggered image identification are displayed.
  • the embodiment of this specification does not limit it.
  • the displaying of the matching text of the triggered image identifier on the page may include: in the page, displaying the triggered image in a form independent of the main structure of the page.
  • the matching text of the triggered image identification; correspondingly, the method may specifically further include: receiving and according to a user's operation instruction, moving the displayed matching text of the triggered image identification to a designated position on the page.
  • the above-mentioned designated location can be flexibly set according to the needs of the user.
  • the aforementioned designated location may be a location that conforms to the user's personal browsing and reading habits, or may be a location that does not obstruct or interfere with other information in the drawings.
  • the method may specifically further include: recording the designated position. In this way, if the user subsequently triggers the image mark again, the matching text of the triggered image mark will be preferentially displayed at the above specified position, which can further improve the user experience.
  • the specific implementation of the method may further include: generating and displaying a data list on the page, wherein the data list includes the image identification included in the target drawing and the matching text of the image identification ; Wherein, the data list is associated with the image identification in the target drawing.
  • the displaying of the matching text of the triggered image identifier on the page may further include: displaying the triggered image in a preset display mode in the data list during specific implementation logo, and the matching text of the triggered image logo.
  • the user if the user triggers a certain image mark and/or the matching text of a certain image mark in the data list by a trigger such as clicking, it can also be displayed in a preset display mode in the target drawing The triggered image identification.
  • the preset display mode may specifically include at least one of the following: display in a highlighted manner; display in a bold font; display in a preset font color; display in a floating frame ; Display in italics, etc.
  • the method may further include: receiving and responding during the specific implementation
  • the user's save operation saves the target drawing page displaying the matching text of the triggered image identifier. In this way, it is convenient for the user to save and download the above-mentioned target drawing showing the matching text for subsequent use by the user, which further improves the user experience.
  • the display method of the drawing page receives and responds to the user's triggering operation on the image mark in the drawing page through the client device, and determines the triggered image mark; and then according to the obtained target file Figure data and text data.
  • the matching text of the triggered image identifier is displayed on the figure page to describe the figure indicated by the image identifier or the content object in the figure, so that the user no longer needs it.
  • the image identification may specifically include: identification information for indicating the serial number of the drawing, and/or identification information for indicating the content object in the drawing, and so on.
  • the target document may specifically include at least one of the following: patent documents, dissertation documents, contract documents, trademark documents, and so on.
  • patent documents may specifically include at least one of the following: patent documents, dissertation documents, contract documents, trademark documents, and so on.
  • the target files listed above are only to better illustrate the embodiments of this specification.
  • other types of files can also be introduced as the aforementioned target files. In this regard, this manual is not limited.
  • the method may further include the following content during specific implementation:
  • S1 Acquire the drawing data and text data of the target file, where the drawing data includes an image identifier, the text data includes a keyword group, and the keyword group is used to indicate the drawing in the text data, and/ Or, the character combination of the content object in the drawing;
  • the drawing data and text data of the target file acquired by the client device may specifically be processed by the client device in advance on the drawing data and text data of the target file to identify and determine The data obtained after the image identification in the drawing data and the keyword group in the text data are obtained; it can also be that the server performs corresponding processing on the drawing data and text data of the target file in advance to identify and determine the image identification in the drawing data. And the keyword group in the text data before sending to the client device.
  • the foregoing receiving and responding to the trigger operation for the image identification in the target drawing determines the triggered image identification.
  • it may include the following content: receiving a preset trigger operation, where the The preset trigger operation includes at least one of the following: a pressing operation, a selection operation, and a click operation; determining the receiving position where the preset trigger operation is received; identifying the image whose distance from the receiving position is less than or equal to the preset trigger distance threshold Determined as the triggered image identifier.
  • the above-mentioned method of determining the triggered image identifier in the target drawing is only a schematic illustration. During specific implementation, according to specific circumstances, other suitable methods may also be used to determine the triggered image identifier in the target drawing. In this regard, this manual is not limited.
  • the matching text of the triggered image identification is displayed on the page according to the drawing data and the text data.
  • the following content may be included: according to the target file To determine the identity serial number of the triggered image ID; from the keyword group contained in the text data of the target file, retrieve the identity serial number that matches the identity serial number of the triggered image ID The keyword group is used as the associated keyword group; according to the associated keyword group, the matching text of the triggered image identification is determined; at the preset display position in the page, the information of the triggered image identification is displayed Match text.
  • the implementations listed above are only schematic descriptions. During specific implementations, other suitable methods can also be used according to specific conditions to display the data on the page based on the drawing data and the text data.
  • the matching text of the triggered image ID In this regard, this manual is not limited.
  • the preset display position may be determined in the following manner: determine the position of the triggered image mark in the target drawing, and the position of the center point of the target drawing; according to the triggered image mark The position in the target drawing and the position of the center point of the target drawing determine the preset display position, wherein the preset display position includes a direction away from the center point of the target drawing, and The distance of the triggered image identifier is less than or equal to the position of the preset first distance threshold.
  • the above-mentioned method for determining the preset display position is only a schematic illustration. During specific implementation, other suitable methods may also be used to determine the preset display position according to specific conditions. In this regard, this manual is not limited.
  • the matching text of the triggered image identifier is displayed at the preset display position on the page.
  • it may also include the following content: at the preset display position on the page , Displaying characters of less than a preset length in the matching text of the triggered image identification; receiving a trigger operation for characters less than the preset length in the matching text of the triggered image identification; responding to the The trigger operation displays the matching text of the triggered image mark.
  • the above-mentioned displaying the matching text of the triggered image identification may further include: displaying the matching text of the triggered image identification through a preset display mode, wherein the preset The display mode includes at least one of the following: display in a highlighted manner; display in a bold font; display in a preset font color; display in a floating frame; display in an italic font; display in a font size The way it gets bigger is displayed and so on.
  • the above-mentioned manners of displaying the matching text of the triggered image identification are only for better describing the embodiments of this specification.
  • other display methods other than the display methods listed above may be introduced to display the matched text of the triggered image identifier. In this regard, this manual is not limited.
  • the client device or server may perform corresponding processing on the image data and text data in the target file in advance, so as to identify and determine the image identifier (and the identity including the image identifier) in the image data.
  • Related information such as serial number and location information), and keyword groups in the text data (and related information including the identity serial number and location information of the keyword groups).
  • the client device can quickly respond to the user's trigger operation according to the pre-processed picture data and text data of the target file in the above manner, determine the triggered image identifier, and display the matching text of the triggered image identifier .
  • the image identifier contained in the drawing data may be specifically determined in the following manner:
  • S2 Perform image data recognition on the first range area in the drawing data including the preset page header, and use the recognized first character group as the image identifier, wherein the first character group includes: Combinations of numbers and characteristic characters of the representation map;
  • S4 Perform image data recognition on the second range area in the drawing data including the preset page header, and use the recognized second character group as the image identifier, wherein the second character group includes: A combination of numbers and/or letters whose distance from the feature line in the drawing data containing the preset page header is less than the preset second distance threshold, the feature line includes: a straight line with an arrow, and/ Or, a curve with arrows.
  • the above-mentioned preset page header can be specifically understood as a page header containing "description drawings” or similar semantic characters.
  • the drawing data that contains the above-mentioned preset page header often contains the drawing data that constitutes various specific drawings in the patent document, and there are often image marks in the drawings. Therefore, in the specific implementation, you can first filter out the drawing data containing the preset page header from the drawing data of the target file according to the above preset page header, so that the subsequent specific processing of this part of the drawing data can be identified and determined. Out the included image logo.
  • the format features of the figure part of the patent document are further taken into consideration: that is, it will usually be included in each appendix.
  • the outer lower side of the figure (that is, the first range area), and the range area adjacent to the figure is provided with identification information (that is, the first type of image) that is used to indicate the serial number (or number) of the entire figure.
  • Identification such as caption
  • the above identification information is often a combination of numbers and characters that characterize the figure (such as "picture” or "fig", etc.) (corresponding to the above-mentioned first character group). For example, in FIG. 1, the identification information "FIG.
  • the identification information of the content object consisting of characters (that is, the second type of image identification)
  • the above identification information is often composed of individual numbers or individual letters, Chinese characters, or one or two of numbers, letters and Chinese characters.
  • a combination of three characters corresponding to the above-mentioned second character group, for example, part label).
  • the corresponding image data recognition processing can be performed on the different types of image identifiers in the drawing data in a targeted manner.
  • the aforementioned image data recognition processing may specifically be an image data recognition processing based on OCR (Optical Character Recognition). Of course, it can also be based on other principles, such as image data recognition processing based on other image features. During specific implementation, other suitable methods may be used to perform image data recognition processing according to specific conditions. The following mainly takes the image data recognition processing through optical character recognition as an example to specifically explain how to identify and determine the image identifier in the drawing data.
  • OCR Optical Character Recognition
  • the above-mentioned first range area can be specifically understood as an adjacent area on the lower side of the outside of the figure.
  • the figure data that meets the first range area can be determined from the figure data containing the preset page header; and then optical character recognition is performed on this area.
  • it can be targeted and efficiently recognized the combination of the figure number character combination used to indicate the figure that contains the combination of the number and the characteristic character of the characterization figure, that is, the first character group, as one of the figure data Image identification (for example, the first image identification).
  • the above-mentioned second range area can be specifically understood as the inside of the drawing, which is usually connected or adjacent to the characteristic line in the drawing (that is, the distance is less than the preset second distance threshold, for example, 5 mm) Area.
  • the aforementioned characteristic line is often connected with a specific content object (that is, the indicated content object) in the drawings.
  • a specific content object that is, the indicated content object
  • the specific form of the above-mentioned characteristic line may be a straight line or a curve with an arrow, or a connecting line without an arrow, and so on.
  • the first range area is determined and the first character group is recognized in the first range area
  • the data area of the specific content of the indicated drawing is the above-mentioned second range area.
  • optical character recognition can be performed on the above-mentioned second range area, so as to identify the combination of numbers and/or letters used to indicate content objects in the drawings in a targeted and efficient manner, that is, the second character group, as an additional Another type of image identification in the image data (for example, the second type of image identification).
  • the drawings may be typeset horizontally or vertically, and the different typesetting of the drawings will cause the above-mentioned first character group
  • the arrangement angle of the second character in the drawing For example, when the size of the drawing is large and is typeset horizontally, the arrangement angles of the first character group and the second character group in the drawing will be 90 degrees different from that in the case of vertical typesetting in the ordinary drawing.
  • the arrangement angle of the text character relative to the figure can be detected and determined, and then according to the arrangement angle of the text character, according to the corresponding Characters are recognized from the angle of, so as to accurately recognize the characters, thereby improving the accuracy of identifying and determining image marks.
  • the identified image identification (including the first character group and the second character group) can also be combined with the keyword group identified from the text data. Contrast verification to reduce the recognition error of the image mark.
  • the drawing data containing the preset page header may also be directly processed. Full range search.
  • the information related to the preset image identifier for example, the template image identifier
  • the second character group determines the image identifier in the drawing data.
  • the above-mentioned method of identifying and determining the image identifier in the drawing data is only a schematic illustration.
  • other suitable methods may also be used to perform the identification and determination processing of the image identification.
  • a pre-trained image identification recognition model based on a neural network may also be used to perform image data identification processing on the above-mentioned drawing data to determine the image identification contained in the image data.
  • this manual is not limited.
  • image data recognition processing can be performed on the drawing data by using a variety of different optical character recognition methods to improve the determined image The accuracy of the logo.
  • the client device or server can also generate the corresponding image identification serial number according to the image identification, and combine the identification serial number of the image identification Saved in the drawing data.
  • the above-mentioned identity serial number can also be understood as a kind of ID information, and the above-mentioned identity serial number has a one-to-one correspondence with the image identification. That is, an image identification in the drawing can be determined through an identity serial number. Similarly, an identity serial number can be identified and determined through an image mark.
  • the above-mentioned identity serial number may specifically be a character string generated according to a preset rule.
  • an identity serial number corresponding to the image identification can be generated according to characters such as numbers, letters, or Chinese characters included in the image identification.
  • the identity serial number of the image identification "101” can be expressed as: 0101. In this way, in the subsequent specific processing, the corresponding image identification "101" can be accurately locked through the identity serial number.
  • the corresponding identity serial number can also be generated according to the numbers and/or letters contained in the image identification, and the type characteristics of the image identification (for example, whether it belongs to the first type of image identification or the second type of image identification).
  • the identity serial number of the image identification "101” can be expressed as: label2_101.
  • the first half "label2" of the identity serial number can be used to indicate that the image identifier corresponding to the identity serial number is the second type of image identity (corresponding to the second character group), and the second half "00101" can be used to indicate the corresponding The number contained in the image logo is "101".
  • the identity serial number of the image identification "Figure 1" can be expressed as: label1_f1.
  • label1 can be used to indicate that the image identifier corresponding to the identity serial number is the first type of image identity (corresponding to the first character group), and "001" can be used to indicate that the number contained in the image identity corresponding to the identity serial number is "1” and "f” can be used to indicate the character "fig” contained in the image logo.
  • an identity serial number that can more accurately characterize the indicated image identification and distinguish it from other similar image identifications can be determined.
  • the above-mentioned method of generating the identity serial number corresponding to the image identifier is only a schematic illustration. During specific implementation, according to specific circumstances, the identity serial number corresponding to the image identification can also be generated according to other suitable rules. In this regard, this manual is not limited.
  • the server or the client device may also determine the location information of the image identifier.
  • the position information of the above image identifier can be specifically understood as a kind of address information used to indicate the position of the image identifier in the drawing in the target file, which can be recorded as pos.
  • the above-mentioned position information may specifically be a kind of position coordinates, and the specific position of the image mark in the drawing can be accurately determined through the position information.
  • the identity serial number and location information of the image logo determined by the above identification are saved in the drawing data.
  • the corresponding relationship between the image identification determined by the above identification and its identification number and location information is established in the drawing data.
  • the drawing data processing of the target file is completed, and the processed drawing data containing the image identification is obtained.
  • the specific position of the image identifier in the image data can be determined based on the above-mentioned figure data, according to the identity number of the image identifier, and also according to the receiving position of the trigger received by the client device and the position information of the image identifier Determine the image identification triggered.
  • determining the keyword group in the text data may include the following content in specific implementation:
  • S3 Filter a text area that meets the preset requirements from the multiple text areas as a third range area, where the third range area at least includes a text area whose subsection title is description of drawings or BRIFE DESCRIPTION OF DRAWINGS ;
  • S4 Determine the keyword group in the text data according to the third range area in the text data.
  • the text content of a patent document often complies with corresponding format requirements when drafting.
  • the text content in a patent document is required to be divided into multiple different text blocks (or text areas) when writing.
  • a section title for the text area is set to prompt the text content in the text area.
  • the title of the above section also meets the requirements of the preset format. For example, the characters in the section title are bold, or the font size of the characters in the section title is larger than the characters in normal text.
  • the first text area appears at the beginning of the specification part of the patent document, and the section title "Technical Field” is set at the beginning of the area by using a bold font.
  • the section title "Background” will be set by using a font size 4 (that is, a font slightly larger than ordinary text characters) technology”.
  • a font size 4 that is, a font slightly larger than ordinary text characters
  • the text data of the target file can also be processed through a preset file parser (for example, a preset description parser) to determine the text data corresponding to each text area in the text data of the target file. Scope area.
  • the text data of the target file can be divided into multiple content parts (ie multiple text regions) through the above-mentioned preset file parser, wherein the multiple content parts respectively carry the above-mentioned requirements for meeting the preset format The title of the section; identify and determine the corresponding name or type of the content part in the target file based on the section title carried by each content part; then determine the name of each content part (namely the name of the text area) according to the above name or type ).
  • the method may further include the following content: processing the text data using a preset area division model to determine a third range area from the text data, where The preset area division model is obtained by pre-training the neural network with sample text data.
  • the preset area division model is obtained by pre-training the neural network with sample text data.
  • the sample data used for training the preset region division model may specifically be text data of a file in which each text area is manually divided and each text area is named according to a uniform rule.
  • the above-mentioned third range area can be specifically understood as a range area in the text data that has a greater probability of containing the keyword group related to the image identifier in the drawing.
  • the subsection title in the text content is "Explanation of drawings” (for Chinese patent files) or “BRIFE DESCRIPTION OF DRAWINGS” (for English patent files)
  • the text area will be a more concentrated and comprehensive text description for the drawings and the image identifiers in the drawings. Therefore, in order to make the process of identifying and determining keyword groups more targeted, avoid text data processing on all text data, reduce the amount of data processing, and improve processing efficiency.
  • a text area that meets the preset requirements can be filtered out of the multiple text areas as the third range area, for example,
  • the subsection title is the description of the drawings or BRIFE DESCRIPTION OF DRAWINGS and other text areas that may have a relatively concentrated or relatively high probability of keyword groups are screened out as the third range area, and then only the third range area in the text data is targeted Text data processing to efficiently identify and determine the keyword groups in the text data.
  • the third range area listed above may specifically include a text area with a subsection titled Description of Drawings or BRIFE DESCRIPTION OF DRAWINGS.
  • other text areas that meet the requirements can also be introduced as the above-mentioned third range area according to specific circumstances. In this regard, this manual is not limited.
  • a sentence containing multiple phrases can also be used as a keyword group.
  • Figure 1 is a schematic diagram of the structure of the XX system provided in this specification
  • Figure 2 is a schematic flow diagram of the XX method provided in this specification.
  • keyword groups corresponding to the second type of image logo will appear in this type of text area.
  • the following text sentence "101 in the figure is a connector” or "101 connector” and so on will also appear in the text area.
  • the phrase “101 is a connector” and "101 connector” appearing in the above sentence are respectively keyword groups corresponding to the image identifier "101" in the text data.
  • the preset file parser designed for patent documents can be used to process the text data of the patent document.
  • the text data of the patent document can be divided into multiple contents according to format features such as section titles or blank lines. section. Then determine the name or type of each content part according to the section titles carried in the different content parts. For example, if the section title carried by a certain content part is "Background Technology (or BACKGROUND ART)", the content name corresponding to the content part can be determined as background technology, that is, the text data of the patent document corresponding to the background technology can be determined Scope area.
  • Background Technology or BACKGROUND ART
  • the text data recognition processing described above may specifically be a text data processing based on NLP (Natural Language Recognition Processing).
  • NLP Natural Language Recognition Processing
  • it can also be based on other principles, such as text data recognition processing based on other text features.
  • other suitable methods can be used to perform text data recognition processing according to specific conditions.
  • the following mainly uses natural language recognition processing to perform recognition processing on the text data in the third range area as an example to specifically explain how to recognize and determine the keyword group in the text data.
  • the above keyword group can be specifically understood as a combination of characters appearing in the text data and having a corresponding relationship with the image identifier in the drawing data. That is, the drawings or content objects in the drawings described by the keyword group are the same as the drawings or content objects in the drawings indicated by the image identifier.
  • the above keyword group may specifically be a combination of numbers and nouns, such as "101 component” or “component 101". It can also be a combination of letters and nouns, such as "A interface” or “interface A”. It can also be a combination of numbers and characters that characterize the graph, such as " Figure 1" or "fig1" and so on.
  • the keyword groups listed above are only for better describing the embodiments of this specification. During specific implementation, according to specific conditions and processing requirements, other types of character combinations can also be introduced as keyword groups. In this regard, this manual is not limited.
  • numbers or letters may be used as key characters first, and the key characters can be searched out from the text data. Then determine the character combination adjacent to the above key character (that is, the third character group), including the character combination where the key character is connected forward (for example, the character combination located in front of the key character), and the character combination where the key character is connected backward (For example, a combination of consecutive characters located after the key character).
  • the third character group specifically includes at least one of the following: a combination of a number and a noun, a combination of a letter and a noun, a combination of a number and a character representing a graph, and the like.
  • the adjacent character combination is a noun. If it is not a noun, it can be determined that the key character does not constitute a keyword group, and the text processing of the key character is ended. If a character combination in the adjacent character combination is a noun, it can be considered that the key character and the adjacent character combination determined as a noun may constitute a keyword group, and the character combination is a third character group that meets the requirements. Furthermore, the character combination can be determined as a keyword group.
  • semantic recognition can also be performed on the adjacent character combinations determined as nouns.
  • the semantic recognition result is semantically compared with the phrases in the keyword group library composed of commonly used keyword groups. If the semantic difference between the semantic recognition result and a certain phrase in the above keyword group library is less than the preset threshold degree, that is, the semantic recognition result is the same as or close to a certain phrase in the above keyword group library, then the adjacent The combination of characters and numbers or letters form a keyword group.
  • the image identifier obtained by performing preset image processing on the drawing data and refer to the characters contained in the image identifier to determine whether adjacent character combinations can form a keyword group with numbers or letters.
  • the above-mentioned method for determining the keyword group is only a schematic illustration. During specific implementation, according to specific application scenarios and accuracy requirements, other suitable methods may be used to determine the keyword group from the text data. In this regard, this manual is not limited.
  • the frequency of occurrence of each third character group in the text data may be determined. , Filtering out a third character with the highest appearance frequency from a plurality of third character groups as the keyword group.
  • the server finds the following phrases with similar semantics: "Figure 2", “ Figure 2", “Second Figure”, and count the frequency of occurrence of the three third character groups in the text data respectively It is: 1 time, 9 times, and 2 times.
  • the " Figure 2" with the highest frequency among the above phrases can be determined as a keyword group. This can effectively reduce noise interference and improve the accuracy of the determined keyword group.
  • the server can first identify and extract the separator in the third character group, compare the extracted separator with the preset symbol, determine the type of the separator, and then use it according to the type of the separator.
  • the split rule that matches the type of the separator splits multiple numbers into multiple numeric characters, and then re-splices them to obtain multiple keyword groups.
  • the extracted separator is the first type of preset separator, that is, the separator that is the same as the comma (","), comma (",”), etc.
  • the third Each number in the plurality of numbers included in the character group and other characters in the third character group except the number and the separator are combined and spliced to obtain a plurality of different keyword groups. For example, according to the original keyword group "Picture 1, 2, 3", split, combine and splice can get: “Picture 1", “Picture 2" and "Picture 3" these three keyword groups.
  • the extracted separator is the second type of preset separator, that is, the separator that is the same as the dash ("-"), curve (" ⁇ "), etc.
  • a number can be generated based on the numbers at both ends of the separator Sequence, and then sequentially extract a number from the number sequence and other characters in the third character group except the number and the separator, and combine and splice them to obtain a plurality of different keyword groups.
  • the numbers "1" and "3" at both ends of the separator "-" can be extracted first, and the above two numbers can be used as the start and end of the sequence. Number, generate a corresponding number sequence: "1, 2, 3".
  • the server or the client device may generate an identity serial number for indicating the keyword group according to the keyword group.
  • the identity serial number of the above-mentioned keyword group can also be understood as a kind of ID information, and the aforementioned identity serial number and the keyword group have a one-to-one correspondence.
  • a keyword group in the text data can be determined through an identity serial number.
  • a corresponding identity serial number can be determined through a keyword group.
  • the aforementioned identity serial number may specifically be a character string generated according to the same rule as the rule on which the identity serial number of the image identifier is determined.
  • generating the identity serial number of the keyword group may include: generating an identity serial number corresponding to the keyword group according to the numbers and/or letters contained in the keyword group. For example, the identity serial number of the keyword group "component 101" can be expressed as: 0101. In this way, the corresponding keyword group "101" can be accurately indicated through the identity serial number.
  • the above-mentioned method of generating the identity serial number corresponding to the keyword group is only a schematic illustration.
  • the identity serial number corresponding to the keyword group can also be generated according to other suitable rules. In this regard, this manual is not limited.
  • the location information of the keyword group can be further determined.
  • the position information of the keyword group can be specifically understood as a kind of address information used to indicate the position of the keyword group in the text in the target file, and can be recorded as pos.
  • the above-mentioned position information may specifically be a kind of position coordinates, and the specific position of the keyword group in the text can be accurately determined through the position information.
  • the location information listed above is only a schematic illustration. This specification does not limit the specific form of the above-mentioned location information.
  • the client device or the server can then save the identification number and location information of the keyword group determined by the above identification in the text data. For example, the corresponding relationship between the keyword group identified by the above identification and its identification number and location information can be established in the text data.
  • the processing of the text data of the target file is completed, and the processed text data containing the keyword group is obtained.
  • the specific position of the keyword group in the text data can be determined based on the above text data and the identity serial number of the keyword group.
  • the method may further include the following content when the method is specifically implemented: according to the image identification, the identity serial number of the corresponding image identification is generated, And the identity serial number of the image identification is stored in the drawing data.
  • the method may further include the following content: using a preset area division model to The text data is processed to determine a third range area from the text data, wherein the preset area division model is obtained by pre-training a neural network using sample text data.
  • the keyword group in the text data is determined according to the third range area in the text data.
  • it may include the following content: text the third range area in the text data
  • the third character group is recognized, wherein the third character group includes at least one of the following: a combination of numbers and nouns, a combination of letters and nouns, a combination of numbers and characters in a characterization map;
  • the third character group serves as the keyword group.
  • the method may further include the following content during specific implementation: counting each of the plurality of third character groups with similar semantics The appearance frequency of the third character group in the text data; the third character group with the highest appearance frequency is determined as the keyword group.
  • the method may also include the following content during specific implementation: counting the content contained in the text paragraph where each third character group of the multiple third character groups is located Number of characters; the third character group with the least number of characters in the text paragraph is taken as the keyword group.
  • the third character group may specifically further include: a combination of multiple numbers and separators (for example, ",”, “,” or “-”), in the third character group in the text data
  • the method may also include the following content during specific implementation: identifying the type of the separator; The split rule that matches the type of the symbol, splits the multiple numbers into multiple digital characters to be spliced; and separates the multiple digital characters to be spliced with the noun, letter or characterization in the third character group The characters of the picture are respectively spliced to obtain multiple keyword groups.
  • the method may further include the following content during specific implementation: according to the keyword group, the identity serial number of the corresponding keyword group is generated, and the The identity serial number of the keyword group is stored in the text data.
  • the display method of the drawing page receives and responds to the user's triggering operation on the image label in the drawing page through the client device, and determines the triggered image label; and then according to the obtained target file Figure data and text data.
  • the matching text of the triggered image identifier is displayed on the figure page to describe the figure indicated by the image identifier or the content object in the figure, so that the user no longer needs it.
  • the content of the drawings in the image page simplifies user operations and improves the user experience; the image data and text data of the target file are also recognized in advance to determine the image in the image data.
  • the identity serial numbers of the logo and image logo, as well as the identity serial numbers of the keyword group and keyword group in the text data are stored in the corresponding figure data and text data, so that the client device can subsequently identify the identity according to the triggered image
  • the serial number can quickly and accurately determine the matching keyword group from the text data, and then generate the corresponding matching text, and display it to the user, which improves the accuracy and efficiency of determining and displaying the matching text, and further improves the user experience.
  • this application also provides a corresponding drawing page display device, which can be referred to as shown in FIG. 12, which may specifically include the following structural modules:
  • the display module may be specifically used to display a page containing a target drawing in a target file, where the target file includes drawing data and text data, and the target drawing includes an image identifier;
  • the determining module may be specifically used to receive and respond to the trigger operation for the image identifier in the target drawing, and determine the triggered image identifier;
  • the display module can be specifically used to display the matching text of the triggered image identification on the page according to the drawing data and the text data, wherein the matching text of the triggered image identification includes at least To describe the sentence or phrase identified by the triggered image.
  • the image identification may specifically include: identification information for indicating the serial number of the drawing, and/or identification information for indicating the content object in the drawing, and so on.
  • the target document specifically includes at least one of the following: patent documents, dissertation documents, contract documents, trademark documents, and so on.
  • the device further includes an obtaining module, which can be specifically used to obtain the drawing data and text data of the target file before displaying the page containing the target drawing in the target file, wherein the drawing The data includes an image identifier, the text data includes a keyword group, which is a combination of characters in the text data used to indicate the drawing, and/or the content object in the drawing; and the receiving response module, which can be specifically It is used to receive a display operation for the target drawing in the target file; in response to the display operation, display a page containing the target drawing in the target file according to the drawing data of the target file.
  • an obtaining module which can be specifically used to obtain the drawing data and text data of the target file before displaying the page containing the target drawing in the target file, wherein the drawing The data includes an image identifier
  • the text data includes a keyword group, which is a combination of characters in the text data used to indicate the drawing, and/or the content object in the drawing
  • the receiving response module which can be specifically It is used to receive a display operation
  • the receiving response module may specifically include the following structural units:
  • the receiving unit may be specifically configured to receive a preset trigger operation, where the preset trigger operation includes at least one of the following: a pressing operation, a selection operation, and a click operation;
  • the first determining unit may be specifically configured to determine the receiving position where the preset trigger operation is received;
  • the second determining unit may be specifically configured to determine an image identifier whose distance from the receiving position is less than or equal to a preset trigger distance threshold as the triggered image identifier.
  • the display module may specifically include the following structural units:
  • the third determining unit may be specifically configured to determine the identity serial number of the triggered image identifier according to the drawing data of the target file;
  • the first retrieval unit may be specifically configured to retrieve, from the keyword groups contained in the text data of the target file, a keyword group whose identity serial number matches the identity serial number of the triggered image identifier as the associated keyword group;
  • the fourth determining unit may be specifically configured to determine the matching text of the triggered image identifier according to the associated keyword group;
  • the first display unit may be specifically configured to display the matched text of the triggered image identifier at a preset display position on the page.
  • the first display unit may specifically determine the preset display position according to the following procedure: determine the position of the triggered image identifier in the target drawing, and the position of the center point of the target drawing; The position of the triggered image identifier in the target drawing and the position of the center point of the target drawing determine the preset display position, wherein the preset display position includes the center point far away from the target drawing Direction, and the distance from the triggered image mark is less than or equal to the preset first distance threshold.
  • the first display unit may specifically include the following structural subunits:
  • the first display subunit may be specifically configured to display characters of less than a preset length in the matched text of the triggered image identifier at a preset display position on the page;
  • the first receiving subunit may be specifically configured to receive a trigger operation for characters less than a preset length in the matched text of the triggered image identifier
  • the second display subunit may be specifically configured to respond to the trigger operation and display the matching text of the triggered image identifier.
  • the matching text of the triggered image identification can be displayed according to the following procedure: the matching text of the triggered image identification is displayed in a preset display mode, where ,
  • the preset display mode includes at least one of the following: display in a highlighted manner; display in a bold font; display in a preset font color; display in a floating frame; display in an italic font Display; display in a way that the font size becomes larger, etc.
  • the preset display modes listed above are only schematic illustrations. This manual does not limit the above preset display mode.
  • the device further includes a data processing module, and the data processing module may be specifically configured to perform corresponding processing on the text data and the drawing data of the target file, respectively, to obtain that the information obtained by the acquisition module includes identification The figure data of the determined image identification and the text data containing the identified keyword group.
  • the data processing module specifically includes a drawing data processing unit and a text data processing unit.
  • the drawing data processing unit may be specifically used to identify and determine the image identifier contained in the drawing data.
  • the text data processing unit may be specifically used to identify and determine the keyword group contained in the text data.
  • the image identifier in the drawing data may be determined in the following manner: the attachments containing the preset header are filtered out from the drawing data Image data; image data recognition is performed on the first range area in the image data containing the preset page header, and the recognized first character group is used as the image identifier, wherein the first character group includes : A combination of numbers and characteristic characters of the characterization map; according to the recognized first character group, the second range area is determined in the figure data containing the preset page header; Image data recognition is performed on the second range area in the drawing data of the header, and the recognized second character group is used as the image identifier, wherein the second character group includes: and the preset page header
  • the characteristic line in the figure data is a combination of numbers and/or letters whose distance is less than the preset second distance threshold, and the characteristic line includes: a straight line carrying an arrow, and/or a curve carrying an arrow.
  • the drawing data processing unit after determining the image identification contained in the drawing data, may be specifically used to generate the identity serial number of the corresponding image identification according to the image identification, And the identity serial number of the image identification is stored in the drawing data.
  • the keyword group in the text data may be determined in the following manner, including: retrieving the section headings in the text data; The text data is divided into a plurality of text areas; a text area that meets the preset requirements is selected from the plurality of text areas as the third range area, wherein the third range area includes at least the subsection headings and description of the drawings Or the text area of BRIFE DESCRIPTION OF DRAWINGS; according to the third range area in the text data, the keyword group in the text data is determined.
  • the text data processing unit when the text data is retrieved and it is determined that the text data does not contain section headings, when the text data processing unit is specifically implemented, the text data processing unit can also be used to compare all the text data using a preset area division model.
  • the text data is processed to determine a third range area from the text data, wherein the preset area division model is obtained by pre-training a neural network using sample text data.
  • the third range area in the text data may be determined.
  • Perform text data recognition in the range area and recognize a third character group, where the third character group includes at least one of the following: a combination of numbers and nouns, a combination of letters and nouns, a combination of numbers and characters in a characterization map;
  • the recognized third character group serves as the keyword group.
  • the text data processing unit may also be used to: count the plurality of third character groups with similar semantics during specific implementation The appearance frequency of each third character group in the text data; the third character group with the highest appearance frequency is determined as the keyword group.
  • the text data processing unit may also be used to: count each third character group in the multiple third character groups during specific implementation The number of characters contained in the text paragraph; the third character group with the least number of characters in the text paragraph is taken as the keyword group.
  • the third character group further includes: a combination of a plurality of numbers and separators, and the text data processing unit performs data processing on the third range region in the text data, and recognizes that After the third character group of a combination of multiple numbers and separators, in specific implementation, the text data processing unit may also specifically execute according to the following procedure: identify the type of the separator; match according to the type of the separator According to the split rule, the multiple numbers are split into multiple digital characters to be spliced; the multiple digital characters to be spliced are separated from the nouns, letters, or characters in the third character group Perform splicing to obtain multiple keyword groups.
  • the text data processing unit when the text data processing unit is specifically implemented, it may also be used to generate the identity serial number of the corresponding keyword group according to the keyword group, and combine all The identity serial number of the keyword group is stored in the text data.
  • the units, devices, or modules described in the above embodiments may be specifically implemented by computer chips or entities, or implemented by products with certain functions.
  • the functions are divided into various modules and described separately.
  • the function of each module can be implemented in the same one or more software and/or hardware, or a module that implements the same function can be implemented by a combination of multiple sub-modules or sub-units.
  • the device embodiments described above are merely illustrative, for example, the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components can be combined or integrated To another system, or some features can be ignored, or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the display device of the drawing page determines the triggered image identity by receiving and responding to the user's triggering operation on the image identity in the drawing page displayed by the display module;
  • the acquired drawing data and text data of the target file, the matching text of the triggered image identifier is displayed on the drawing page to describe the drawing indicated by the image identifier or the content object in the drawing, So that users no longer need to search for text sentences related to the image identification of interest in the target drawing in the text of the target file by themselves, and can accurately obtain other information related to the image identification, which is convenient for users to efficiently Read and understand the content of the drawings in the drawings page carefully, thereby simplifying user operations and improving user experience.
  • the client device includes a network communication interface, a processor, and a memory.
  • the above-mentioned structures are connected by internal cables, so that each structure can perform Specific data interaction.
  • the network communication interface may be specifically used to obtain drawing data and text data of the target file, and to receive a display operation for the target drawing in the target file.
  • the processor may be specifically used to display a page containing a target drawing in a target file, where the target file includes drawing data and text data, and the target drawing includes an image identifier; receiving and responding For the trigger operation of the image identification in the target drawing, the triggered image identification is determined; according to the image data and the text data, the matching text of the triggered image identification is displayed on the page, wherein, the matching text of the triggered image identification includes at least a sentence or phrase used to describe the triggered image identification.
  • the memory may be specifically used to store a corresponding instruction program on which the processor is based.
  • the network communication interface may be a virtual port that is bound to different communication protocols, so that different data can be sent or received.
  • the network communication interface may be port 80 responsible for web data communication, port 21 responsible for FTP data communication, or port 25 responsible for mail data communication.
  • the network communication interface may also be a physical communication interface or a communication chip.
  • it can be a wireless mobile network communication chip, such as GSM, CDMA, etc.; it can also be a Wifi chip; it can also be a Bluetooth chip.
  • the processor may be implemented in any suitable manner.
  • the processor may take the form of a microprocessor or a processor and a computer-readable medium, logic gates, switches, application specific integrated circuits ( Application Specific Integrated Circuit (ASIC), programmable logic controller and embedded microcontroller form, etc. This manual is not limited.
  • the memory can include multiple levels.
  • a digital system as long as it can store binary data, it can be a memory; in an integrated circuit, a circuit with a storage function without a physical form is also called a memory. Such as RAM, FIFO, etc.; in the system, storage devices in physical form are also called memories, such as memory sticks, TF cards, etc.
  • the embodiment of the present specification also provides a computer storage medium based on the above-mentioned method for displaying pages of the drawings.
  • the computer storage medium stores computer program instructions, which are implemented when the computer program instructions are executed: the display contains the target file The page of the target drawing in the target drawing, wherein the target file includes drawing data and text data, and the target drawing contains image identification; receiving and responding to the trigger operation for the image identification in the target drawing, determining According to the image data and the text data, the matching text of the triggered image identification is displayed on the page, wherein the matching text of the triggered image identification includes at least To describe the sentence or phrase identified by the triggered image.
  • the above-mentioned storage medium includes but is not limited to random access memory (Random Access Memory, RAM), read-only memory (Read-Only Memory, ROM), cache (Cache), and hard disk (Hard Disk Drive, HDD). Or memory card (Memory Card).
  • the memory can be used to store computer program instructions.
  • the network communication unit may be an interface set up in accordance with a standard stipulated by the communication protocol and used for network connection communication.
  • the embodiment of the present application also provides another method for displaying the page of the drawings.
  • the specific implementation may include the following content:
  • S1401 Display a page containing a target drawing in a target file, where the target file includes drawing data and text data, the drawing data includes an image identifier, the text data includes a keyword group, and the The target drawing contains an image mark.
  • the method may further include receiving a page display request for the target drawing in the target file sent by the user.
  • the above-mentioned page display request can be specifically understood as a kind of request data carrying a target drawing in a target file that the user is concerned about or wants to query.
  • the client device can determine the target file that the user is concerned about or want to query, and the corresponding selected target drawing, and then can generate and show to the user the target file according to the drawing data and text data of the target file.
  • the drawing page of the above target drawing can be specifically understood as a kind of request data carrying a target drawing in a target file that the user is concerned about or wants to query.
  • the user can input the name or number of the file he wants to query in the query box displayed on the client device to indicate the target file.
  • the client device can display a directory page for the target file according to the name or number of the target file input by the user.
  • the user can select the figure to be queried as the target figure through operations such as clicking or checking in the directory page, so that the client device can receive the user's page display request for the target figure in the target file.
  • the above-mentioned manner of receiving the page display request for the target drawing in the target file is only a schematic illustration.
  • the user can also directly enter the name or number of the file to be inquired and the specific number of the drawing to be inquired in the displayed query box according to the preset input rules, and use The preset connector (for example, "&") is separated, and the client device can receive the page display request for the target drawing in the target file by receiving the above parameter data input by the user according to the preset input rule.
  • the preset connector for example, "&"
  • This specification does not limit the specific implementation manner of receiving the page display request for the target drawing in the target file.
  • the method before displaying the page containing the target drawing in the target file, when the method is specifically implemented, the method further includes: acquiring drawing data and text data of the target file, wherein the drawing data is Contains an image identifier that has been identified and determined in advance, and the text data contains a keyword group that has been identified and determined in advance.
  • the foregoing display of the page containing the target drawing in the target file may include the following: the client device determines the user’s location according to the page display request sent by the user for the target drawing in the target file.
  • the target drawing in the selected target file the drawing data of the target file can then be obtained and the drawing data of the target drawing contained in the target drawing; and then based on the above drawing data containing the target drawing, it is generated and shown to the user.
  • the drawing page of the target drawing of the target file contains an image mark.
  • the client device while the client device generates and displays the drawing page containing the target drawing based on the above-mentioned drawing data containing the target drawing, it also performs processing on the above-mentioned drawing data containing the target drawing.
  • Search determine the image identifier contained in the above-mentioned drawing data containing the target drawing, and then obtain the text data of the target file, and retrieve the text of the target file based on the image identifier contained in the above-mentioned drawing data containing the target drawing Data, the keyword group corresponding to the image identifier is determined, and the matching text of the image identifier can be determined according to the keyword group.
  • the client device will temporarily not display the matching text of the image identifier contained in the target drawing on the drawing page.
  • S1403 Receive a trigger operation of the first type, where the trigger operation of the first type is used to control the display of the matching text of the image identifier contained in the target drawing.
  • the above-mentioned first type of trigger operation can be specifically understood as a preset trigger operation for controlling the display of the matching text of the image identifier contained in the target drawing.
  • any one of the multiple preset trigger operations listed below can be determined as the first type of trigger operation: pressing operation, selecting operation, and clicking operation on a specific position on the page Wait.
  • the first type of trigger operation listed above is only for better describing the embodiments of this specification.
  • other appropriate operations can also be introduced as the first type of trigger operation described above. In this regard, this manual is not limited.
  • control icons may be set in advance on the displayed drawing page.
  • the "matching text display” icon in the upper right corner of the page in Figure 9.
  • the user can control the matching text of the image logo to be displayed on the displayed page according to their own needs through the operation settings of the control icon.
  • the user can move the "matching text display” icon to the left to the "on” position, which can be regarded as the first Class trigger operation.
  • the client device can receive the first type of trigger operation described above.
  • control icons listed above are only to better illustrate the embodiments of this specification.
  • other types of switch control buttons can be provided according to specific conditions and usage scenarios, so that users can flexibly control the display of matching text on the page according to their needs.
  • the embodiment of this specification does not limit it.
  • the client device may further respond to the first type of trigger operation, and combine the target drawings previously determined based on the drawing data and text data of the target file.
  • the matching text of each image identification included is displayed together on the current drawing page for users to read and use.
  • the client device when the user flicks the icon of “matching text display” to the left to the position of “on”, the client device will respond to the trigger operation and display on the page Matching text for all image identifications contained in the currently displayed drawing.
  • the client device may respond to the trigger operation of the first type, according to the matching text of the image identifier contained in the target drawing that has been determined, on the basis of the current drawing page, in advance Set the display position to add matching text showing each image logo.
  • the client device may also respond to the first type of trigger operation, based on the above-mentioned drawing data containing the target drawing, and the image identifier contained in the determined target drawing.
  • Use HTML Canvas to redraw a display page for users to read; or use patented text layout algorithm to display the changed data, such as the matching text of each image logo, so as to show the user the content of the image logo. Pages that match the text; you can also display the pages that contain the matching text of the image logo for users to read by opening the settings in the browser and other reading tools.
  • This application provides a method for displaying a drawing page, because the client device first displays the target drawing page containing the target file to the user; when the user needs it, the first type of trigger operation sent by the user is received , And then display the matching text of the image identification contained in the target drawing on the page, so that the user no longer needs to search the text of the target file and the image of interest in the target drawing.
  • the text sentence related to the logo can accurately obtain other information related to the image logo, which is convenient for users to efficiently read and understand the content of the drawings in the drawings page, which simplifies user operations and does not affect the user's reading goals.
  • the content of the drawing itself has an impact and can meet the personalized needs of users, thereby further improving the user experience.
  • the method may also include the following content during specific implementation:
  • S4 Receive a second type of trigger operation, where the second type of trigger operation is used to control hiding the matching text of the image identifier contained in the target drawing.
  • the second type of trigger operation described above can be specifically understood as a preset trigger operation that is opposite to the first type of trigger operation and is used to control the hiding of the matching text of the image identifier contained in the target drawing.
  • any one of the multiple preset trigger operations listed below can be determined as the second type of trigger that is different from the first type of trigger operation according to the pre-negotiated rules Operation: Press operation, select operation, click operation, etc. on a specific position on the page.
  • Press operation, select operation, click operation, etc. Press operation, select operation, click operation, etc. on a specific position on the page.
  • the second type of trigger operation listed above is only for better describing the embodiments of this specification.
  • other appropriate operations can be introduced as the second type of trigger operation described above. In this regard, this manual is not limited.
  • control icons may be set on the displayed drawing page in advance.
  • the "matching text display” icon in the upper right corner of the page in Figure 9.
  • the user can control the matching text of the image logo not to be displayed on the displayed page according to their own needs through the operation settings of the control icon.
  • the user can move the above-mentioned "matching text display” icon to the right to the "off" position, which can be regarded as sending the second Class trigger operation.
  • the client device can receive the second type of trigger operation described above.
  • control icons listed above are only to better illustrate the embodiments of this specification.
  • other types of switch control buttons can be provided according to specific conditions and usage scenarios, so that users can flexibly control the non-display of matching text on the page according to their needs.
  • the embodiment of this specification does not limit it.
  • the client device can hide the matching text of the image identifier displayed on the page in various hiding methods in response to the second type of trigger operation.
  • the client device can hide the matching text of the image identifier by covering the matching text of the image identifier with a tile whose color is the background color of the target drawing. It is also possible to hide the matching civilization of the image identification by adjusting the characters of the matching text of the image identification to transparent characters. Or it is also possible to redraw a display page that does not carry the matching text of the image identifier through HTML Canvas based only on the above-mentioned figure data containing the target drawing, and replace the current display page to show the user the matching text of the hidden image identifier.
  • the displayed matching text is reduced to the content of the target drawing itself.
  • the target drawing contained in the target drawing is displayed on the page as shown in Figure 15.
  • the matching text of the image identification may also include the following content: in response to the first type of trigger operation, a data list of the target drawing is displayed on the page, wherein the data list contains Describe the image identification in the target drawing and the matching text of the image identification.
  • the above-mentioned data list can be understood as a kind of data in which each image identifier contained in the target drawing and the matching text corresponding to each image identifier are set in a certain setting sequence.
  • the above data list may be a list containing a dividing line frame, or may be a text that does not contain a dividing line frame, but the correspondence between the same image identifier and the matching text is clear.
  • the data list may be located in a blank area in the target drawing, or may be located on the displayed drawing page, outside the target drawing, and in a blank area close to the target drawing.
  • the data list listed above is only a schematic illustration. During specific implementation, other forms of data lists can also be selected according to specific conditions. This specification does not limit the specific form of the data list.
  • the client device can respond to the first type of trigger operation when receiving the first type of trigger operation, and display the data list of the target drawing on the currently displayed drawing page, so that Users can easily understand the matching text of each image identifier in the target drawing; when receiving the second type of trigger operation, they can respond to the second type of trigger operation and hide the displayed target drawing data in the currently displayed drawing page
  • the data list is so that users can read the content of the target drawing itself clearly and without interference, so as to meet the diverse needs of users.
  • the aforementioned data list may be specifically generated based on matched text data corresponding to the target drawing data.
  • the above-mentioned target drawing data can be specifically understood as the drawing data used to generate the target drawing in the drawing data of the target file, wherein the above-mentioned target drawing data contains the pre-identified and determined information contained in the target drawing. Out of the image logo.
  • the above-mentioned matched text data can be specifically understood as the text data in the text data of the target file that is used to generate the matching text of each image identifier contained in the target drawing, wherein the above-mentioned matching text data contains the pre-determined and Each image in the target drawing identifies the corresponding keyword group.
  • the method may further include the following during specific implementation Content: Receive and respond to the trigger operation for the image identification in the target drawing, and determine the triggered image identification in the target drawing; display the triggered image in the data list in a preset display mode The matching text of the logo and the triggered image logo.
  • the image identification and matching text in the data list displayed on the above-mentioned drawing page can be set to correspond to each other with the image identification in the target drawing.
  • the image identifier in the target drawing displayed on the drawing page is not triggered
  • the image identification and the matching text of the image identification in the data list can be first displayed in a normal untriggered display mode (for example, it can be displayed in a lighter gray or transparent font).
  • the client device can receive and respond to the above trigger operation to determine the triggered image identifier in the target drawing.
  • the same image identification in the data list and the matching text of the same image identification can be determined as the triggered image identification in the data list and the triggered image identification.
  • Match text Then through the preset display mode (that is to distinguish the above-mentioned normal untriggered display mode, for example, display in the form of a deeper gray or opaque font, or a floating frame) to display the triggered image identification in the data list, and the The matching text of the triggered image ID. In this way, the matching text of the image identification selected by the user can be highlighted from the matching text containing multiple image identifications, and the user experience is further improved.
  • the method when the method is specifically implemented, further It may include the following content: receiving and responding to the trigger operation for the image identification in the data list, and determining the triggered image identification in the data list; displaying the triggered image in the target drawing in a preset display mode Image logo.
  • the target The image logo in the drawing can be displayed in a normally untriggered display mode (for example, it can be displayed in a font with a lighter gray scale or a more transparent font).
  • the client device can receive and respond to the above trigger operation to determine the triggered image identifier in the data list.
  • the same image identification in the target drawing can be determined as the triggered image in the target drawing Logo. Then, the triggered image identifier in the target drawing is displayed through a preset display mode (that is, to distinguish the above-mentioned normal untriggered display mode, for example, a deeper gray or opaque font, or a floating frame for display). This can more clearly highlight the image mark selected by the user from the target drawing containing multiple image marks, and further improve the user experience.
  • a preset display mode that is, to distinguish the above-mentioned normal untriggered display mode, for example, a deeper gray or opaque font, or a floating frame for display.
  • trigger operations can be performed on the multiple image identifiers in the data list; accordingly, it will be displayed in a preset display mode on the drawing page Multiple triggered image identifications, and/or matching texts of multiple triggered image identifications are generated.
  • the user is also interested in the image marks 280, 282, and 292.
  • the user can click on the image identifiers 280, 282, and 292 in the data list in the data list to initiate a trigger operation.
  • the client device can receive and respond to the above operations.
  • the triggered image identifiers 280, 282, and 292 and the corresponding matching text are displayed simultaneously in a highlighted manner, namely: "280.locking arm” ", "282.central section” and "292.central section”.
  • the matching text of 280, 282, and 292 that were originally hidden can also be displayed correspondingly in the target drawing on the drawing page.
  • the user can also click on the image identifiers 280, 282, and 292 in the target drawing to initiate a trigger operation.
  • the client device can receive and respond to the above operations, and display the originally hidden matching text of 280, 282, and 292 in a highlighted manner in the target drawing on the drawing page.
  • the triggered image identifiers 280, 282, and 292, and the corresponding matching text can be correspondingly displayed in a highlighted manner.
  • a preset identification mark may also be displayed near the triggered image mark, for example, the eye-shaped image mark in FIG. 29, to identify the image mark, and the image mark The matching text of is currently in the triggered display state.
  • the method before displaying the page containing the target drawing in the target file, when the method is specifically implemented, the method may further include the following content: receiving a display operation for the target drawing in the target file; response; For the display operation of the target drawing in the target file, generate and send a data acquisition request for the target drawing in the target file; acquire target drawing data and matching text data, where the target drawing data includes There is an image identifier in the target drawing, the matched text data contains the matching text of the image identifier in the target drawing, and the target drawing data and the matched text data are based on the drawing data of the target file And the text data is determined.
  • the client device can receive the above-mentioned information about the target drawing in the target file by receiving the user's selection operation for the target file and the user's selection operation for the target drawing in the target file. Show operation.
  • the client device may generate and send a data acquisition request for the target drawing in the target file to request to obtain the target drawing data. , And matching text data.
  • the target drawing data and the matched text data are acquired.
  • the target drawing data and the matched text data may be obtained by the server according to the drawing data and text data of the target file. What is obtained after corresponding processing and provided to the client device may also be obtained by the client device itself performing corresponding processing on the drawing data and text data of the target file.
  • the method may further include the following during specific implementation Content: Receive a third type of trigger operation, where the third type of trigger operation is used to control the display position of the matching text of the image mark contained in the target drawing; in response to the third type of trigger operation, The preset display area in the page displays the matching text of the image identifier contained in the target drawing, wherein the preset display area includes a blank area in the page.
  • the above-mentioned third type of trigger operation can be specifically understood as a type of trigger operation that is different from the first type of trigger operation and the second type of trigger operation, and is used to control the display of the matching text of the image identifier contained in the target drawing.
  • Preset trigger operation for position Similar to the first type of trigger operation and the second type of trigger operation, it can be determined according to the pre-negotiated rules that any one of the multiple preset trigger operations listed below is different from the first type of trigger operation and the second type of trigger operation.
  • the trigger operation of the second type of trigger operation is regarded as the third type of trigger operation: pressing operation, selecting operation, and clicking operation on a specific position on the page.
  • the display position of the matching text of the image mark when the user feels that the display position of the matching text of the image mark does not conform to the user's reading habits, or affects the user's reading, for example, the displayed matching text obscures the original image content of the target drawing.
  • the display position of the matched text can be adjusted by issuing a third type of preset trigger operation on the client device.
  • the client device can respond to the third type of trigger operation, and display the information in the target drawing in the preset display area of the page.
  • the matching text for the included image ID can be specifically understood as an area on the page for displaying the matching text of each image identifier contained in the target drawing. This can include blank areas on the page and so on.
  • the preset display area listed above is only a schematic illustration. During specific implementation, according to specific circumstances, the foregoing preset display area may also include multiple different types of areas for the user to flexibly select.
  • the preset display area may specifically include: a blank area inside the target drawing, a blank area outside the target drawing in the page, the left area of the image logo, the upper area of the image logo, and so on.
  • the user can also trigger the third type of trigger operation to drag the matched text to any area on the page as a preset display area to display the matched text. So as to meet the diversified use needs of different users.
  • the method in response to the third type of triggering operation, after displaying the matching text of the image identification contained in the target drawing in the preset display area of the page, the method is specifically implemented , May also include the following content: receiving and responding to the trigger operation for the image identification in the target drawing, and determining the triggered image identification in the target drawing; in the preset display area in a preset display mode Display the matching text of the triggered image ID.
  • the selected image identifier is highlighted.
  • the image identifier in the display area and the image identifier in the target drawing can be set to correspond to each other.
  • a trigger operation can be initiated for the matching text.
  • the client device can receive and respond to the trigger operation for the matching text, and determine the matching text of the triggered image logo in the preset display area; and then can determine it in the target drawing and display it in a display mode different from other image logos
  • the image identifier corresponding to the triggered matching text in the preset display area in the target drawing is output.
  • the foregoing preset display mode may specifically include at least one of the following: displaying in a highlighted manner; displaying in a bold font; displaying in a preset font color; displaying in a floating frame; Display in italics; display in larger font size, etc.
  • displaying in a highlighted manner displaying in a bold font
  • displaying in a preset font color displaying in a floating frame
  • Display in italics display in larger font size, etc.
  • other suitable display modes may also be adopted as the above-mentioned preset display modes according to specific conditions. In this regard, this manual is not limited.
  • the method in response to the third type of triggering operation, after displaying the matching text of the image identification contained in the target drawing in the preset display area of the page, the method is specifically implemented , May also include the following content: receiving and responding to the trigger operation for the matching text in the preset display area, and determining the matching text that is triggered in the preset display area; using the preset in the target drawing The display mode displays the image identification corresponding to the triggered matching text.
  • a trigger operation can also be initiated for the image identifier.
  • the client device can receive and respond to the triggering operation for the image mark, and determine the triggered image mark in the target drawing; then, it can determine in the preset display area and display the preset in a display mode that is different from other matching text The matching text corresponding to the triggered image identifier in the target drawing in the display area.
  • the image identifier contained in the drawing data can be specifically identified and determined in the following manner: the drawing data containing the preset page header is filtered out from the drawing data; The first range area in the drawing data of the header is preset to perform image data recognition, and the recognized first character group is used as the image identifier, wherein the first character group includes: numbers and characteristic characters of the representation map According to the identified first character group, the second range area is determined in the picture data containing the preset header; for the first part of the picture data containing the preset header The image data is recognized in the second range area, and the recognized second character group is used as the image identifier, wherein the second character group includes: the same as the feature line in the drawing data containing the preset header A combination of numbers and/or letters whose distance is less than the preset second distance threshold, the characteristic line includes: a straight line carrying an arrow, and/or a curve carrying an arrow.
  • the method may further include the following content during specific implementation: according to the image identifier, the identity serial number of the corresponding image identifier is generated, and The identity serial number of the image identification is stored in the drawing data.
  • the above-mentioned determination of the keyword group in the text data may include the following content during specific implementation: searching for section titles in the text data; dividing the text data into multiple sections according to the section titles A text area; a text area that meets the preset requirements is selected from the multiple text areas as the third range area, where the third range area at least includes the section titled description of the drawing or BRIFE DESCRIPTION OF DRAWINGS Text area; according to the third range area in the text data, determine the keyword group in the text data.
  • the method may further include the following content: using a preset area division model to The text data is processed to determine a third range area from the text data, wherein the preset area division model is obtained by pre-training a neural network using sample text data.
  • the above-mentioned determination of the keyword group in the text data according to the third range area in the text data may also include the following content in specific implementation: the third range area in the text data Perform text data recognition, and identify a third character group, where the third character group includes at least one of the following: a combination of numbers and nouns, a combination of letters and nouns, a combination of numbers and characters in a characterization map; The third character group is used as the keyword group.
  • the method may further include the following content during specific implementation: counting each of the plurality of third character groups with similar semantics The appearance frequency of the third character group in the text data; the third character group with the highest appearance frequency is determined as the keyword group.
  • the method may further include the following content when implementing the method: counting the text of each third character group in the multiple third character groups The number of characters contained in the paragraph; the third character group with the least number of characters contained in the text paragraph is used as the keyword group.
  • the third character group may further include: a combination of multiple numbers and separators, and data processing is performed on the third range area in the text data, and it is recognized that multiple numbers and separators are included.
  • the method may further include the following: identifying the type of the separator; according to a split rule matching the type of the separator, dividing the multiple The number is split into a plurality of digital characters to be spliced; the plurality of digital characters to be spliced are spliced with the nouns, letters or characters of the characterization graph in the third character group respectively to obtain a plurality of keyword groups.
  • the method may further include the following content during specific implementation: according to the keyword group, the identity serial number of the corresponding keyword group is generated, and the The identity serial number of the keyword group is stored in the text data.
  • the method for displaying a drawing page is that the target drawing page containing the target file is first displayed to the user through the client device; when the user needs it, the first type of trigger sent by the user is received In the case of operation, the matching text of the image identification contained in the target drawing is displayed on the page, so that the user no longer needs to search back and forth in the text of the target file with the target drawing.
  • the text sentence related to the image identification of interest can be accurately obtained other information related to the image identification, which is convenient for users to efficiently read and understand the content of the drawings in the drawings page, which simplifies user operations and does not It affects the user's reading of the content of the target drawing itself, which can meet the user's personalized needs, thereby further improving the user's use experience; it also displays the image identification in the target drawing in the form of a data list on the page Match the text and associate the image identifiers in the data list with the image identifiers in the target drawings, which facilitates the user to read the content of the drawings and the text, and further improves the user experience.
  • this application also provides a corresponding drawing page display device, which may specifically include the following structural modules:
  • the display module can be specifically used to display a page containing a target drawing in a target file, where the target file includes drawing data and text data, the drawing data includes image identifiers, and the text data includes Keyword group, the target drawing contains image identification;
  • the determining module may be specifically configured to receive a first type of trigger operation, where the first type of trigger operation is used to control the display of the matching text of the image identifier contained in the target drawing;
  • the display module can be specifically used to respond to the first type of triggering operation, and display on the page the matching text of the image identification contained in the target drawing, wherein the matching text of the image identification includes A sentence or phrase describing the image logo.
  • the determining module may be specifically used to receive a second type of trigger operation, where the second type of trigger operation is used to control hiding of the matching text of the image identifier contained in the target drawing ;
  • the display module may also be specifically configured to respond to the second type of trigger operation to hide the matching text of the image identifier contained in the target drawing on the page.
  • the display module may be specifically configured to respond to the In the first type of trigger operation, the data list of the target drawing is displayed on the page, wherein the data list contains the image identification in the target drawing and the matching text of the image identification.
  • the determining module may be specifically configured to receive and respond to a trigger operation for the image identifier in the target drawing, and determine the triggered image identifier in the target drawing;
  • the display module may also be specifically used to display the triggered image identifier and the matching text of the triggered image identifier in a preset display manner in the data list.
  • the determining module may be specifically configured to receive and respond to a trigger operation for the image identifier in the data list, and determine the image identifier that is triggered in the data list;
  • the display module may be specifically used to display the triggered image identifier in the target drawing in a preset display manner.
  • the device further includes an obtaining module, which may be specifically configured to receive a display of the target drawing in the target file before displaying the page containing the target drawing in the target file Operation; in response to the display operation for the target drawing in the target file, generate and send a data acquisition request for the target drawing in the target file; acquire target drawing data and matching text data, wherein the target attachment
  • the image data includes the image identifier in the target image
  • the matched text data includes the matching text of the image identifier in the target image
  • the target image data and the matched text data are based on the image identifier of the target file.
  • Figure data and text data are determined.
  • the determining module may be specifically used to receive a third type of trigger operation, wherein the third type of trigger operation is used to control the matching text of the image identifier contained in the target drawing. Display position
  • the display module may also be specifically configured to respond to the third type of triggering operation, in a preset display area on the page, to display the matching text of the image identifier contained in the target drawing, wherein the The preset display area includes blank areas on the page.
  • the determining module may be specifically configured to receive and respond to a trigger operation for the image identifier in the target drawing, and determine the triggered image identifier in the target drawing;
  • the display module may also be specifically configured to display the matched text of the triggered image identifier in a preset display mode in the preset display area.
  • the determining module may be specifically configured to receive and respond to a trigger operation for the matching text in the preset display area, and determine the matching text that is triggered in the preset display area;
  • the display module may be specifically used to display the image identifier corresponding to the triggered matching text in a preset display mode in the target drawing.
  • the device further includes a data processing module, and the data processing module may be specifically configured to perform corresponding processing on the text data and the drawing data of the target file, respectively, to obtain that the information obtained by the acquisition module includes identification The figure data of the determined image identification and the text data containing the identified keyword group.
  • the data processing module specifically includes a drawing data processing unit and a text data processing unit.
  • the drawing data processing unit may be specifically used to identify and determine the image identifier contained in the drawing data.
  • the text data processing unit may be specifically used to identify and determine the keyword group contained in the text data.
  • the image identifier in the drawing data may be determined in the following manner: the attachments containing the preset header are filtered out from the drawing data Image data; image data recognition is performed on the first range area in the image data containing the preset page header, and the recognized first character group is used as the image identifier, wherein the first character group includes : A combination of numbers and characteristic characters of the characterization map; according to the recognized first character group, the second range area is determined in the figure data containing the preset page header; Image data recognition is performed on the second range area in the drawing data of the header, and the recognized second character group is used as the image identifier, wherein the second character group includes: and the preset page header
  • the characteristic line in the figure data is a combination of numbers and/or letters whose distance is less than the preset second distance threshold, and the characteristic line includes: a straight line carrying an arrow, and/or a curve carrying an arrow.
  • the drawing data processing unit after determining the image identification contained in the drawing data, may be specifically used to generate the identity serial number of the corresponding image identification according to the image identification, And the identity serial number of the image identification is stored in the drawing data.
  • the keyword group in the text data may be determined in the following manner, including: retrieving the section headings in the text data; The text data is divided into a plurality of text areas; a text area that meets the preset requirements is selected from the plurality of text areas as the third range area, wherein the third range area includes at least the subsection headings and description of the drawings Or the text area of BRIFE DESCRIPTION OF DRAWINGS; according to the third range area in the text data, the keyword group in the text data is determined.
  • the text data processing unit when the text data is retrieved and it is determined that the text data does not contain section headings, when the text data processing unit is specifically implemented, the text data processing unit can also be used to compare all the text data using a preset area division model.
  • the text data is processed to determine a third range area from the text data, wherein the preset area division model is obtained by pre-training a neural network using sample text data.
  • the third range area in the text data may be determined.
  • Perform text data recognition in the range area and recognize a third character group, where the third character group includes at least one of the following: a combination of numbers and nouns, a combination of letters and nouns, a combination of numbers and characters in a characterization map;
  • the recognized third character group serves as the keyword group.
  • the text data processing unit may also be used to: count the plurality of third character groups with similar semantics during specific implementation The appearance frequency of each third character group in the text data; the third character group with the highest appearance frequency is determined as the keyword group.
  • the text data processing unit may also be used to: count each third character group in the multiple third character groups during specific implementation The number of characters contained in the text paragraph; the third character group with the least number of characters in the text paragraph is taken as the keyword group.
  • the third character group further includes: a combination of a plurality of numbers and separators, and the text data processing unit performs data processing on the third range region in the text data, and recognizes that After the third character group of a combination of multiple numbers and separators, in specific implementation, the text data processing unit may also specifically execute according to the following procedure: identify the type of the separator; match according to the type of the separator According to the split rule, the multiple numbers are split into multiple digital characters to be spliced; the multiple digital characters to be spliced are separated from the nouns, letters, or characters in the third character group Perform splicing to obtain multiple keyword groups.
  • the text data processing unit when the text data processing unit is specifically implemented, it may also be used to generate the identity serial number of the corresponding keyword group according to the keyword group, and combine all The identity serial number of the keyword group is stored in the text data.
  • the display device of the drawing page provided by this application first displays the target drawing page containing the target file to the user through the display module; when the user needs the determination module and the display module, the user sends the first In the case of a similar trigger operation, the matching text of the image identification contained in the target drawing is displayed on the page, so that the user no longer needs to search back and forth between the text of the target file and the target drawing.
  • the text sentences related to the image identification of interest in the image identification can accurately obtain other information related to the image identification, which is convenient for users to efficiently read and understand the content of the drawings in the drawings page, which simplifies user operations and also It will not affect the user's reading of the content of the target drawing itself, and can meet the user's personalized needs, thereby further improving the user's experience.
  • This application also provides a specific client device, wherein the client device includes a network communication interface, a processor, and a memory, and the above structures are connected by internal cables, so that each structure can perform specific data interactions.
  • the network communication interface may be specifically used to obtain drawing data and text data of the target file, and to receive a display operation for the target drawing in the target file.
  • the processor may be specifically configured to display a page containing a target drawing in a target file, wherein the target file includes drawing data and text data, the drawing data includes an image identifier, and the text data Contains a keyword group, the target drawing includes an image identifier; receiving a first type of trigger operation, wherein the first type of trigger operation is used to control the display of the matching text of the image identifier contained in the target drawing In response to the first type of triggering operation, the matching text of the image identification contained in the target drawing is displayed on the page, wherein the matching text of the image identification includes a sentence or a description of the image identification phrase.
  • the memory may be specifically used to store a corresponding instruction program on which the processor is based.
  • the embodiment of the present specification also provides a computer storage medium based on the display method of the page of the drawings.
  • the computer storage medium stores computer program instructions, which are implemented when the computer program instructions are executed: the display contains the target file The target drawing page of the target drawing, wherein the target file includes drawing data and text data, the drawing data includes image identification, the text data includes a keyword group, and the target drawing contains image identification Receive a first type of trigger operation, wherein the first type of trigger operation is used to control the display of the matching text of the image identification contained in the target drawing; in response to the first type of trigger operation, in the page The matching text of the image identification contained in the target drawing is displayed, wherein the matching text of the image identification includes a sentence or phrase for describing the image identification.
  • This application also provides a method for displaying a text page.
  • FIG. 16 may include the following content:
  • S1601 Display a page containing target text in a target file, where the target file includes image data and text data, and the target text includes keyword groups;
  • S1603 Receive and respond to the trigger operation for the keyword group in the target text, and determine the triggered keyword group;
  • S1605 Display the matching drawings of the triggered keyword group on the page, wherein the matching drawings of the triggered keyword group include at least the drawings described by the triggered keyword group or the content in the drawings Object.
  • the above-mentioned trigger operation for the keyword group may specifically be triggered by a user clicking on the keyword group through the client device, or may be an input trigger by the user inputting a command sentence for the keyword group through the client device.
  • This specification does not limit the specific form or content of the above-mentioned trigger operation for the keyword group.
  • the above-mentioned matching drawings can be specifically understood as drawings containing image identifiers matching the keyword groups, that is, drawings containing the drawings indicated by the keyword groups or the content objects in the drawings.
  • the user when the user reads the text content in the text page displayed by the client device according to the text data of the target file, if the user is interested in one or several keyword groups in the text page, he wants to know the key Refer to Figure 17 for the picture information corresponding to the phrase. Click on the keyword group of interest to initiate a trigger operation. At this time, the matching picture of the corresponding keyword group will be displayed on the text page, that is, the corresponding keyword group will be displayed. The indicated drawing or the drawing of the content object in the drawing.
  • a matching drawing containing an image identifier matching the keyword group may be displayed in the form of a floating frame.
  • the manner of displaying matching drawings listed above is only a schematic illustration. During specific implementation, according to specific circumstances and user needs, other suitable methods may be used to display the matching drawings.
  • the target document may specifically include at least one of the following: patent document, thesis document, contract document, trademark document, etc.
  • the target files listed above are only schematic illustrations. During specific implementation, according to specific circumstances and user needs, other types of files can be introduced as the aforementioned target files. In this regard, this manual is not limited.
  • the method may also include the following content during specific implementation: sending an acquisition request for the target file; acquiring the drawing data and text of the target file Data, wherein the text data includes at least a keyword group, and related information such as the identification number and location information of the keyword group, and the drawing data includes at least image identification, and related information such as the identification number and location information of the image identification .
  • the matching drawings of the triggered keyword group can be determined based on the triggered keyword group, combined with the drawing data and text data of the target file, and then displayed on the page Matching picture of the triggered keyword group.
  • the matching drawing of the triggered keyword group is determined by combining the drawing data and text data of the target file.
  • the following content may be included: according to the text Data, determine the identity serial number of the triggered keyword group; determine the image identity whose identity serial number matches the identity serial number of the triggered keyword group as the associated image identity; according to the location information of the associated image identity Acquire the picture data containing the associated image identifier from the picture data as the matched picture data; and then generate the matched picture of the triggered keyword group according to the matched picture data.
  • the client device may first obtain, from the server, the image data and text data of the target file pre-processed by the server and containing the target file whose image identifier and keyword group have been identified and determined. It can also be that the client device first obtains the drawing data and text data of the target file provided by the server, and then performs corresponding processing on the above drawing data and text data respectively, and recognizes and determines the image identification in the drawing data and the text data.
  • the keyword group is referred to identify the image identification in the drawing data and the text data.
  • the above is determined based on the triggered keyword group, combined with the drawing data and text data of the target file
  • the server can also be responsible for the data processing process of matching drawings of the triggered keyword group. That is, the client device receives the user's trigger operation for the keyword group, and sends the identity serial number of the keyword group triggered by the user to the server. The server then determines the matching drawing corresponding to the triggered keyword group according to the above keyword group, and feeds it back to the client device. The client device displays the matching drawings in the text page displayed to the user. This can reduce the computational burden of the client and improve processing efficiency.
  • the text page display method receives and responds to the user's triggering operation on the keyword group in the displayed text page through the client device, and determines the triggered keyword group; then according to the obtained target file Matching drawings of the triggered keyword group are displayed on the attached text page to compare and display the drawings indicated by the keyword groups or the content objects in the drawings, so that users no longer You need to go back and forth in the drawings of the target file to find the drawings related to the keyword group of interest in the target text, so that users can efficiently read and understand the text content in the attached text side, thereby simplifying user operations and improving Improve the user experience.
  • the present application also provides a corresponding text page display device.
  • the device may specifically include the following structural modules:
  • the display module can be specifically used to display a page containing target text in a target file, where the target file includes image data and text data, and the target text includes keyword groups;
  • the receiving module can be specifically used to receive and respond to the trigger operation for the keyword group in the target text, and determine the triggered keyword group;
  • the display module may be specifically used to display the matching drawings of the triggered keyword group on the page, wherein the matching drawings of the triggered keyword group include at least the drawings described by the triggered keyword group or The content object in the picture.
  • This application also provides a specific client device, wherein the client device includes a network communication interface, a processor, and a memory, and the above structures are connected by internal cables, so that each structure can perform specific data interactions.
  • the network communication interface can be specifically used to obtain drawing data and text data of the target file.
  • the processor may be specifically used to display a page containing target text in a target file, where the target file includes image data and text data, and the target text includes keyword groups; receiving and responding to all The trigger operation of the keyword group in the target text determines the triggered keyword group; the matching drawing of the triggered keyword group is displayed on the page, wherein the matching drawing of the triggered keyword group is at least Include the drawings described by the triggered keyword group or the content objects in the drawings.
  • the memory may be specifically used to store a corresponding instruction program on which the processor is based.
  • the embodiment of the present specification also provides a computer storage medium based on the above-mentioned method for displaying pages of the drawings.
  • the computer storage medium stores computer program instructions, which are implemented when the computer program instructions are executed: the display contains the target file The target text page of the target text, wherein the target file includes drawing data and text data, and the target text contains keyword groups; receiving and responding to the trigger operation for the keyword groups in the target text, it is determined that the trigger is triggered The keyword group; the matching drawings of the triggered keyword group are displayed on the page, wherein the matching drawings of the triggered keyword group include at least the drawings or drawings described by the triggered keyword group The content object.
  • the present application also provides a method for generating a target drawing, wherein the method for generating the target drawing may include the following content when implemented:
  • S1 Acquire drawing data and text data of a target file, where the drawing data includes an image identifier, and the text data includes a keyword group;
  • S2 Receive and respond to a selection operation for the drawings in the target file, and generate a target drawing based on the drawing data and the text data, where the target drawing is the drawing selected by the selection operation, and the target attachment
  • the image includes an image identification and matching text of the image identification, and the matching text of the image identification is generated according to the image data and the text data.
  • the client device can respond to the use needs of different users in different application scenarios, and generate target drawings that are more friendly and intelligent for user interactions such as reading and use. That is, based on the original drawings included in the target file, the drawings are improved. Specifically, the corresponding text description or explanation of the image identification contained in the target drawing can be automatically added to the generated target drawing through the above-mentioned figure data containing the image mark and the text data containing the keyword group. That is to match the text. In this way, the target drawings that are generated and provided to users are more readable than the original drawings, contain more relevant information, and have higher data value.
  • the method may further include the following content: receiving a display request; and responding to the display request to display the target drawing.
  • the client device may first store the target drawing on the storage device of the client device after generating the target drawing.
  • the user wants to read it, after sending a display request, for example, the user After clicking the icon of the target drawing displayed by the client device; and then responding to the user's display request, showing the user the target drawing that carries matching text and has relatively higher readability and data value.
  • a display request for example, the user
  • responding to the user's display request showing the user the target drawing that carries matching text and has relatively higher readability and data value.
  • the method for generating a target drawing provided by this application generates a drawing containing matching text of the image identifier appearing in the target drawing based on the drawing data and text data of the target file, so that the generated
  • the target drawings are more readable, have higher data value, can meet the wider use needs of users, and improve the user experience.
  • This application also provides a specific client device, wherein the client device includes a network communication interface, a processor, and a memory, and the above structures are connected by internal cables, so that each structure can perform specific data interactions.
  • the network communication interface may be specifically used to obtain the drawing data and text data of the target file, wherein the drawing data includes an image identifier, and the text data includes a keyword group.
  • the processor may be specifically configured to receive and respond to a selection operation for a drawing in a target file, and generate a target drawing based on the drawing data and the text data, wherein the target drawing is selected by the selection operation A drawing, the target drawing contains an image mark and a matching text of the image mark.
  • the memory may be specifically used to store a corresponding instruction program on which the processor is based.
  • the embodiment of the present specification also provides a computer storage medium based on the above-mentioned method for displaying the page of the drawings.
  • the computer storage medium stores computer program instructions. When the computer program instructions are executed, it is realized: Get the drawings of the target file Data and text data, wherein the image data includes image identifiers, and the text data includes keyword groups; receiving and responding to the selection operation for the drawings in the target file, according to the image data and the text data A target drawing is generated, where the target drawing is a drawing selected by the selection operation, and the target drawing includes an image identifier and a matching text of the image identifier.
  • the embodiment of the present application also provides a method for displaying the drawing page in the situation where the client device involved is relatively simple.
  • the method can be applied to the above-mentioned business system (that is, the client device is included, and the method is used to provide users
  • the client device side of the platform that provides the target file query service may include the following content during specific implementation.
  • S1 Send a page display request for the target drawing in the target file.
  • S2 Receive target drawing data and matched text data; wherein the matched text data is generated based on the image identifier in the target drawing and the text data of the target file.
  • the above-mentioned target drawing can be specifically understood as the image data of the target file used to generate and display the target drawing in the target file requested by the user.
  • the above-mentioned matched text data can be specifically understood as a type of text data contained in the target file, which contains a keyword group corresponding to the image identifier in the target drawing, and can generate a matching of the image identifier.
  • the text data of the text can be specifically understood as a type of text data contained in the target file, which contains a keyword group corresponding to the image identifier in the target drawing, and can generate a matching of the image identifier.
  • the matching text of the aforementioned image identifier may be a text phrase, for example, a phrase used to describe the drawing indicated by the image identifier or the name of the content object in the drawing. It can also be a text sentence, for example, a descriptive sentence extracted from the text data of the target file on the attribute, use, function and other characteristics of the figure indicated by the image identifier or the content object in the figure, or based on The above-mentioned descriptive sentence is obtained through semantic recognition and contains a certain general text recognition result that contains the main content of the above-mentioned descriptive sentence.
  • the matching text listed above is only a schematic description. This specification does not limit the specific form and content of the matching text.
  • the client device may receive the above-mentioned target drawing data determined and fed back by the server in response to the page display request for the target drawing in the target file in a wired or wireless manner. And matching text data.
  • the client device can generate and show to the user that it can be more intelligent according to the target drawing data and the matched text data.
  • the drawing page that clearly displays the target drawing that is, in the displayed drawing page, not only will the corresponding target drawing selected by the user be displayed based on the target drawing data, but also based on the matched text data, Automatically display the text that matches the image identification contained in the target drawing on this page.
  • This application also provides another method for displaying pages with drawings.
  • This method can be specifically applied to the server side of the above-mentioned business system.
  • the method can specifically include the following content.
  • the above-mentioned server may be coupled with the client device in a wired or wireless manner for data interaction.
  • the server may receive the page display request for the target drawing in the target file sent by the client device in a wired or wireless manner according to the corresponding transmission protocol.
  • the above-mentioned target documents may specifically include at least one of the following: patent documents, dissertation documents, contract documents, trademark documents, and so on.
  • the target files listed above are only to better illustrate the embodiments of this specification.
  • the above-mentioned target file may also include other types of files. In this regard, this manual is not limited.
  • the server may perform corresponding processing on the text data and drawing data contained in the target file in a targeted manner in advance.
  • the image identifiers in the figure data of the target file and the keyword groups in the text data are respectively identified and determined, and the figure data and text data of the processed target file are obtained (that is, they contain the image identifiers that have been identified and determined, and the image The image data of the target file of the identified related information, and the text data of the target file containing the identified keyword group and the related information of the keyword group).
  • the server can respond to the above page display request, and by performing data analysis and other processing on the page display request, determine the target drawing in the target file requested by the client device to display, and then the target drawing can be obtained from the target
  • the target drawing data used to generate the target drawing is retrieved from the drawing data of the file.
  • the server may further perform corresponding identification processing on the target drawing data, such as searching for whether there is an image identification serial number in the target drawing data, etc. To determine the image identification contained in the target drawing data.
  • the above-mentioned matched text data can be specifically understood as a kind of data generated based on the data in the text data of the target file that matches the image identifier contained in the target drawing data.
  • the above-mentioned obtaining matching text data based on the image identification may include the following content: determining the identification number of the image identification; retrieving the identification number from the text data of the target file A keyword group matching the identity number of the image identification; obtaining a text sentence containing the keyword group from the text data; determining the matched text data according to the text sentence containing the keyword group.
  • a keyword group whose identity serial number is the same as the identity serial number of the image identification or whose difference value is less than the preset difference threshold can be retrieved from the text data during specific implementation as the matched Keyword group.
  • the identity serial number of the image identification " Figure 1" is "f1". Based on the identity serial number, an identical identity serial number "f1" can be retrieved from the text data of the target file as the matching identity serial number, and the identity The keyword group "Figure 1" corresponding to the serial number is determined to be a keyword group matching the image identifier " Figure 1".
  • the identity serial number of the image identification "101" is "label2_101”. According to the identity serial number, a similar identity serial number "101” can be retrieved from the text data of the target file as the matching identity serial number, and the identity The keyword group "component 101" corresponding to the serial number is determined as a keyword group matching the image identifier "101".
  • the above-mentioned method for determining the matched keyword group is only a schematic illustration. During specific implementation, other suitable methods can also be used to determine the matching keyword group of the image identifier according to the specific situation.
  • the matching keyword group can be efficiently found in the text data according to the position information of the keyword group corresponding to the identity serial number. Furthermore, sentence separators (such as period, semicolon, etc.) can be searched forward and backward for the matched keyword group in the text data to find the text sentence that is determined to contain the keyword group from the text data.
  • sentence separators such as period, semicolon, etc.
  • the text sentence including the keyword group can be directly used as the matched text data for the image identification. It is also possible to use only the combination of the noun and the keyword group connected to the keyword group in the text sentence as the matched text data. If the text sentence containing the keyword group is relatively complicated, the semantic recognition of the text sentence containing the keyword group can also be performed, for example, by inputting a trained semantic recognition model to obtain the corresponding semantic recognition result. Furthermore, the semantic recognition result can be used as matched text data and so on.
  • the above-mentioned method for determining matching text data is only a schematic illustration. During specific implementation, according to specific application scenarios and processing requirements, other suitable methods may also be used to obtain the above-mentioned matched text data. In this regard, this manual is not limited.
  • the text sentence containing the keyword group after the text sentence containing the keyword group is determined, it can be further determined based on the non-numeric characters that make up the keyword group that the image identifier corresponding to the keyword group indicates that the entire drawing (for example, attached Figure serial number), or a certain content object in the figure (for example, a certain component or module in the figure). Specifically, for example, if the non-numeric characters that make up the keyword group are "figure", "fig.”, or "drawing", etc., it can be determined that the image identifier corresponding to the keyword group is an identifier for indicating the serial number of the drawing Information; otherwise, it is the identification information used to indicate the content object in the drawing.
  • the above-mentioned drawing types may specifically include at least one of the following drawing types: plan view, perspective view, oblique view (or perspective view), cross-sectional view, schematic view, exploded view, enlarged view (or partial view) , Concept map or example map, etc.
  • drawing types may specifically include at least one of the following drawing types: plan view, perspective view, oblique view (or perspective view), cross-sectional view, schematic view, exploded view, enlarged view (or partial view) , Concept map or example map, etc.
  • the types of drawings listed above are only schematic illustrations. During specific implementation, other types of drawings may be included according to specific conditions and application scenarios. In this regard, this manual is not limited.
  • the figure type of the figure can be subsequently displayed to the user as a matching text.
  • the above-mentioned view types may specifically include at least one of the following: left view, right view, front view, back view, top view, bottom view, and so on.
  • the above-listed view types are only schematic illustrations. During specific implementation, other view types may be included according to specific conditions and application scenarios. In this regard, this manual is not limited.
  • the view type of the drawing may be subsequently displayed to the user as a matching text. It is also possible to establish a corresponding trigger link according to the relationship between the view types of different drawings. For example, when the user is looking at the front view, if the user is interested in the left view, he can click on the left side of the front view to issue a trigger, and then the client can respond to the user’s trigger and jump to the view corresponding to the front view The drawings whose type is the left view are shown to the user.
  • S4 Send the target drawing data and the matched text data, where the target drawing data and the matched text data are used by the client device to generate and display a drawing page containing the target drawing,
  • the page also contains the matching text of the image identification contained in the target drawing, and the matching text of the image identification contained in the target drawing is generated according to the matched text data.
  • the server may send the target drawing data and the matching text data to the client in a wired or wireless manner. equipment.
  • the client device can generate and display to the user a drawing page containing the target drawing that the user wants to query and understand based on the above-mentioned target drawing data, and at the same time, according to the matched text data, display and
  • the matching text matched by the image identification in the target drawing enables the user to obtain the relevant information in the target drawing more conveniently and efficiently, and improve the user's practical experience.
  • the client device may redraw a display page through HTML Canvas according to the target drawing for the page display request and the matched text data and display it for the user to read. It can also be that the server first redraws a display page through HTML Canvas according to the target drawing and the matched text data, and then sends the display page to the client device, and the client device displays it to the user; or through a patent
  • the text layout algorithm displays the changed data, and then displays the page containing the matching text of the image identification; you can also display the page containing the matching text of the image identification for users to read by opening the operation in the setting of the reading tool such as the browser and many more.
  • the present application also provides a method for displaying the drawing page applied to the business system including the server and the client device.
  • the method may specifically include the following content.
  • S1 The client device sends a page display request for the target drawing in the target file to the server;
  • the server obtains target drawing data from the drawing data of the target file; determines and obtains matching text from the text data of the target file according to the image identifier in the target drawing data Data; sending the target drawing data and matching text data to the client device;
  • the client device displays the page containing the target drawing according to the target drawing data and the matched text data, and displays the matching text of the image identifier contained in the target drawing on the page.
  • the server and the client device can respond to the user's drawing page display request, and intelligently display the drawing of the target drawing that contains the matching text content and meets the user's requirements for the user.
  • the page is convenient for users to read and understand the target drawings, so that users no longer need to search the text content of the target file for text sentences corresponding to the image identifiers. This simplifies user operations and improves user experience.
  • Figure 20 can be specifically understood as a column page (for example, a corresponding graphic-text display page), which is equivalent to setting two mutually independent windows on the same display page :
  • the first window and the second window where the page displayed in each window and the page displayed in the other window may not be related to each other, or may be linked to each other.
  • the user can read the text content or the content of the drawings at different positions of the target file at the same time, or read the content of the drawings and the text of the target file at the same time.
  • the left side column page (ie the second window) of the page displayed by the client device can be used to read the target text of the target file;
  • the right column page (ie the first window) to read the target drawing of the target file. That is, in the same complete page displayed by the client device, the target drawing and target text of the target file can be simultaneously displayed to the user through the first window and the second window independent of each other in the display page.
  • this application also provides a method for displaying a page, which can be specifically applied to the client device side.
  • the method may include the following:
  • S2101 Display a page containing target text and target drawings in a target file, where the target file includes drawing data and text data, and the target drawing is displayed in the first window of the page, so The target text is displayed in a second window on the page, the target drawing includes an image mark, the target text includes a keyword group, and the keyword group is used to indicate the drawing in the text data, And/or, the character combination of the content object in the drawing;
  • S2102 Receive a trigger operation for the image identifier in the first window, and/or a trigger operation for the keyword group in the second window;
  • S2103 In the case of receiving a trigger operation for the image identifier in the first window, in response to the trigger operation for the image identifier in the first window, determine the triggered image identifier;
  • S2104 Display the matching text of the triggered image identification in the second window, where the matching text of the triggered image identification includes at least a sentence or phrase used to describe the triggered image identification;
  • S2105 In the case of receiving a trigger operation for the keyword group in the second window, in response to the trigger operation for the keyword group in the second window, determine the triggered keyword group;
  • S2106 Display the matching drawings of the triggered keyword group in the first window, where the matching drawings of the triggered keyword group include at least the drawings described by the triggered keyword group or in the drawings Content object.
  • the client device can process the target file according to the drawing data and text data provided by the server or processed by itself, and according to the drawing data and text data of the target file, and the user's selection, respectively.
  • the target drawing is displayed in the first window, and the target text is displayed in the second window.
  • the client device can receive and respond to the user's triggering operation on the image identification in the first window, and jump the second window to the page containing the matching text. And in the second window, the matching text corresponding to the triggered image identifier in the first window is displayed in a display manner different from other texts.
  • the above-mentioned matching text may specifically be a keyword group corresponding to the triggered image identifier in the first window (that is, an associated keyword group that is the same as the identity sequence number of the triggered image identifier), or text containing the corresponding keyword group Statement.
  • the client device can also receive and respond to the user's triggering operation on the keyword group in the second window, and jump the first window to the page containing the matching drawing. And in the first window, in a display manner that is different from other image identifiers, the image identifiers corresponding to the keyword groups triggered in the second window are displayed.
  • the above-mentioned matching drawing may specifically be a drawing containing an image identifier corresponding to the triggered keyword group in the second window (ie, an image identifier that is the same as the identity number of the triggered keyword group).
  • the target document may specifically include at least one of the following: patent documents, dissertation documents, contract documents, trademark documents, etc.
  • the matching text of the triggered image identifier is displayed in the second window.
  • it may include the following content: determine the identity serial number of the triggered image identifier according to the drawing data of the target file; According to the text data of the target file, the keyword group whose identity serial number matches the identity serial number of the triggered image identifier is retrieved as the associated keyword group; the text containing the associated keyword group is displayed in the second window as The matching text of the triggered image ID.
  • the method may further include the following content during specific implementation:
  • the preset display mode is to display the associated keyword group in the matching text of the triggered image identification, wherein the preset display mode is at least one of the following: display in a highlighted manner; Display in bold font; display in preset font color; display in italic font; display in font size larger.
  • displaying the matching text of the triggered image identification in the second window may also include: in the case of retrieving multiple associated keyword groups, according to the The location information, displaying in the second window the text containing the associated keyword group at the top of the location information as the matched text of the triggered image identification.
  • the method may further include: displaying multiple associated keyword groups; The phrase selection operation determines the associated keyword group selected by the user; the text containing the associated keyword group selected by the user is displayed in the second window as the matched text of the triggered image identification.
  • related information such as the location information of the related keyword groups can also be displayed at the same time for the user's reference, so that the user can better select the related key to be displayed. phrase.
  • the text containing the associated keyword group is displayed in the second window as the matching text of the triggered image identification.
  • it may include the following content: controlling the second window to jump to A page containing the matching text of the triggered image identification; and in the second window, the associated keyword group is displayed in the matching text of the triggered image identification through a preset display mode.
  • the preset display mode includes at least one of the following: display in a highlighted manner; display in a bold font; display in a preset font color; display in an italic font; Display in a way that the font size becomes larger, etc.
  • the client device can also respond to the above triggering operation by retrieving multiple triggered matching texts with the image identifier "100" in the target text, and then can use the target text displayed in the second window to
  • the preset display methods such as highlighting show multiple triggered matching texts of the image mark "100".
  • the target text displayed in the second window can also only be displayed in a preset display mode such as highlighting to display the matching of multiple triggered image identifiers "100" The matching text with the triggered image identifier "100" at the top of the target text in the text.
  • the method may further include the following content during specific implementation: At the preset display position in the first window, the associated keyword group of the triggered image identification is displayed, wherein the preset display position includes the direction away from the center point of the target drawing and is in line with the The distance of the triggered image mark is less than or equal to the preset first distance threshold.
  • the above-mentioned displaying the associated keyword group of the triggered image identification may include the following content in specific implementation: displaying characters of less than a preset length in the matching text of the triggered image identification ; Receive a trigger operation for characters less than a preset length in the matched text of the triggered image identification; in response to the trigger operation, display the matched text of the triggered image identification.
  • the matching drawing of the triggered keyword group is displayed in the first window.
  • it may include the following content: determine the identity serial number of the triggered keyword group according to the text data of the target file; According to the attached image data of the target file, the image identifier with the identity sequence number matching the identity sequence number of the triggered keyword group is retrieved as the associated image identifier; the associated image identifier is displayed in the first window The picture of is used as the matching picture of the triggered keyword group.
  • the method may also include the following content when the method is specifically implemented : Through a preset display mode, the associated image identifier is displayed in the matching drawing of the triggered keyword group, wherein the preset display mode is at least one of the following: displayed in a highlighted manner ; Display in a bold font; display in a preset font color; display in an italic font; display in a larger font size, etc.
  • the method may further include the following content during specific implementation: obtaining drawing data and text data of the target file, where: The drawing data includes image identifiers, and the text data includes keyword groups; receiving display operations for the target drawing in the target file, and display operations for the target text in the target file; responding to the target drawing The display operation is to display the target drawing in the first window of the page according to the drawing data of the target file; in response to the display operation for the target text, in the page according to the text data of the target file The second window shows the target text.
  • the image identifier contained in the drawing data may be specifically determined in the following manner: the drawing data containing a preset page header is filtered out from the drawing data; Set the first range area in the figure data of the page header to perform image data recognition, and use the recognized first character group as the image identification, wherein the first character group includes: numbers and characteristic characters of the characterization map Combination; according to the identified first character group, the second range area is determined in the picture data containing the preset header; the second range area in the picture data containing the preset header Perform image data recognition in the range area, and use the recognized second character group as the image identifier, where the second character group includes: the distance from the characteristic line in the drawing data containing the preset header A combination of numbers and/or letters less than the preset second distance threshold, the characteristic line includes: a straight line carrying an arrow, and/or a curve carrying an arrow.
  • the method may further include the following content when the method is specifically implemented: according to the image identification, the identity serial number of the corresponding image identification is generated, And the identity serial number of the image identification is stored in the drawing data.
  • the above-mentioned determination of the keyword group in the text data may include the following content during specific implementation: searching for section titles in the text data; dividing the text data into multiple sections according to the section titles A text area; a text area that meets the preset requirements is selected from the multiple text areas as the third range area, where the third range area at least includes the section titled description of the drawing or BRIFE DESCRIPTION OF DRAWINGS Text area; according to the third range area in the text data, determine the keyword group in the text data.
  • the method may further include the following content: using a preset area division model to The text data is processed to determine a third range area from the text data, wherein the preset area division model is obtained by pre-training a neural network using sample text data.
  • the keyword group in the text data is determined according to the third range area in the text data.
  • it may include the following content: text the third range area in the text data
  • the third character group is recognized, wherein the third character group includes at least one of the following: a combination of numbers and nouns, a combination of letters and nouns, a combination of numbers and characters in a characterization map;
  • the third character group serves as the keyword group.
  • the method may further include the following content during specific implementation: counting each of the plurality of third character groups with similar semantics The appearance frequency of the third character group in the text data; the third character group with the highest appearance frequency is determined as the keyword group.
  • the method may further include the following content when implementing the method: counting the text of each third character group in the multiple third character groups The number of characters contained in the paragraph; the third character group with the least number of characters contained in the text paragraph is used as the keyword group.
  • the third character group may further include: a combination of a plurality of digits and separator symbols, and data processing is performed on the third range area in the text data, and it is recognized that a plurality of digits and After the third character group of the combination of the separator symbol, when the method is specifically implemented, the method may further include the following: identifying the type of the separator symbol; according to the split rule matching the type of the separator symbol, dividing the multiple Splitting a number into a plurality of digital characters to be spliced; splicing the plurality of digital characters to be spliced with the nouns, letters or characters of the characterization graph in the third character group respectively to obtain a plurality of keyword groups.
  • the method may further include the following content during specific implementation: according to the keyword group, the identity serial number of the corresponding keyword group is generated, and the The identity serial number of the keyword group is stored in the text data.
  • the page display method provided by this application uses the client device to first display the target drawing selected by the user in the first window and the second window of the same page according to the drawing data and text data of the target file. And the target text; and then by receiving and responding to the user's triggering operation for the image identification in the first window, the text displayed in the second window on the same page displays the matching text corresponding to the triggered image identification; or, again Receive and respond to the user's triggering operation for the keyword group in the second window, and display the matching drawings corresponding to the triggered keyword group in the drawings displayed in the first window on the same page; thereby achieving intelligent and efficient implementation
  • the linkage of graphics and text makes it easier for users to combine the text data of the target file with the data of the drawings to read and understand the specific content of the target file, which further improves the user experience.
  • the present application also provides a page display device, which can be referred to as shown in FIG. 25.
  • the device may specifically include the following structural modules:
  • the display module can be specifically used to display a page containing target text in a target file and a target drawing, where the target file contains drawing data and text data, and the target drawing is the first page in the page. Displayed in a window, the target text is displayed in a second window on the page, the target drawing includes an image mark, and the target text includes a keyword group;
  • the receiving module may be specifically configured to receive a trigger operation for the image identifier in the first window
  • the determining module may be specifically used to determine the triggered image identifier in response to the trigger operation for the image identifier in the first window;
  • the display module may be specifically used to display the matching text of the triggered image identification in the second window, wherein the matching text of the triggered image identification includes at least a sentence for describing the triggered image identification Or phrase.
  • the embodiment of this specification also provides a computer storage medium based on the above-mentioned page display method.
  • the computer storage medium stores computer program instructions, which are realized when the computer program instructions are executed: displaying the target contained in the target file A page of text and a target drawing, wherein the target file includes drawing data and text data, the target drawing is displayed in the first window on the page, and the target text is displayed in the first window of the page.
  • the target drawing includes an image identifier
  • the target text includes a keyword group
  • the keyword group is used to indicate the drawing in the text data, and/or the content object in the drawing Character combination
  • receiving a trigger operation for the image identifier in the first window, and/or a trigger operation for the keyword group in the second window receives a trigger operation for the image identifier in the first window
  • the triggered image identifier is determined; the matching text of the triggered image identifier is displayed in the second window, wherein the triggered image identifier
  • the matching text of the image identification of the at least includes a sentence or phrase used to describe the triggered image identification; in the case of receiving a trigger operation for the keyword group in the second window, respond to the key in the second window
  • the phrase trigger operation determines the triggered keyword group; the matching picture of the triggered keyword group is displayed in the first window, wherein the matching picture of the triggered keyword
  • the embodiment of this specification also provides a client device based on the above-mentioned page display method, wherein the client device includes a processor and a memory for storing executable instructions of the processor, and when the processor executes the instructions Realizing display of a page containing target text and target drawings in a target file, wherein the target file includes drawing data and text data, and the target drawing is displayed in the first window of the page, the The target text is displayed in the second window of the page, the target drawing includes an image identifier, the target text includes a keyword group, and the keyword group is used to indicate the drawing in the text data, and /Or, the character combination of the content object in the drawing; receiving the trigger operation for the image identification in the first window, and/or, the trigger operation for the keyword group in the second window; In the case of the trigger operation of the image identification in the first window, the triggered image identification is determined in response to the trigger operation for the image identification in the first window; the triggered image identification is displayed in the second window Matching text, wherein the matching text of
  • the Momo patent search website in order to meet the diverse needs of users, often provide a variety of different tab pages for users to choose.
  • the user can select the page type that suits him by clicking the label at the top of the page to display the target file to be consulted.
  • the user can select the "patent details" tab, and the client device will use the tab page corresponding to the patent details tab to show the user the drawings and text in the patent document.
  • the user can also select “Image and Text Comparison”, and the client device will use the label page corresponding to the image and text label to show the user the drawings and text of the patent document.
  • the user may initially only want to have a rough look at the content of the drawings in the patent document, so first choose to read the drawings of the patent document in the patent details tab page.
  • I read a certain image mark in the drawings on the label page of the patent details displayed on the client device I suddenly became interested in this image mark, and temporarily wanted to switch to the label page that uses the image-text comparison to use the text and attachments.
  • Use the method of image comparison to study the specific characteristics of the content object indicated by the image mark more carefully.
  • the client device can select the label of the graphic and text comparison, so that the client device can switch to display the patent document through the label page of the graphic and text comparison.
  • the client device when the client device switches to display a new tab page, it often illegally retains the display record in the previous tab page. That is, in the initial graphic-text tab page displayed to the user by the client device in response to the user's selection, the first window of the graphic-text tab page initially displays the initial part of the drawings in the patent document by default (For example, Figure 1), at the beginning of the second window, the initial part of the text in the patent file will be displayed by default, instead of being intelligently displayed directly in the first window of the switched graphic-text tab page If you have just browsed the drawings that contain the image identification of interest in the tab page of the previous patent details, it will not directly display in the second window that contains the keyword group corresponding to the image identification of interest. text.
  • Figure 1 the initial part of the drawings in the patent document by default
  • the user must search the first window and the second window in the tab page of the newly switched graphic and text respectively to find the image that contains the image identification of interest, and contains the key corresponding to the image identification.
  • the text of the phrase is compared and read, the operation is cumbersome and troublesome, and the user experience is poor.
  • the embodiment of this specification also provides another page display method. You can refer to FIG. 26 and the related content shown in FIG. 27. When the method is specifically implemented, it may include the following content:
  • S2601 Display the first tab page containing the target drawing in the target file, where the target drawing includes an image identifier, the target file includes drawing data and text data, and the drawing data includes There is an image identifier, the text data includes a keyword group, and the keyword group is a combination of characters in the text data used to indicate the drawing, and/or the content object in the drawing;
  • S2603 Receive and respond to the trigger operation for the image identifier in the first tab page, and determine and record the triggered image identifier in the first tab page;
  • S2607 Display in the second tab page that the target file contains the text of the keyword group that matches the triggered image identifier in the first tab page as the target text.
  • the client device receives and responds to the trigger operation for the image identification in the first tab page, and when it determines the triggered image identification in the first tab page, it will automatically record it through the storage resources of the client device Store the latest triggered image ID. During specific recording, the client device may use the newly recorded triggered image identification to replace the previously recorded and stored triggered image identification for subsequent use.
  • the above-mentioned first tab page and second tab page can be specifically understood to be different types of tab pages for displaying target files selected by the platform that provides users with target file query and reading services.
  • the aforementioned tag pages may include: patent details tag pages, graphic-text comparison tag pages, PDF original text tag pages, and so on. This manual does not limit the specific types of the above-mentioned tab pages.
  • the target file contains the text of the keyword group that matches the triggered image identification in the first tab page as the target text.
  • the specific implementation can be understood as the customer
  • the end device displays the new second tab page to the user, it will display the text containing the keyword group matching the triggered image identification in the first tab page as the target text in the initial second tab page. In this way, it is possible to better fit the user's behavior habits, avoid the user to search for the text or picture content of interest in the new second tab page, and improve the user experience.
  • the first tab page may specifically include a picture detail page, etc.
  • the second tab page may specifically include a picture text comparison page and the like.
  • first tab page and the second tab page listed above are only schematic illustrations. During specific implementation, it can also be other types of tab pages according to specific conditions and scene requirements. In this regard, this manual is not limited.
  • the second tab page when the second tab page is a graphic comparison page, the second tab page includes a first window and a second window, wherein the first window is used to display the target attachment.
  • the second window is used to display target text.
  • the method is specifically implemented , May also include the following content: displaying the target text in the second window of the second tab page.
  • the method when the text of the target file containing the keyword group matching the triggered image identification in the first tab page is displayed in the second tab page as the target text, the method is specifically implemented At this time, the following content may also be included: displaying in the first window of the second tab page the drawing in the target file containing the image identification triggered in the first tab page as the target drawing.
  • the page display method provided by this application records the image identification triggered by the user in the first tab page through the client device, and displays the second tab page when receiving and responding to the user's tab page switching operation According to the recorded image identification triggered by the user in the first tab page, the corresponding target text and/or target drawing can be determined as the initial displayed page content to be displayed on the second tab page, so as to be more in line with the user The behavior and usage habits of the users further improve the user experience.
  • This application also provides a page display device, which may include the following structural modules during specific implementation:
  • the first display module can be specifically used to display the first tab page containing the target drawing in the target file, wherein the target drawing includes an image identifier;
  • the determining module may be specifically configured to receive and respond to the trigger operation for the image identifier in the first tab page, and determine and record the triggered image identifier in the first tab page;
  • the second display module can be specifically used to respond to the switching operation of the tab page and display the second tab page containing the target text in the target file;
  • the display module can be specifically configured to display, in the second tab page, the text in the target file that contains the keyword group that matches the triggered image identifier in the first tab page as the target text.
  • the embodiment of the present specification also provides a client device based on the above-mentioned page display method, including a processor and a memory for storing executable instructions of the processor.
  • the display includes The first tab page of the target drawing in the target file, where the target drawing contains an image identifier, the target file contains drawing data and text data, and the drawing data contains an image identifier, so
  • the text data includes a keyword group, which is a combination of characters in the text data used to indicate the drawing and/or the content object in the drawing; receiving and responding to the image identification in the first tab page To determine and record the triggered image identification in the first tab page; respond to the switching operation of the tab page, display the second tab page containing the target text in the target file; display in the second tab page
  • the target file contains the text of the keyword group that matches the triggered image identifier in the first tab page as the target text.
  • the embodiment of this specification also provides a computer storage medium based on the above-mentioned page display method.
  • the computer storage medium stores computer program instructions, which are realized when the computer program instructions are executed: displaying the target contained in the target file The first tab page of the drawing, wherein the target drawing includes an image identifier, the target file includes image data and text data, the drawing data includes an image identifier, and the text data includes A keyword group, the keyword group is a combination of characters in the text data used to indicate the drawing, and/or the content object in the drawing; receiving and responding to the trigger operation for the image identification in the first tab page, determining And record the image identification triggered in the first tab page; in response to the switching operation of the tab page, display the second tab page containing the target text in the target file; in the second tab page, it is displayed that the target file contains The text of the matching keyword group identified by the triggered image in the first tab page is used as the target text.
  • controllers in addition to implementing the controller in a purely computer-readable program code manner, it is entirely possible to program the method steps to make the controller use logic gates, switches, application specific integrated circuits, programmable logic controllers and embedded The same function can be realized in the form of a microcontroller, etc. Therefore, such a controller can be regarded as a hardware component, and the devices included in the controller for realizing various functions can also be regarded as a structure within the hardware component. Or even, the device for realizing various functions can be regarded as both a software module for realizing the method and a structure within a hardware component.
  • program modules include routines, programs, objects, components, data structures, classes, etc. that perform specific tasks or implement specific abstract data types.
  • This specification can also be practiced in distributed computing environments, in which tasks are performed by remote processing devices connected through a communication network.
  • program modules can be located in local and remote computer storage media including storage devices.

Abstract

一种附图页面的展示方法和装置、文本页面的展示方法和装置。其中,附图页面的展示方法通过展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,目标附图中包含有图像标识(100,101,102) (S301);接收并响应针对目标附图中的图像标识(100,101,102)的触发操作,确定出被触发的图像标识(100,101,102) (S303);并根据目标文件的附图数据和文本数据,在页面中显示出被触发的图像标识(100,101,102)的匹配文本,其中,所述被触发的图像标识(100,101,102)的匹配文本至少包括用于描述所述被触发的图像标识(100,101,102)的语句或词组(S305)。从而可以使用户不再需要自己来回地在目标文件的文本中翻找与目标附图中的所感兴趣的图像标识(100,101,102)相关的文本语句,就能准确地获取到与该图像标识(100,101,102)的其他相关信息,方便用户高效地阅读、理解附图页面中的附图内容,简化了用户操作,提高了用户的使用体验。

Description

附图页面的展示方法和装置、文本页面的展示方法和装置
本申请要求2019年03月18日递交的申请号为201910203855.7、发明名称为“附图页面的展示方法和装置、文本页面的展示方法和装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本说明书属于互联网技术领域,尤其涉及附图页面的展示方法和装置、文本页面的展示方法和装置。
背景技术
网站平台通常会将用户关注或想查询的文件的具体内容通过网页页面展示给用户,供用户查询、使用。其中,有些文件(例如专利文件等)中会同时包含有附图内容和文本内容。而上述附图内容和文本内容往往又是相互关联的。例如,在文本内容中,某一段文本语句可能是对附图内容中的某个图像标识所指示的内容对象的具体解释。但受限于文件的格式或者排版等限制,上述文件中的附图内容和文本内容大多是相互分开、相互独立的。相应的,导致在网页页面上显示时,上述文件中与附图对应的文本内容的显示往往和所描述的附图的显示是分开、割裂的。此外,对于诸如专利文件等法律性文件,为了尽量保证权利人的权利范围最大化,同时也为了申请时降低被检索到的风险,上述文件在撰写时所使用的文本词句通常也会相对比较晦涩。上述多种原因导致用户在阅读、理解上述相关文件内容时相对比较麻烦,不够方便,难以理解。
例如,可以参阅图1所示,用户A在某专利检索网站上检索到了一篇感兴趣的专利文件。用户A想要先浏览下该专利文件中的说明书附图,以便先对该专利文件所涉及到的技术方案有个整体性的了解。具体的,在该网站展示给用户A的包含有附图的页面中,只显示出了专利文件的原附图中所包含的内容:附图编号、附图中内容对象,以及指示附图中内容对象的图标。这时如果用户A注意到附图1中的图标为101的部件,想要进一步了解下该部件具体是什么,有什么用。但基于现有的附图页面的展示方法所展示的附图页面,用户是无法直接从当前所展示的附图页面中获取到除该附图以外该部件的其他相关信息的(例如该部件的名称等)。这时,用户只能通过繁琐的操作自行从当前展示有附图内容的页面跳转到展示有该文件的文本内容的页面中,再自行逐字逐句地在展示有文本内容的页面中搜索寻找涉及到部件101的相关语句描述,以便获取、了解到该部件的其他相关信息。
综上可见,现有的附图页面的展示方法在展示附图页面时往往不够智能,无法同时展示出文件文本中与附图相关联的文本内容,导致用户阅读、理解时,相对较麻烦、不方便,用户使用体验相对较差。
针对上述问题,目前尚未提出有效的解决方案。
发明内容
本说明书目的在于提供一种附图页面的展示方法和装置、文本页面的展示方法和装置,以达到在附图页面中向用户展示用户所选择的目标附图的同时,响应用户对目标附图中图像标识的触发,自动确定并展 示出被触发的图像标识的匹配文本,与用户进行交互,方便用户阅读、理解文件的附图内容,提高了用户的使用体验。
本说明书提供的一种附图页面的展示方法和装置、文本页面的展示方法和装置是这样实现的:
一种附图页面的展示方法,包括:展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图中包含有图像标识;接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识;根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
一种附图页面的展示方法,包括:展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述目标附图中包含有图像标识;接收第一类触发操作,其中,所述第一类触发操作用于控制显示所述目标附图中所包含的图像标识的匹配文本;响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,其中,所述图像标识的匹配文本包括用于描述图像标识的语句或词组。
一种文本页面的展示方法,包括:展示包含有目标文件中的目标文本的页面,其中,所述目标文件包括附图数据和文本数据,所述目标文本中包含有关键词组;接收并响应针对所述目标文本中的关键词组的触发操作,确定出被触发的关键词组;在所述页面中显示出被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
一种附图页面的展示装置,包括:展示模块,用于展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图中包含有图像标识;确定模块,用于接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识;显示模块,用于根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
一种附图页面的展示装置,包括:展示模块,用于展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述目标附图中包含有图像标识;确定模块,用于接收第一类触发操作,其中,所述第一类触发操作用于控制显示所述目标附图中所包含的图像标识的匹配文本;显示模块,用于响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,其中,所述图像标识的匹配文本包括用于描述图像标识的语句或词组。
一种文本页面的展示装置,包括:展示模块,用于展示包含有目标文件中的目标文本的页面,其中,所述目标文件包括附图数据和文本数据,所述目标文本中包含有关键词组;接收模块,用于接收并响应针对所述目标文本中的关键词组的触发操作,确定出被触发的关键词组;显示模块,用于在所述页面中显示出被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
本说明书提供的一种附图页面的展示方法和装置、文本页面的展示方法和装置,通过客户端设备接收并响应用户针对附图页面中图像标识的触发操作,确定出被触发的图像标识;再根据所获取的目标文件的附图数据和文本数据,在所述附图页面中显示出被触发的图像标识的匹配文本,以对图像标识所指示的附 图或者附图中的内容对象进行描述说明,使得用户不再需要自己来回地在目标文件的文本中翻找与目标附图中的所感兴趣的图像标识相关的文本语句,就能准确地获取到与该图像标识相关的其他信息,方便用户高效地阅读、理解附图页面中的附图内容,从而简化了用户操作,提高了用户的使用体验。
附图说明
为了更清楚地说明本说明书实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本说明书中记载的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。
图1是基于现有的附图页面的展示方法的一种实施例的示意图;
图2是基于业务系统为用户进行附图页面展示的一种实施例的示意图;
图3是本申请提供的附图页面的展示方法的流程的一种实施例的示意图;
图4是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图5是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图6是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图7是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图8是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图9是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图10是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图11是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图12是本申请提供的附图页面的展示装置的结构组成示意图;
图13是本申请提供的客户端设备的结构组成示意图;
图14是本申请提供的附图页面的展示方法的流程的一种实施例的示意图;
图15是在一个场景示例中应用本申请提供的附图页面的展示方法的一个实施例示意图;
图16是本申请提供的文本页面的展示方法的流程的一种实施例的示意图;
图17是在一个场景示例中应用本申请提供的文本页面的展示方法的一个实施例示意图;
图18是本申请提供的文本页面的展示装置的结构组成示意图
图19是本申请提供的应用于业务系统的附图页面的展示方法的流程的一种实施例的示意图;
图20是在一个场景示例中应用本申请提供的页面的展示方法的一个实施例示意图;
图21是本申请提供的页面的展示方法的流程的一种实施例的示意图;
图22是在一个场景示例中应用本申请提供的页面的展示方法的一个实施例示意图;
图23是在一个场景示例中应用本申请提供的页面的展示方法的一个实施例示意图;
图24是在一个场景示例中应用本申请提供的页面的展示方法的一个实施例示意图;
图25是本申请提供的页面的展示装置的结构组成示意图;
图26是本申请提供的页面的展示方法的流程的一种实施例的示意图;
图27是在一个场景示例中应用本申请提供的页面的展示方法的一个实施例示意图;
图28是本申请提供的一个实施例示意图;
图29是本申请提供的一个实施例示意图;
图30是本申请提供的一个实施例示意图。
具体实施方式
为了使本技术领域的人员更好地理解本说明书中的技术方案,下面将结合本说明书实施例中的附图,对本说明书实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本说明书一部分实施例,而不是全部的实施例。基于本说明书中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都应当属于本说明书保护的范围。
本申请提供的附图页面的展示方法,可以应用于包含有服务器(例如,为用户提供目标文件查询服务的平台的后台处理服务器)和客户端设备的业务系统中。可以参阅图2所示,基于该业务系统,用户可以通过客户端设备向服务器发送所感兴趣的文件(即目标文件)的数据获取请求,得到该文件的附图数据和文本数据,其中,该文件的附图数据和文本数据具体可以是服务器事先处理好的(在另一个实施例中也可以是客户端设备处理好的),分别包含有文件的附图中的图像标识,和文本中的关键词组,以及相关信息(例如图像标识的身份序号、位置信息,以及关键词组的身份序号、位置信息等等)的数据。进而客户端设备可以根据上述附图数据和文本数据,响应用户在客户端设备上具体操作,在附图页面上展示出用户感兴趣、关注的附图(即目标附图)的同时,还可以根据用户需要展示出附图中的图像标识的匹配文本,以便可以对附图中图像标识所指示的附图整体,或者附图中的某个内容对象,例如附图中的零部件或者模块等,在附图页面中进行对应描述、说明。这样在这一个附图页面上,用户就可以方便、高效地了解到该附图原本的内容(例如附图中具体图形结构等),以及该附图中原本内容没有的,而是记载在文本中的与附图中的各个图像标识相关的文本信息,从而可以方便用户更好地阅读、理解文件中附图,提高用户的使用体验。
在本实施例中,上述服务器具体可以是一种应用于网站平台一侧,能够实现数据传输、数据处理等功能的后台服务器。具体的,所述服务器可以为一个具有数据运算、存储功能以及网络交互功能的电子设备;也可以为运行于该电子设备中,为数据处理、存储和网络交互提供支持的软件程序。在本实施例中,并不具体限定服务器的数量。上述服务器具体可以为一个服务器,也可以包括多个服务器,或者,由若干服务器形成的服务器集群。
在本实施例中,上述客户端设备具体可以是一种应用于用户一侧,能够实现数据采集、数据传输、数据处理等功能的前端设备。具体的,上述客户端设备例如可以为台式电脑、平板电脑、笔记本电脑、智能手机、数字助理、智能可穿戴设备、导购终端、具有网络访问功能的电视机等。或者,所述客户端设备也可以为能够运行于上述电子设备中的软件应用。例如,可以为在智能手机上运行的某APP等。
本申请提供了一种附图页面的展示方法,该方法可以应用于上述业务系统中的客户端设备一侧,参阅图3所示,具体实施时,可以包括以下内容。
S301:展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图中包含有图像标识。
在本实施例中,上述目标文件具体可以理解为一种用户关注、想要查阅的文件数据。其中,上述目标文件可以是包含有文本数据和附图数据两种不同类型数据的文件数据。
具体的,上述目标文件可以是专利文件。例如,同时包含有权利要求书、说明书等文本数据,以及说明书附图、摘要附图等附图数据的申请文件。也可以是商标文件。还可以是携带有附图的论文文件或者合同文件等等。具体实施时,根据具体的应用场景,上述目标文件还可以是除上述所列举的文件类型之外其他类型的包含有文本数据和附图数据的文件数据。对于上述目标文件的具体类型和内容,本说明书不作限定。
在本实施例中,上述目标附图具体可以是目标文件的附图数据中所包含的多个附图,也可以是目标文件的附图数据中所包含的多个附图中某一个或几个由用户选中的附图。例如,用户可以单独选择专利文件中的说明书附图中的附图2作为目标附图,也可以不作选择,默认说明书附图中所包含所有附图,例如,附图1、附图2和附图3,作为目标附图。
在本实施例中,上述目标附图通常还会包含有一个或多个的图像标识。可以参阅图1所示。图中的“100”、“101”和“102”就可以理解为图1中的图像标识。
其中,上述图像标识具体可以理解为附图数据中一种用于指示附图整体或者附图中的某个内容对象的标识信息。
具体的,上述图像标识,可以是用于指示附图整体序号的标识信息。例如,位于附图外部下方的包含有字符“图”和数字“1”的组合:“图1”。也可以是位于附图中,用于指示附图中的内容对象的标识信息。例如,位于附图中,通过携带有箭头的直线(可以理解为一种特征线)与附图中的某个零部件相连的数字(例如“101”),或字母(例如“A”),还或者字母和数字的组合(例如“A101”)等等。当然,需要说明的是,上述所列举的图像标识只是为了更好地说明本说明书实施例。具体实施时,根据具体情况和用户需求,上述图像标识还可以是其他类型的标识信息。例如,还可以是位于附图中的方框内,没有特征线相连的数字和文字的组合,例如“模块21”等等。对此,本说明书不作限定。
在本实施例中,需要说明的是,客户端设备可以事先获取得到目标文件的附图数据和文本数据。其中,上述附图数据和文本数据具体可以是预先进行处理后得到的,包含有已经识别确定出图像标识的附图数据,和包含有已经识别确定出关键词组的文本数据。
其中,上述关键词组具体可以是指目标文件的文本中用于描述或者指示目标文件中的附图整体,和/或,附图中所包含的内容对象的字符组合。其中,文本数据中的每一个关键词组分别与附图数据中的一个图像标识匹配对应,并且相互匹配对应的关键词组和图像标识所指示的附图整体或者附图中的内容对象相同。
具体的,上述关键词组可以是文本数据中国的数字和名词的字符组合,例如“101部件”或者“部件101”。也可以是字母和名词的字符组合,例如“A接口”或者“接口A”。还可以是数字和表征图的字符的组合,例如“图1”或者“fig1”等等。此外,根据具体情况,还可以将包含有多个词组的文本语句也作为一种关键词组。具体的,可以将目标文件的附图说明中对某一个附图整体的描述的语句作为一种关键词组。例如可以将“图1是基于现有的附图页面的展示方法的一种实施例的示意图”作为与图像标识“图1”对应的关键词组。当然,需要说明的是,上述所列举的关键词组只是为了更好地说明本说明书实施例。具体实施时,根据具体情况和处理要求,还可以引入其他类型的字符组合作为关键词组。对此,本说明书不 作限定。
在本实施例中,具体实施时,客户端设备可以先接收用户针对目标文件中的目标附图的展示操作;响应所述展示操作,将用户选择中要展示的附图确定为目标附图,并根据所述目标文件的附图数据,以及文本数据,向用户生成并展示出包含有目标附图的页面。其中,在上述页面的目标附图中,还会显示有该目标附图的图像标识。具体可以参阅图4所示的相关内容。
S303:接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识。
在本实施例中,上述针对图像标识的触发操作也可以理解为一种对图像标识选中操作。例如,当用户对所展示的附图页面中的某个图像标识比较感兴趣时,可以通过手指或鼠标等选中所感兴趣的图像标识,从而发起针对所述目标附图中的图像标识的触发操作。
具体的,上述针对目标附图中的图像标识的触发操作具体可以包括以下至少之一:按压操作、选中操作、点击操作等等。当然需要说明的是,上述所列举的触发操作只是为了更好地说明本说明书实施例。对于上述触发操作的具体形式和类型,本说明书不作限定。
在本实施例中,上述被触发的图像标识具体可以理解为用户相对关注、感兴趣,想进一步理解相关文本信息的,被用户所选中触发的图像标识。
在本实施例中,上述接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识,具体实施时,可以包括以下内容:接收预设触发操作,其中,所述预设触发操作包括以下至少之一:按压操作、选中操作、点击操作;确定接收到所述预设触发操作的接收位置;将与所述接收位置的距离小于等于预设触发距离阈值的图像标识确定为所述被触发的图像标识。
在本实施例中,上述与所述接收位置的距离小于等于预设触发距离阈值的图像标识具体可以理解为被所述触发操作直接接触到的图像标识,或者与上述触发操作所对应的触发操作的接收位置较为靠近的图像标识等。需要说明的是,上述所列举的确定被触发的图像标识的方法只是一种示意性说明。具体实施时,根据具体情况,也可以采用其他合适的方式确定出上述被触发的图像标识。例如,也可以通过客户端设备向用户展示待触发的图像标识的输入框。用户可以通过该输入框输入想要触发的图像标识所对应的标识信息,从而发起针对该图像标识的触发操作等等。对此,本说明书不作限定。
具体的,例如,可以参阅图4所示,用户对图像标识“101”所指示的零部件感兴趣,可以通过鼠标点击当前附图页面中图1内的图像标识“101”发起针对该图像标识的触发操作。客户端设备可以接收并响应该触发操作,将图像标识“101”确定为被触发的图像标识。
S305:在所述页面中显示出被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
在本实施例中,上述被触发的图像标识的匹配文本具体可以理解为一种用于对被触发的图像标识所指示的附图或附图中的内容对象进行描述或说明的文本内容。具体的,上述被触发的图像标识的匹配文本至少可以包括用于描述所述被触发的图像标识的语句或词组等等。当然需要说明的是,上述所列举的匹配文本只是一种示意性说明。具体实施时,根据具体情况和应用场景,上述匹配文本还可以包括有其他与被触发的图像标识相关的文本内容。对此,本说明书不作限定。
在本实施例中,具体实施时,客户端设备可以先根据目标文件的附图数据和文本数据,确定出与被触发的图像标识匹配的关键词组,作为关联的关键词组;再根据文本数据,确定出文本数据中包含有关联的 关键词组的文本数据,进而可以根据上述包含有关联的关键词组的文本数据,确定上述匹配文本。
在本实施例中,上述目标文件的附图数据中具体包含有图像标识,以及图像标识的相关信息,例如,用于表征图像标识在附图数据中的具体位置的位置信息,和与图像标识对应的身份序号等等。类似的,上述目标文件的文本数据中具体包含有关键词组,以及关键词组的相关信息,例如,用于表征关键词组在附图数据中的具体位置的位置信息,和与关键词组对应的身份序号等等。
其中,上述图像标识的身份序号与该图像标识一一对应,即每一个图像标识对应一个身份序号。同样,每一个关键词组的身份序号与该关键词组一一对应,即每一个关键词组对应一个身份序号。进一步,用于描述或指示同一附图,或者同一附图中的同一内容对象的关键词组和图像标识的身份序号相互之间存在匹配对应的关系。例如,用于描述或指示同一附图,或者同一附图中的同一内容对象的关键词组和图像标识的身份序号完全相同,或者近似,只相差一两个字符等。
在本实施例中,具体实施时,可以先根据目标文件的附图数据确定出被触发的图像标识的身份序号;进而可以根据被触发的图像标识的身份序号,对目标文件的文本数据中所包含的关键词组进行检索,检索到身份序号与被触发的图像标识的身份序号匹配的关键词组,作为上述关联的关键词组。
在本实施例中,在确定出关联的关键词组后,进一步,可以根据关联的关键词组的位置信息,从目标文件的文本数据中确定出包含有关联的关键词组的语句。进而,可以根据上述包含有关联的关键词组的语句,生成对应的匹配文本。
具体实施时,可以直接将上述关联的关键词组本身作为一种匹配文本。也可以将文本数据中关联的关键词组所在的整个语句作为匹配文本。还可以将较为复杂的包含有关联的关键词组的语句输入至预先训练好的语义识别模型中,得到更为精炼的、概括性的语义识别结果,进而可以将上述语义识别结果作为匹配文本等。其中,上述语义识别模型具体可以是预先获取包括有关键词组的文本语句作为样本数据;再对上述样本数据中的与关键词组相关的语义进行标注,得到标注后的样本数据;利用上述标注后的样本数据对神经网络进行模型训练,从而建立得到上述语义识别模型。当然,需要说明的是,上述所列举的确定匹配文本的方式只是为了更好地说明本说明书实施例。具体实施时,根据具体的应用场景和用户需求,还可以采用其他合适的方式来根据包含有关联的关键词组的语句生成对应的匹配文本。对此,本说明书不作限定。
在本实施例中,客户端设备在确定出被触发的图像标识的匹配文本后,可以在当前展示给用户的附图页面中显示出对应被触发的图像标识的匹配文本,以便用户可以方便地在当前页面中了解到给图像标识所指示的附图或附图中的内容对象的相关信息。
在本实施例中,上述在所述页面中显示出被触发的图像标识的匹配文本,具体实施时,可以在所述页面中的预设显示位置处,显示所述被触发的图像标识的匹配文本。
在本实施例中,为了方便用户阅读,上述预设显示位置具体可以为在附图页面中与被触发的图像标识邻近的区域。客户端设备在具体显示时,可以在附图页面中的与被触发的图像标识靠近的预设显示位置处显示出与被触发的图像标识对应的匹配文本。这样用户在阅读到附图中的这个图像标识时,能够非常自然、方便地在该图像标识附近的预设显示位置处看到与该图像标识对应的匹配文本,从而可以及时、高效地了解到与该图像标识所指示的附图或附图中的内容对象相关的文本信息,便于用户更好地理解附图内容。具体的,例如,可以参阅图4所示,客户端设备可以在被触发的图像标识“101”的左下方位置处显示出对应的匹配内容“部件101为连接件”等等。
在一个实施例中,上述在所述页面中显示出被触发的图像标识的匹配文本,具体实施时,还可以包括以下内容:在所述页面中,以悬浮框的方式显示出被触发的图像标识的匹配文本。
在本实施例中,上述显示有被触发的图像标识的匹配文本的悬浮框,可以根据客户的阅读习惯或者喜好被拖动到任意位置。进一步而言,当以悬浮框的方式显示出被触发的图像标识的匹配文本遮挡住了其他图像标识或者其他图像标识的匹配文本时,用户可以通过拖动悬浮框,将上述显示有被触发的图像标识的匹配文本的悬浮框移动至不会对其他图像标识或者其他图像标识的匹配文本造成遮挡的位置处进行显示。
在本实施例中,具体实施时,为了区别中附图中原本就包含的例如图像标识等信息,还可以以悬浮框的方式在你页面中的预设显示位置处显示出被触发的图像标识的匹配文本。具体的,可以参阅图5所示,可以以悬浮框的形式在图像标识“101”的左下方位置处显示出对应的匹配文本:“部件101为连接件”。
在一个实施例中,在显示被触发的图像标识的匹配文本的同时,还可以显示出用于生成该匹配文本的关联的关键词组在该目标文件的文本数据中的位置信息。具体的,例如,可以参阅图5所示,在图像标识“101”附近的悬浮框内除了会显示有出对应匹配文本:“部件101为连接件”外,还会显示出关联的关键词组的位置信息:“说明书第2页第5行”,并用“()”的形式标识出来,以区别匹配文本,提示用户该匹配文本在文本数据中的相应来源。这样用户如果想再对目标文件的文本中记载的与部件101相关的文本内容进行进一步的阅读、研究,可以以上述悬浮框中所显示的位置信息作为索引,准确地在目标文件的文本内容中的说明书第2页第5行中找到与该部件101相关的文本内容进行阅读,从而可以方便用户在阅读目标文件的附图时,对目标文件的文本进行针对性的查询和阅读。
在一个实施例中,上述位置信息还可以被设置为一种与目标文件的文本数据关联的触发链接。具体实施时,例如,用户在阅读附图页面所显示的附图时,如果想要进一步了解在目标文件的文本中关于上述图像标识“101”所指示的内容对象更加具体、详细的文本描述时,可以通过直接点击匹配文本中的“()”内的位置信息“说明书第2页第5行”发出针对该触发链接的触发指令。客户端设备进而可以响应上述触发指令,根据被触发的触发链接所对应的位置信息,根据目标文件的文本数据,检索并确定出位于文本中的上述位置“说明书第2页第5行”处的文本内容;再以弹出对话框或者跳转文本页面转等方式,向用户展示出该位置处的具体文本内容,使得用户可以更方便地阅读、了解到文本数据中针对该图像标识所指示的内容对象的相关信息。
当然,具体实施时,还可以将上述位置信息与对应的图像标识关联,即还可以将图像标识本身设置为一种触发链接。这样页面中也可以不显示出上述关键词组的位置信息,如果用户想要进一步了解在目标文件的文本数据中关于图像标识所指示的内容对象的更加具体、详细的文本信息时,还可以直接通过点击该图像标识发出触发指令。这时,客户端设备可以根据被触发的图像标识这个触发链接所对应的关键词组的位置信息,找到目标文件的文本数据中包含有关联的关键词组文本内容,再通过弹出对话框的等方式,向用户展示出上述的具体文本内容。
在一个实施例中,又考虑到有时客户端设备所要展示的目标附图比较复杂,例如目标附图本身的图形结构相对比较复杂,或者目标附图中所包含的图像标识的数量相对较多,分布较为密集,而向用户展示的页面范围区域往往有限。这时如果直接在上述页面中在目标附图中被触发的图像标识的附近区域展示出匹配文本,会导致所展示的页面相对较为混乱,甚至还会出现某些位置相近的图像标识的匹配文本互相遮盖的现象,影响用户的阅读和使用体验。
基于上述考虑,具体实施时,客户端设备在上述附图页面中显示被触发的图像标识的匹配文本时,可以先检测出被触发的图像标识在附图中的位置与附图的中心点的位置关系,进而可以根据被触发的图像标识在附图中的位置与附图的中心点的位置关系,再来确定用于显示该图像标识的匹配文本的显示区域,即预设显示位置,再在该预设显示位置中显示对应的匹配文本,以避免出现与其他图像标识的匹配文本或其他图像标识等图像信息相互遮挡,影响用户阅读的问题。
在本实施例中,具体的,客户端设备可以根据被触发的图像标识在附图中的位置与附图的中心点的位置关系,确定出靠近被触发图像标识(即与被触发的图像标识的距离小于第一距离阈值),且沿偏离附图的中心点的方向上的区域位置作为上述预设显示位置,来显示被触发的图像标识的匹配文本。
具体的,例如,可以参阅图6所示,在显示图像标识“100”的匹配文本时,可以先确定图像标识“100”在附图中的位置坐标,以及附图的中心点的位置坐标;再根据图像标识“100”在附图中的位置坐标,以及附图的中心点的位置坐标,确定出该图像标识与附图的中心点的位置关系为:位于附图的中心点的上方偏左的方向上。进而可以根据上述位置关系,确定出该图像标识的匹配文本的预设显示位置为靠近该图像标识,但沿偏离附图的中心点(即该图像标识上方偏左)方向上的位置作为预设显示位置,进而可以在该预设显示位置处中显示出该图像标识“100”的匹配文本。按照类似的方式,也可以分别确定出用于显示附图中其他的图像标识“102”和“101”的匹配文本的预设显示位置,并且在上述图像标识被触发后分别在对应的预设显示位置处显示出各自的匹配文本。这样可以使得同一附图中不同图像标识的匹配文本在显示的时候能够尽量分散,避免出现互相遮挡的情况,使得用户能够更加方便、清楚地阅读图像标识的匹配文本,也不会影响用户对附图本身的阅读。
在一个实施例中,为了进一步避免出现显示混乱,例如,由于匹配文本所包含的字符数量较多,即使在上述预设位置处显示匹配文本,也可能导致不同图像标识的匹配文本相互之间出现遮挡,影响用户阅读的问题。具体显示时,还可以先在上述附图页面中被触发的图像标识的预设显示位置处,先显示出对应的匹配文本中的少于预设长度的字符,即不完整的匹配文本。其中,上述预设长度具体是指匹配文本原本所包含的字符数。当用户想要读取完整的匹配文本时,再通过接收并响应用户针对上述所显示的少于预设长度的字符的触发操作,再显示出图像标识的完整的匹配文本。
具体的,例如,可以先在被触发的图像标识的预设显示位置处显示对应的匹配文本中的第一个字符;再接收到用户针对上述匹配文本中的第一个字符的触发操作后;再响应所述触发操作,显示出包含所述第一字符的该图像标识的完整的匹配文本。
在一个实施例中,上述附图页面中所显示的图像标识,以及图像标识的匹配文本具体可以是以一种与上述页面的主体结构相对独立的形式(例如可拖动分离的对话框等)在页面中进行显示的,即上述页面中的图像标识,以及图像标识的匹配文本的显示位置、显示角度具体还可以是不固定,不固定于页面主体结构或者页面中附图的,而是能根据用户的具体指令或操作进行相应拖动、调整的。
例如,可以参阅图7所示,客户端设备所展示的上述附图页面中会先默认在被触发的图像标识的预设显示位置处显示出被触发的图像标识的匹配文本。用户根据自己的需求或者阅读习惯,可以先选中当前在预设显示位置处显示的匹配文本,并将该匹配文本拖动到页面中的其他自己觉得合适的区域位置处进行显示。例如,可以将图像标识“101”的匹配文本从页面的左下角拖动至右下角的位置处进行显示。
此外,根据用户具体的需要,用户还可以按照类似的方式先选中该匹配文本,再通过具体的触发旋转 的操作,对该匹配文本进行包括360度的旋转调整,以便最终可以以合适用户的旋转角度旋转后再进行显示。
再例如,还可以参阅图8所示,用户根据自己的阅读习惯,想要将目标附图旋转90度后进行阅读浏览。这时,用户可以通过针对附图旋转操作将该附图旋转90度进行显示,但由于附图中被触发的图像标识的匹配文本是以一种相对独立的形式进行显示的,因此,虽然附图发生了旋转,但附图中的被触发图像标识的匹配文本可以保持不旋转,从而能避免匹配文本随附图发生旋转对用户的阅读造成不便。
在一个实施例中,附图页面中的目标附图、图像标识,以及图像标识的匹配文本可以是相对独立的。
具体实施时,客户端设备还可以接收用户的附图旋转指令;根据附图旋转指令将附图页面中的目标附图旋转指定角度,以方便用户更好地阅读目标附图中的内容。并且在旋转目标附图的过程中,客户端设备可以保持附图页面中的图像标识,和/或图像标识的匹配文本不随着目标附图一同旋转。
此外,客户端设备也可以在旋转目标附图的过程中,先将附图页面中的图像标识,和/或图像标识的匹配文本连同目标附图一同旋转,使得图像标识,和/或图像标识的匹配文本的显示方向与旋转后的目标附图一致;进一步,可以根据默认指令,或者用户发出的字符方向校正指令,单独再将附图页面中的图像标识,和/或图像标识的匹配文本进行旋转校正,使得校正后的图像标识,和/或图像标识的匹配文本可以以适合用户正常阅读的显示方向显示在附图页面中。从而可以满足用户多样化的阅读需求,提高用户的使用体验。
具体的,例如,可以参阅图28所示,附图页面上还可以展示有附图旋转控制图标,用户在阅览该附图页面时,如果觉得当前目标附图的显示方向不方便自己阅读,可以点击上述附图旋转控制图标,发出附图旋转指令。客户端设备可以接收并根据上述附图旋转指令,单独将目标附图沿顺时针方向旋转90度,同时使得附图页面中的图像标识的匹配文本保持原来的显示方向不变,方便用户旋转完目标附图后还可以继续正常地阅读附图页面中图像标识的匹配文本。在图28所展示的示例中,旋转后相对于旋转前,所展示的目标附图以及目标附图中原有的图标(图中较小的数字字符,例如,230)的显示方向都沿顺时针旋转了90度,但在目标附图上显示的图像标识的匹配文本(图中较大的字符组合,例如,230.legs)的显示方向却没有随着目标附图发生改变。
在本实施例中,用户在发出附图旋转指令时,还可以自定义设置旋转方向,以及指定角度。进而,客户端设备可以根据用户的附图旋转指令,将目标附图沿用户自定义设置的旋转方向旋转指定角度,从而可以在附图页面上以满足用户的显示方向来显示目标附图。
在一个实施例中,在上述页面中显示出被触发的图像标识的匹配文本时,还可以根据具体情况,从多种预设的字符尺寸中选择合适的字符尺寸来显示被触发的图像标识的匹配文本。例如,客户端设备在所展示的上述页面中显示被触发的图像标识的匹配文本前,可以先统计待显示的匹配文本中所包含的字符数量,如果匹配文本中所包含的字符数量较多,例如大于第一字符数阈值100个,为了能减少出现互相遮挡的发生几率,可以自动从多种预设的字符尺寸中选择以相对较小的字符尺寸作为默认的字符尺寸来显示匹配文本。这样也可以减少出现遮挡对用户阅读造成的影响。进一步,上述页面中所显示的被触发的图像标识的匹配文本中字符的大小尺寸可以允许用户自行调整。具体实施时,客户端设备可以按照上述方式先确定出合适的字符尺寸作为默认尺寸来显示被触发的图像标识的匹配文本。在显示出匹配文本后,用户可以根据自己的情况需要,例如页面上所显示的某个匹配文本中某个字符过小不够清楚,用户想要辨识出该字符,用户可以先选中该匹配文本,再通过对所选中的匹配文本进行拉伸或收缩来调整匹配文本中字符的尺 寸大小,从而能更好地满足不同用户的个性化要求,显示出被触发的图像标识的匹配文本。
在一个实施例中,又考虑到有时页面中目标附图所包含的图像标识的数量相对较多,相应的所显示的被触发的图像标识的匹配文本的数量也可能会相对较多,导致很容易造成用户阅读时对图像标识和匹配文本的对应关系出现混乱。例如,由于第一图像标识和第二个图像标识较为靠近,可能出现用户将第二个图像标识的匹配文本误认为是第一图像标识的匹配文本。为了避免出现上述情况,进一步提高用户体验,具体实施时,还可以根据图像标识和图像标识的匹配文本的对应关系,用同一种颜色来显示的相互对应的被触发的图像标识和匹配文本。具体的,例如,可以在所展示的页面中按照默认的颜色规则用红色显示被触发的第一个图像标识和该图像标识的匹配文本,用绿色显示第二个被触发的图像标识和该图像标识的匹配文本。以此类推,可以用不同的颜色作为区分,在页面中显示出不同的被触发的图像标识和对应的图像标识的匹配文本。当然,具体实施时,也可以接收并根据用户自己自定义的颜色,来显示某一被触发的图像标识,以及该图像标识的匹配文本。这样能够有效地减少用户在图像标识和匹配文本对应关系上出现混乱和错误,进一步提高了用体验。
在一个实施例中,为了能够更加清楚地显示出匹配文本,方便用户准确阅读匹配文本中所包含的文本信息,进一步提高用户的使用体验。具体实施时,客户端设备还可以根据所展示的页面中附图的布局特点,以及相邻图像标识之间的距离位置关系,从预设的多种排版方式中选择出合适的排版方式来显示该附图中的被触发的图像标识的匹配文本。其中,上述预设的排版方式具体可以包括竖向排版和横向排版等。当然,上述所列举的预设的排版方式只是为了更好地说明本说明书实施例,具体实施时,根据具体的场景要求,还可以引入其他类型的排版方式。对此,本说明书不作限定。
在一个实施例中,考虑到用户在读目标文本和目标附图时,是为了和其他文件内容进行比对使用等特殊需求,用户在阅读匹配文本时,还可以根据实际需求,对目标附图中匹配文本进行自定义编辑,编辑成用户期望的内容,便于用户记忆和理解;但是目标文件中其他部分的所述匹配文本可进行相应变更也可不变更。
具体的,例如,客户端设备在检测到某个图像标识与横向相邻的图像标识(例如左方相邻的图像标识和右方相邻的图像标识)的距离相对较小,而与竖向相邻的图像标识(例如上方相邻的图像标识和下方相邻的图像标识)的距离相对较大时,为了减少出现遮挡几率,使得用户能够更加清楚地阅读该图像标识的匹配文本,可以选择竖向排版方式作为默认的排版方式来显示该图像标识的匹配文本。相对的,如果检测到某个图像标识与横向相邻的图像标识的距离相对较大,而与竖向相邻的图像标识的距离相对较小时,则可以选择横向排版方式作为默认的排版方式来显示匹配文本。当然,也可以接收并根据用户自己选择的排版方式来显示匹配文本,从而能够满足不同用户的个性化要求,进一步提高用户体验。
在一个实施例中,还可以参阅图9所示,还可以在所展示的页面上预先设置控制图标。例如图9中位于页面右上角的“匹配文本显示”。用户可以通过对控制图标的操作设置,根据自己的需求,来控制图像标识的匹配文本是否在所展示的页面中显示。例如,在图9中,如果用户当前只想阅读目标附图,觉得所展示的页面所显示的匹配文本太多,干扰了自己对附图本身的阅读,可以将上述“匹配文本显示”向右拨动至“off”的位置,这时在上述页面中会隐藏掉显示出来的被触发的图像标识的匹配文本,这样用户可以清楚地阅读目标附图的附图信息。如果用户又想要阅读图像标识的匹配文本时,参阅图10所示,则可以将上述“匹配文本显示”向左拨动至“on”的位置,这时在上述页面中会重现显示出各个被触发的图像标识的 匹配文本。当然,上述所列举的控制图标只是为了更好地说明本说明书实施例。具体实施时,根据具体情况和使用情景还可以提供其他形式类型的开关控制按钮,以便用户可以根据自己的需要,灵活地控制页面中的匹配文本的显示与否。对此,本说明书实施例不作限定。
又例如在一个实施例中,为了避免被触发的图像标识的匹配文本的显示对用户阅读目标附图的干扰和影响,客户端设备还可以在页面中目标附图附近的空白区域处(或者其他不影响用户阅读目标附图的区域处)生成并展示出一个数据列表。
其中,该数据列表中具体可以包括有该目标附图中所包含的各个图像标识,以及与各个图像标识对应的匹配文本。可以参阅图11所示。这样用户可以在阅读页面中的目标附图的信息时,可以很方便地对照参照目标附图旁边的数据列表,从而能够准确、高效地读取了解到各个图像标识所对应的相关文本信息,不会由于页面中匹配文本显示影响对目标附图的阅读和理解。
在一个实施例中,具体实施时,上述数据列表还可以与目标附图中的图像标识相互关联。具体的,在没有图像标识被触发的情况下,数据列表中的图像标识和图像标识的匹配文本可以以一种灰度较浅或者较透明的字体形式显示。当用户通过点击等方式触发附图中某个图像标识时,客户端设备会在上述数据列表中将目标附图中被触发的图像标识,以及被触发的图像标识的匹配文本以区别于数据列表中其他未被触发的图像标识,以及未被触发的图像标识的匹配文本的预设的显示方式(例如灰度更深或者不透明的字体形式)在上述数据列表中显示出来,以提示用户目标附图中被触发的图像标识在数据列表中的位置。这样用户可以清楚、准确地从数据列表中识别、确定出被触发的图像标识,以及被触发的图像标识的匹配文本。
其中,上述预设的显示方式具体可以包括:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以悬浮框的方式显示等等。具体实施时,根据具体的应用场景和用户的个性化需求也可以采用除上述所列举的显示方式,或者多种显示方式组合的方式来在数据列表中显示被触发的图像标识,以及该被触发的图像标识的匹配文本。
具体的,例如,用户点击触发了目标附图中的图像标识9,则数据列表中的被触发的图像标识9,以及被触发的图像标识9的匹配文本可以以标亮的方式显示出来。当然,上述所列举的显示方式只是一种示意性说明。具体实施时,也可以通过对被触发的图像标识,以及被触发的图像标识的匹配文本进行颜色加深,或字体加粗,或加下划线,或斜体,或字体的字号变大等其他多种区别于未被触发的图像标识,以及未被触发的图像标识的匹配文本的方式显示出来。对此,本说明书实施例不作限定。
当然,需要说明的是,上述所列举在页面中显示出被触发的图像标识的匹配文本的方式都是为了更好地说明本说明书实施例。具体实施时,也可以根据具体情况和应用场景,以及满足不同用户的多样化的需求,还可以采用其他合适的显示方式来显示上述被触发的图像标识的匹配文本。对此,本说明书不作限定。
在一个实施例中,所述在所述页面中显示出被触发的图像标识的匹配文本,具体实施时,可以包括:在所述页面中,以与页面的主体结构相独立的形式显示出被触发的图像标识的匹配文本;相应的,所述方法具体还可以包括:接收并根据用户的操作指令,在所述页面中将所显示的被触发的图像标识的匹配文本移动至指定位置。其中,上述指定位置可以根据用户的需求灵活设置。例如,上述指定位置可以是一个符合用户个人浏览阅读习惯的位置,也可以是一个不会对附图中的其他信息造成遮挡或干扰的位置等。
在一个实施例中,在所述页面中将所显示的被触发的图像标识的匹配文本移动至指定位置后,所述方法具体还可以包括:记录所述指定位置。这样如果用户后续再次触发该图像标识时,会优先在上述指定位 置处显示被触发的图像标识的匹配文本,从而可以进一步提高用户的使用体验。
在一个实施例中,所述方法具体实施时还可以包括:在所述页面中生成并展示数据列表,其中,所述数据列表包括目标附图中所包括的图像标识,以及图像标识的匹配文本;其中,所述数据列表与目标附图中的图像标识相互关联。
在一个实施例中,所述在所述页面中显示出被触发的图像标识的匹配文本,具体实施时,还可以包括:在所述数据列表中以预设的显示方式显示出被触发的图像标识,以及被触发的图像标识的匹配文本。
在一个实施例中,相对的,如果用户通过点击等触发方式触发数据列表中的某个图像标识和/或某个图像标识的匹配文本,在目标附图中也可以以预设的显示方式显示出被触发的图像标识。
在一个实施例中,所述预设的显示方式具体可以包括以下至少之一:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以悬浮框的方式显示;以字体为斜体的方式显示等。
在一个实施例中,在根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本后,所述方法具体实施时,还可以包括:接收并响应用户的保存操作,保存显示了被触发的图像标识的匹配文本的目标附图页面。这样可以方便用户保存下载上述显示有匹配文本的目标附图,以便用户后续使用,进一步提高了用户的使用体验。
由上可见,本申请提供的附图页面的展示方法,通过客户端设备接收并响应用户针对附图页面中图像标识的触发操作,确定出被触发的图像标识;再根据所获取的目标文件的附图数据和文本数据,在所述附图页面中显示出被触发的图像标识的匹配文本,以对图像标识所指示的附图或者附图中的内容对象进行描述说明,使得用户不再需要自己来回地在目标文件的文本中翻找与目标附图中的所感兴趣的图像标识相关的文本语句,就能准确地获取到与该图像标识相关的其他信息,方便用户高效地阅读、理解附图页面中的附图内容,从而简化了用户操作,提高了用户的使用体验。
在一个实施例中,所述图像标识具体可以包括:用于指示附图序号的标识信息,和/或,用于指示附图中的内容对象的标识信息等等。
在一个实施例中,所述目标文件具体可以包括以下至少之一:专利文件、论文文件、合同文件、商标文件等等。当然,上述所列举的目标文件只是为了更好地说明本说明书实施例。具体实施时,根据具体的应用场景,还可以引入其他类型的文件作为上述目标文件。对此,本说明书不作限定。
在一个实施例中,在展示包含有目标文件中的目标附图的页面前,所述方法具体实施时,还可以包括以下内容:
S1:获取目标文件的附图数据和文本数据,其中,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;
S2:接收针对目标文件中的目标附图的展示操作;
S3:响应所述展示操作,根据所述目标文件的附图数据,展示包含有目标文件中的目标附图的页面。
在本实施例中,需要说明的是,上述客户端设备所获取的目标文件的附图数据和文本数据具体可以是客户端设备事先对目标文件的附图数据和文本数据进行相应处理,识别确定出附图数据中图像标识,和文本数据中的关键词组后的得到数据;也可以是由服务器事先对目标文件的附图数据和文本数据进行相应处理,识别确定出附图数据中图像标识,和文本数据中的关键词组后再发送给客户设备的数据。
在一个实施例中,上述接收并响应针对所述目标附图中图像标识的触发操作,确定出被触发的图像标 识,具体实施时,可以包括以下内容:接收预设触发操作,其中,所述预设触发操作包括以下至少之一:按压操作、选中操作、点击操作;确定接收到所述预设触发操作的接收位置;将与所述接收位置的距离小于等于预设触发距离阈值的图像标识确定为所述被触发的图像标识。当然,上述所列举的确定出目标附图中被触发的图像标识的方式只是一种示意性说明。具体实施时,根据具体情况,也可以采用其他合适的方式来确定目标附图中被触发的图像标识。对此,本说明书不作限定。
在一个是实施例中,上述根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,具体实施时,可以包括以下内容:根据所述目标文件的附图数据,确定出所述被触发的图像标识的身份序号;从所述目标文件的文本数据所包含的关键词组中,检索出身份序号与所述被触发的图像标识的身份序号匹配的关键词组,作为关联的关键词组;根据所述关联的关键词组,确定所述被触发的图像标识的匹配文本;在所述页面中的预设显示位置处,显示所述被触发的图像标识的匹配文本。当然,上述所列举的实施方式只是一种示意性说明,具体实施时,根据具体情况也可以采用其他合适的方式来根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本。对此,本说明书不作限定。
在一个实施例中,所述预设显示位置具体可以按照以下方式确定:确定被触发的图像标识的在目标附图中的位置,以及目标附图的中心点的位置;根据被触发的图像标识的在目标附图中的位置,以及目标附图的中心点的位置,确定所述预设显示位置,其中,所述预设显示位置包括沿远离所述目标附图的中心点方向,且与所述被触发的图像标识的距离小于等于预设的第一距离阈值的位置。当然,需要说明的是,上述所列举的确定预设显示位置的方式只是一种示意性说明。具体实施时,根据具体情况还可以采用其他合适的方式来确定预设显示位置。对此,本说明书不作限定。
在一个实施例中,在所述页面中的预设显示位置处,显示所述被触发的图像标识的匹配文本,具体实施时,还可以包括以下内容:在所述页面中预设显示位置处,显示所述被触发的图像标识的匹配文本中的少于预设长度的字符;接收针对所述被触发的图像标识的匹配文本中的少于预设长度的字符的触发操作;响应所述触发操作,显示所述被触发的图像标识的匹配文本。
在一个实施例中,上述显示所述被触发的图像标识的匹配文本,具体实施时,还可以包括:通过预设的显示方式显示被触发的图像标识的匹配文本,其中,所述预设的显示方式包括以下至少之一:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以悬浮框的方式显示;以字体为斜体的方式显示;以字体的字号变大的方式显示等等。当然,需要说明的是,上述所列举的显示被触发的图像标识的匹配文本的方式只是为了更好地说明本说明书实施例。具体实施时,根据具体应用场景和客户需求还可以引入除上述所列举的显示方式以外其他的显示方式来显示被触发的图像标识的匹配文本。对此,本说明书不作限定。
在一个实施例中,具体实施前,客户端设备或者服务器可以预先对目标文件中附图数据和文本数据进行对应处理,以便分别识别确定出附图数据中的图像标识(以及包括图像标识的身份序号、位置信息等相关信息),和文本数据中的关键词组(以及包括关键词组的身份序号、位置信息等相关信息)。这样客户端设备可以按照上述方式根据预先处理好的目标文件的附图数据和文本数据,快速地响应用户的触发操作,确定出被触发的图像标识,并显示出被触发的图像标识的匹配文本。下面将以目标文件为专利文件为例,说明具体如何识别确定出目标文件的附图数据中的图像标识,和文本数据中的关键词组。对于其他类 型的目标文件,可以参照以下针对专利文件的实施方式,根据对应的目标文件具体特点执行。对此,本说明书不作赘述。
在一个实施例中,所述附图数据中所包含的图像标识具体可以按照以下方式确定:
S1:从所述附图数据中筛选出包含有预设页头的附图数据;
S2:对所述包含有预设页头的附图数据中的第一范围区域进行图像数据识别,将识别到的第一字符组作为所述图像标识,其中,所述第一字符组包括:数字和表征图的特征字符的组合;
S3:根据所识别到的第一字符组,在所述包含有预设页头的附图数据中确定出第二范围区域;
S4:对所述包含有预设页头的附图数据中的第二范围区域进行图像数据识别,将识别到的第二字符组作为所述图像标识,其中,所述第二字符组包括:与所述包含有预设页头的附图数据中的特征线的距离小于预设的第二距离阈值的数字和/或字母的组合,所述特征线包括:携带有箭头的直线,和/或,携带有箭头的曲线。
在本实施例中,上述预设页头具体可以理解为包含有“说明书附图”或者类似语义字符的页头。通常,在专利文件中,包含有上述预设页头的附图数据往往是包含有构成专利文件中各种具体附图的附图数据,而附图中往往会存在图像标识。因此,具体实施时,可以先根据上述预设页头,从目标文件的附图数据中筛选出包含有预设页头的附图数据,以便后续对该部分附图数据进行具体处理,识别确定出所包含的图像标识。
在一个实施例中,在从目标文件附图数据中筛选出包含有预设页头的附图数据后,进一步,又考虑到专利文件中附图部分的格式特点:即通常会在每张附图的外部下侧(即第一范围区域),且与该附图邻近的范围区域中设置有用于指示该附图整体的序号(或编号)的由字符组成的标识信息(即第一种图像标识,例如caption),并且上述标识信息往往是由数字与表征图的字符(例如“图”或者“fig”等)组合而成(对应于上述第一字符组)。例如,图1中位于附图的外部下侧位置处的标识信息“图1”。而在附图的内部,且与附图中内容对象相近或者通过特征线(例如携带有箭头的直线或曲线等)与内容对象相连的范围区域(即第二范围区域中)中还会设置有用于指示该内容对象的由字符组成的标识信息(即第二种图像标识),并且上述标识信息往往是由单独的数字或者单独的字母、汉字,或者数字、字母和汉字中的一种或两种字符组合而成(对应于上述第二字符组,例如,part label)。例如,图1内部中的标识信息“101”。因此,可以针对上述两种不同种类的图像标识所具备的不同的位置和字符组合特点,分别进行对应的图像数据识别处理,进而可以从目标文件的包含有预设页头的附图数据中分别准确地识别确定出附图中所包含的第一字符组和第二字符组,作为上述图像标识。
因此,具体实施时,可以根据专利文件中附图的不同种类的图像标识所具备的上述特点,有针对性地分别对附图数据中不同种类的图像标识进行对应的图像数据识别处理。
其中,上述图像数据识别处理具体可以是一种基于OCR(光学字符识别)的图像数据识别处理。当然,也可以是基于其他原理的,例如基于其他图像特征的图像数据识别处理。具体实施时,根据具体情况可以采用其他合适的方式来进行图像数据识别处理。下面主要以通过光学字符识别进行图像数据识别处理为例,具体说明如何识别确定出附图数据中的图像标识。
在本实施例中,上述第一范围区域具体可以理解为是附图外部下侧的临近区域。具体实施时,可以先从包含有预设页头的附图数据中确定出符合第一范围区域的附图数据;再通过对该区域进行光学字符识 别。进而可以有针对性、高效地识别出用于包含有数字和表征图的特征字符的组合的用于指示该附图的附图序号字符组合,即第一字符组,作为附图数据中的一种图像标识(例如第一种图像标识)。
在本实施例中,上述第二范围区域具体可以理解为是附图内部中,通常还与附图中的特征线相连或相邻(即距离小于预设的第二距离阈值,例如5毫米)的区域。其中,上述特征线往往还会连接有附图中某个具体的内容对象(即所指示的内容对象)。例如,系统结构中的某个零件或者装置中某个功能模块等。上述特征线的具体形式可以是携带有箭头的直线或曲线,也可以是没有携带箭头的连接线等等。具体的实施时,在确定出第一范围区域,并在第一范围区域中识别出第一字符组后,可以进一步确定附图数据中上述第一范围区域的上方区域是包含有第一字符组所指示的附图的具体内容的数据区域,即存在上述第二范围区域。进而可以再对上述第二范围区域进行光学字符识别,以便有针对性、高效地识别出用于指示附图内部中的内容对象的数字和/或字母的组合,即第二字符组,作为附图数据中的另一种图像标识(例如第二种图像标识)。
在本实施例中,又考虑到由于目标文件中的附图尺寸不同,导致附图可能是横向排版的,也可能是纵向排版的,而附图不同的排版方式又会使得上述第一字符组和第二字符在附图中的布设角度也会存在差异。例如当附图的尺寸较大被横向排版时,该附图中的第一字符组和第二字符组相对于普通的附图纵向排版情况下的布设角度会相差90度。基于上述情况,在按照上述方式检测识别第一字符组和第二字符组以确定图像标识时,可以先检测确定出文本字符相对于附图的布设角度,再根据文本字符的布设角度,按照对应的角度对字符进行识别,以准确地识别出字符,进而提高识别确定图像标识的准确度。
在本实施例中,为了进一步提高识别确定图像标识的准确度,还可以结合从文本数据中识别确定出的关键词组对所识别出的图像标识(包括第一字符组和第二字符组)进行对照验证,以减少图像标识的识别误差。
在一个实施例中,在从目标文件附图数据中筛选出包含有预设页头的附图数据后,为了避免出现遗漏,也可以直接对所述包含有预设页头的附图数据进行全范围检索。具体检索时也可以结合与预设的图像标识(例如模板图像标识)相关的信息,检索出符合要求的第一字符组,和/或,第二字符组确定出附图数据中的图像标识。
当然,需要说明的是,上述所列举的识别确定附图数据中的图像标识的方式只是一种示意性说明。具体实施时,根据目标文件的附图数据中图像标识的具体特点,还可以采用其他合适的方式进行图像标识的识别确定处理。例如,还可以利用基于神经网络的预先训练好的图像标识识别模型对上述附图数据进行图像数据识别处理,以确定出图像数据中所包含的图像标识。对此,本说明书不作限定。
在一个实施例中,为了提高所确定出的图像标识的准确度,具体实施时,可以通过利用多种不同的光学字符识别方式对附图数据进行图像数据识别处理,以提高所确定出的图像标识的精确度。
在一个实施例中,在确定出附图数据中的图像标识后,进一步,客户端设备或服务器还可以根据该图像标识,生成对应的图像标识的身份序号,并将所述图像标识的身份序号保存于所述附图数据中。
在本实施例中,上述身份序号也可以理解为一种ID信息,上述身份序号与图像标识是一一对应的。即,通过一个身份序号可以确定出附图中的一个图像标识。同样,通过一个图像标识可以识别确定出一个身份序号。
在本实施例中,上述身份序号具体可以是一种按照预设规则生成的字符串。具体的,可以根据图像标 识中所包含的数字、字母或汉字等字符,生成与所述图像标识对应的身份序号。例如,图像标识“101”的身份序号可以表示为:0101。这样后续具体处理时,通过该身份序号就可以准确地锁定出所对应的图像标识“101”。
当然,具体实施时,还可以根据图像标识中所包含的数字、字母或汉字等字符,同时结合图像标识其他的特征信息(例如图像标识中所包含的数字、字母或汉字等字符的组合方式特征等)更加准确地生成对应的身份序号。
具体的,还可以根据图像标识中所包含的数字和/或字母,和图像标识的类型特征(例如属于第一种图像标识还是第二种图像标识)来生成对应的身份序号。例如,图像标识“101”的身份序号可以表示为:label2_101。其中,该身份序号的前半部分“label2”可以用于指示该身份序号所对应的图像标识为第二种图像标识(对应第二字符组),后半部分“00101”可以用于指示所对应的图像标识所包含的数字为“101”。又例如,图像标识“图1”的身份序号可以表示为:label1_f1。其中,“label1”可以用于指示该身份序号所对应的图像标识为第一种图像标识(对应第一字符组),“001”可以用于指示该身份序号所对应的图像标识包含的数字为“1”,“f”则可以用于指示该图像标识所包含的表征图“fig”字符。这样可以确定出能更加精确地表征出所指示的图像标识,并与其它相近的图像标识区分开的身份序号。当然,需要说明的是,上述所列举的生成与所述图像标识对应的身份序号的方式只是一种示意性说明。具体实施时,根据具体情况,也可以根据其他合适的规则来生成与图像标识对应的身份序号。对此,本说明书不作限定。
在一个实施例中,在确定出附图数据中的图像标识和所对应的身份序号后,进一步,服务器或客户端设备还可以确定出图像标识的位置信息。其中,上述图像标识的位置信息具体可以理解为一种用于指示图像标识在目标文件中的附图中的位置的地址信息,可以记为pos。上述位置信息具体可以是一种位置坐标,通过该位置信息,可以准确地确定出该图像标识在附图中的具体位置。
在本实施例中,在确定出图像标识的身份序号和位置信息后,再将上述识别确定出的图像标识的身份序号和位置信息保存在附图数据中。例如,在附图数据中建立上述识别确定出的图像标识与其身份序号和位置信息的对应关系。从而完成对目标文件的附图数据处理,得到处理好的包含有图像标识的附图数据。这样后续具体处理时,可以基于上述附图数据,根据图像标识的身份序号确定出该图像标识在附图数据中的具体位置,同样根据客户端设备接收到触发的接收位置与图像标识的位置信息确定出被触发的图像标识。
在一个实施例中,确定所述文本数据中的关键词组,具体实施时,可以包括以下内容:
S1:检索所述文本数据中的小节标题;
S2:根据所述小节标题,将所述文本数据划分为多个文本区域;
S3:从所述多个文本区域中筛选出符合预设要求的文本区域,作为第三范围区域,其中,所述第三范围区域至少包括小节标题为附图说明或者BRIFE DESCRIPTION OF DRAWINGS的文本区域;
S4:根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组。
在一个实施例中,考虑到对于专利文件的文本内容在撰写时往往也遵守相应的格式要求。例如,通常专利文件中的文本内容在撰写时会被要求分成多个不同的文本区块(或者文本区域)进行撰写。其中,每个文本区域的起始位置处会设置针对该文本区域的小节标题,以对该文本区域中文本内容进行提示。其中,上述小节标题也是满足预设格式要求。例如,小节标题中的字符是加粗的,或者小节标题的字符的字体大 小是大于普通文本中的字符的。
具体的,例如,在专利文件的说明书部分中开始部分会出现第一个文本区域,并且在该区域的起始位置处会通过使用加粗字体设置有小节标题“技术领域”。在第一文本区域之后,会连接有第二个文本区域,并且在第二个文本区域的起始位置处会通过使用4号字体(即略大于普通文本字符的字体)设置有小节标题“背景技术”。接着会连接有第三个文本区域,并且在第三个文本区域的起始位置处通过使用4号字体设置有小节标题“发明内容”等等。因此,具体实施时,可以根据各个文本区域的小节标题将目标文件的文本数据划分为多个不同的文本区域。
具体实施时,还可以通过预设的文件解析器(例如可以预先设定好的description parser)对目标文件的文本数据进行处理,以确定目标文件的文本数据中各个文本区域所对应的文本数据的范围区域。具体的,可以通过上述预设的文件解析器先将目标文件的文本数据切分成多个内容部分(即多个文本区域),其中,所述多个内容部分分别携带有上述满足预设格式要求的小节标题;识别并根据各个内容部分所携带的小节标题确定该内容部分在目标文件中所对应的名称或类型;再根据上述名称或类型,确定出各个内容部分的名称(即文本区域的名称)。
在一个实施例中,又考虑到对于一些非专利文件,其他类型的文件,可能根据撰写格式要求,文本中本来就不含有上述小节标题,在检索所述文本数据,确定所述文本数据不包含有小节标题的情况下,所述方法具体实施时,还可以包括以下内容:利用预设的区域划分模型对所述文本数据进行处理,以从所述文本数据中确定出第三范围区域,其中,所述预设的区域划分模型为预先利用样本文本数据对神经网络进行模型训练得到的。需要说明的是,上述用于训练预设的区域划分模型的样本数据具体可以是人工划分出各个文本区域,并给各个文本区域按照统一规则进行命名的文件的文本数据。
在本实施例中,上述第三范围区域具体可以理解为文本数据中具有较大概率可能包含有与附图中的图像标识相关的关键词组的范围区域。
在一个实施例中,考虑到专利文件中文本内容的撰写特点:在文本内容中小节标题为“附图说明”(针对中文的专利文件)或者“BRIFE DESCRIPTION OF DRAWINGS”(针对英文的专利文件)文本区域会比较集中、全面地出现针对附图,以及附图中的图像标识的文本描述。因此,为了使得识别确定关键词组的过程更有针对性,避免对文本数据全部进行文本数据处理,减少数据处理量,提高处理效率。在根据所述小节标题,将所述文本数据划分为多个文本区域后,可以先从所述多个文本区域中筛选出符合预设要求的文本区域,作为第三范围区域,例如可以将包括小节标题为附图说明或者BRIFE DESCRIPTION OF DRAWINGS等可能存在关键词组的较为集中或概率相对较高的文本区域筛选出来作为第三范围区域,再只对文本数据中的第三范围区域进行针对性的文本数据处理,以高效地识别确定出文本数据中的关键词组。需要说明的是,上述所列举的第三范围区域具体可以是包括小节标题为附图说明或者BRIFE DESCRIPTION OF DRAWINGS的文本区域。当然根据具体情况也可以引入其他符合要求的文本区域作为上述第三范围区域。对此,本说明书不作限定。
在本实施例中,考虑到专利文件中文本的撰写特点:在包含有附图说明所对应的文本区域中往往会对专利文件所包含的各个附图以及附图中出现的图像标识分别进行较为全面的描述说明,即会出现与图像标识相关联的关键词组。例如,在该文本区域中往往会出现有如下的文本语句“图1是本说明书提供的XX系统的结构示意图”或者“图2是本说明书提供的XX方法的流程示意图”等等。其中,上述文本语句中出现 的词组“图1”、“图2”分别是文本数据中与图像标识“图1”、“图2”对应的关键词组。还可以将包含有多个词组的语句也作为一种关键词组,例如,还可以将“图1是本说明书提供的XX系统的结构示意图”、“图2是本说明书提供的XX方法的流程示意图”作为分别与图像标识“图1”、“图2”对应的关键词组。因此,在该类文本区域中较大概率会出现与第一种图像标识对应的关键词组。此外,在这类文本区域中还会出现对应第二种图像标识的关键词组。例如,附图说明所对应的文本区域中还会出现如下的文本语句“附图中的101为连接件”或者“101连接件”等等。其中,上述语句中出现的词组“101为连接件”、“101连接件”分别是文本数据中与图像标识“101”对应的关键词组。
需要补充的是,在小节标题为具体实施方式和背景技术的文本区域中,在论述具体实施方式和论述相关背景情况的过程中,有时也会提及出现附图中的图像标识。例如,在背景技术所对应的文本区域中可能会出现如下的文本语句“参阅图1所示,现有方法是xxxx”等。在具体实施方式所对应的文本区域可能会出现“通过附图2中连接件101,达到xxx效果”等。
正是考虑到上述情况,具体实施时,为了扩大确定关键词组的文本区域范围,避免遗漏,还除了将小节标题为附图说明的文本区域列入第三范围区域外,还可以将目标文件的文本数据中小节标题为具体实施方式、背景技术等具有较大概率出现与图像标识对应的关键词组的其他文本区域也列为第三范围区域,进而可以有针对性地再对上述文本数据中的第三范围区域进行具体的文本数据处理,从而能够更全面、准确地从文本数据中识别确定出所包含的关键词组。
具体实施时,可以利用针对专利文件设计的预设的文件解析器对专利文件的文本数据进行文本数据处理时,可以先根据小节标题或空行等格式特征将专利文件的文本数据分成多个内容部分。再根据不同内容部分中所携带的小节标题,确定各个内容部分的名称或类型。例如,某一内容部分所携带的小节标题为“背景技术(或者BACKGROUND ART)”,则可以确定该内容部分对应的内容名称为背景技术,即确定出专利文件的文本数据中与背景技术对应的范围区域。类似的,对另一内容部分,由于所携带的小节标题为“附图说明(或者BRIEF DESCRIPTION OF DRAWINGS)”则可以确定该内容部分的对应的内容名称为附图说明。再根据各个内容部分所对应的内容名称,选出符合要求的内容部分,例如附图说明等部分作为上述第三范围区域。
在一个实施例中,又考虑到针对专利文件这类目标文件,往往文本中的附图说明中会具有较大概率存在对图像标识所指示的附图或附图中的内容对象的解释或说明,且附图说明中用于解释或说明的文本数据相对较为集中、精炼,因此,具体实施时,也可以只将上述附图说明等区域作为第三范围区域进行具体的文本数据识别,以识别确定出文本数据中所包含的关键词组,从而减小数据处理量,提高数据处理效率。
在本实施例中,上述文本数据识别处理具体可以是一种基于NLP(自然语言识别处理)的文本数据处理。当然,也可以是基于其他原理的,例如基于其他文本特征的文本数据识别处理。具体实施时,根据具体情况可以采用其他合适的方式来进行文本数据识别处理。下面主要以通过自然语言识别处理进行第三范围区域中的文本数据进行识别处理为例,具体说明如何识别确定出文本数据中的关键词组。
在本实施例中,上述关键词组具体可以理解为在文本数据中出现的与附图数据中的图像标识存在对应关系的字符组合。即关键词组所描述的附图或附图中的内容对象,和图像标识所指示的附图或者附图中的内容对象是相同的。具体的,上述关键词组具体可以是数字和名词的组合,例如“101部件”或者“部件101”。也可以是字母和名词的组合,例如“A接口”或者“接口A”。还可以是数字和表征图的字符的组 合,例如“图1”或者“fig1”等等。当然,需要说明的是,上述所列举的关键词组只是为了更好地说明本说明书实施例。具体实施时,根据具体情况和处理要求,还可以引入其他类型的字符组合作为关键词组。对此,本说明书不作限定。
在一个实施例中,在具体对所述文本数据中的第三范围区域进行自然语言识别处理时,可以先以数字或者字母作为关键字符,从文本数据中搜索出上述关键字符。再确定与上述关键字符相邻的字符组合(即第三字符组),包括关键字符向前连接的字符组合(例如位于关键字符前面的相连的字符组合),以及关键字符向后连接的字符组合(例如位于关键字符后面的相连的字符组合)。其中,所述第三字符组具体包括以下至少之一:数字和名词的组合、字母和名词的组合、数字和表征图的字符的组合等。继而可以先判断上述相邻的字符组合是否是名词。如果都不是名词可以判定该关键字符并不能构成关键词组,结束对该关键字符的文本处理。如果相邻的字符组合中的一个字符组合为名词,则可以认为该关键字符与该确定为名词的相邻的字符组合可能构成一个关键词组,该字符组合为符合要求的第三字符组。进而可以将该字符组合确定为一个关键词组。
如果上述相邻的字符组合中的两个字符组合都为名词,或者为了进一步提高所确定的关键词组的准确度,降低误差,还可以对确定为名词的相邻的字符组合进行语义识别,根据语义识别结果与由常用的关键词组构成的关键词组库中的词组进行语义比对。如果语义识别结果与上述关键词组库中的某个词组语义的差异程度小于预设的阈值程度,即语义识别结果与上述关键词组库中的某个词组相同或接近,则可以确定该相邻的字符组合与数字或字母构成了一个关键词组。当然,还可以获取通过对附图数据进行预设的图像处理得到的图像标识,参照图像标识所包含的字符,判定相邻的字符组合是否能够和数字或字母构成一个关键词组。当然,上述所列举的确定关键词组的方式只是一种示意性说明。具体实施时,根据具体的应用场景和精度要求,还可以采用其他合适的方式从文本数据中确定出关键词组。对此,本说明书不作限定。
在一个实施例中,在识别到多个表征语义近似(或者所包含的字符相同,但组合方式存在差异)的第三字符组时,可以根据各个第三字符组在文本数据的中的出现频次,从多个第三字符组中筛选出出现频次最高的一个第三字符作为所述关键词组。例如,服务器在处理时,发现以下几个语义近似的词组:“附图2”、“图2”、“第2个图”,统计上述三个第三字符组在文本数据中出现的频次分别为:1次、9次、2次,这时可以将上述词组中的出现频次最高的“图2”确定为一个关键词组。这样可以有效地降低噪声干扰,提高所确定的关键词组的准确性。
在一个实施例中,当识别到的第三字符组的结构较为复杂,例如,可能包含有多个数字和分隔符号(例如“图1、2、3”)或者是表征范围的数字和分隔符号的组合(例如“图1-3”)。这时,服务器可以先识别并抽取出第三字符组中的分隔符号,并将所抽取的分隔符号与预设的符号进行比较,确定分隔符号的类型,再根据分隔符号的类型使用与所述分隔符号的类型匹配的拆分规则,将多个数字拆分为多个数字字符,再进行重新拼接,得到多个关键词组。
具体实施时,例如,如果所抽取的分隔符号为第一类预设分隔符号,即与逗号(“,”)、顿号(“、”)等符号相同的分隔符号,则可以依次抽取第三字符组中所包含的多个数字中的各个数字与该第三字符组中除数字和分隔符号以外的其他字符,进行组合拼接得到多个不同的关键词组。例如,根据原来的关键词组“图1、2、3”,进行拆分组合拼接可以得到:“图1”、“图2”和“图3”这3个关键词组。如果所抽取的分隔符号为第二类预设分隔符号,即与破折号(“-”)、曲线(“~”)等符号相同的分隔符号,则可 以根据该分隔符号两端的数字,生成一个数字序列,再依次从该数字序列中抽取一个数字与该第三字符组中除数字和分隔符号以外的其他字符,进行组合拼接得到多个不同的关键词组。例如,根据原来的第三字符组“图1-3”,可以先抽取分隔符“-”两端的数字“1”和“3”,并根据上述两个数字分别作为序列的起始数字和终止数字,生成对应的一个数字序列:“1、2、3”。再将依次从上述数字序列中分别抽取一个数字与“图”进行组合拼接,得到:“图1”、“图2”和“图3”3个关键词组。当然,需要说明的是,上述所采用的关键词组的确定方法只是一种示意性说明。具体实施时,根据具体情况也可以采用预先建立的编号生成器自动对上述情况的包含有分隔符号的第三字符组进行处理,得到多个经拆分组合拼接后的关键词组。对此,本说明书不作限定。
在一个实施例中,在确定出文本数据中的关键词组后,进一步,服务器或客户端设备可以根据该关键词组生成用于指示该关键词组的身份序号。
在本实施例中,类似于图像标识的身份序号,上述关键词组的身份序号也可以理解为一种ID信息,上述身份序号与关键词组是一一对应的。通过一个身份序号可以确定出文本数据中的一个关键词组。同样,通过一个关键词组可以确定一个对应的身份序号。
在本实施例中,上述身份序号具体可以是一种按照与确定图像标识的身份序号所依据的规则相同的规则所生成的字符串。具体的,生成所述关键词组的身份序号,可以包括:根据所述关键词组所包含的数字和/或字母,生成与所述关键词组对应的身份序号。例如,关键词组“部件101”的身份序号可以表示为:0101。这样通过该身份序号就可以准确地指示出所对应的关键词组“101”。
当然,需要说明的是,上述所列举的生成与所述关键词组对应的身份序号的方式只是一种示意性说明。具体实施时,根据具体情况,也可以根据其他合适的规则来生成与关键词组对应的身份序号。对此,本说明书不作限定。
在一个实施例中,在确定出文本数据中的关键词组和所对应的身份序号后,还可以进一步确定出关键词组的位置信息。其中,关键词组的位置信息具体可以理解为一种用于指示关键词组在目标文件中的文本中的位置的地址信息,可以记为pos。上述位置信息具体可以是一种位置坐标,通过该位置信息,可以准确地确定出关键词组在文本中的具体位置。例如,说明书第2页第3行第4列等。当然,上述所列举的位置信息只是一种示意性说明。本说明书对上述位置信息的具体形式不作限定。
在本实施例中,在确定出关键词组的身份序号和位置信息后,客户端设备或服务器可以再将上述识别确定出的关键词组的身份序号和位置信息保存在文本数据中。例如,可以在文本数据中建立上述识别确定出的关键词组与其身份序号和位置信息的对应关系。从而完成对目标文件的文本数据的处理,得到处理好的包含有关键词组的文本数据。这样后续具体处理时,可以基于上述文本数据,根据关键词组的身份序号确定出该关键词组在文本数据中的具体位置。
在一个实施例中,在确定出所述附图数据中所包含的图像标识后,所述方法具体实施时,还可以包括以下内容:根据所述图像标识,生成对应的图像标识的身份序号,并将所述图像标识的身份序号保存于所述附图数据中。
在一个实施例中,在检索所述文本数据,确定所述文本数据不包含有小节标题的情况下,所述方法具体实施时,还可以包括以下内容:利用预设的区域划分模型对所述文本数据进行处理,以从所述文本数据中确定出第三范围区域,其中,所述预设的区域划分模型为预先利用样本文本数据对神经网络进行模型训 练得到的。
在一个实施例中,根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组,具体实施时,可以包括以下内容:对所述文本数据中的第三范围区域进行文本数据识别,识别出第三字符组,其中,所述第三字符组包括以下至少之一:数字和名词的组合、字母和名词的组合、数字和表征图的字符的组合;将所识别出的第三字符组作为所述关键词组。
在一个实施例中,在识别出多个语义近似的第三字符组的情况下,所述方法具体实施时,还可以包括以下内容:统计所述多个语义近似的第三字符组中的各个第三字符组在文本数据中的出现频次;将出现频次最高的第三字符组确定为所述关键词组。
在一个实施例中,又考虑到为了提高处理效率,后续可以尽量选择包含有关键词组但相对较为精炼的文本数据来生成匹配文本。因此,在识别出多个第三字符组的情况下,所述方法具体实施时,还可以包括以下内容:统计所述多个第三字符组中的各个第三字符组所在文本段落所包含的字符数;将所在的文本段落所包含的字符数最少的第三字符组作为所述关键词组。
在一个实施例中,所述第三字符组具体还可以包括:多个数字和分隔符号(例如“、”、“,”或“-”)的组合,在对所述文本数据中的第三范围区域进行数据处理,识别出包含有多个数字和分隔符号的组合的第三字符组后,所述方法具体实施时还可以包括以下内容:识别所述分隔符号的类型;根据与所述分隔符号的类型匹配的拆分规则,将所述多个数字拆分为多个待拼接的数字字符;将所述多个待拼接的数字字符与所述第三字符组中的名词、字母或表征图的字符分别进行拼接,得到多个关键词组。
在一个实施例中,在确定出所述文本数据中的关键词组后,所述方法具体实施时,还可以包括以下内容:根据所述关键词组,生成对应的关键词组的身份序号,并将所述关键词组的身份序号保存于所述文本数据中。
由上可见,本申请提供的附图页面的展示方法由于通过客户端设备接收并响应用户针对附图页面中图像标识的触发操作,确定出被触发的图像标识;再根据所获取的目标文件的附图数据和文本数据,在所述附图页面中显示出被触发的图像标识的匹配文本,以对图像标识所指示的附图或者附图中的内容对象进行描述说明,使得用户不再需要自己来回地在目标文件的文本中翻找与目标附图中的所感兴趣的图像标识相关的文本语句,就能准确地获取到与该图像标识相关的其他信息,方便用户高效地阅读、理解附图页面中的附图内容,从而简化了用户操作,提高了用户的使用体验;还通过事先对目标文件的附图数据和文本数据分别进行对应的识别处理,以确定出附图数据中的图像标识、图像标识的身份序号,以及文本数据中的关键词组、关键词组的身份序号,并分别保存在对应的附图数据和文本数据中,从而客户端设备后续可以根据被触发的图像标识的身份序号,快速、精确地从文本数据中确定出匹配的关键词组,进而生成对应的匹配文本,显示给用户,提高了确定、显示匹配文本的精度和效率,进一步提高了用户体验。
基于上述附图页面的展示方法,本申请还提供了一种对应的附图页面的展示装置,可以参阅图12所示,具体可以包括以下的结构模块:
展示模块,具体可以用于展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图中包含有图像标识;
确定模块,具体可以用于接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识;
显示模块,具体可以用于根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
在一个实施例中,所述图像标识具体可以包括:用于指示附图序号的标识信息,和/或,用于指示附图中的内容对象的标识信息等等。
在一个实施例中,所述目标文件具体包括以下至少之一:专利文件、论文文件、合同文件、商标文件等等。
在一个实施例中,所述装置还包括获取模块,具体可以用于在展示包含有目标文件中的目标附图的页面前,获取目标文件的附图数据和文本数据,其中,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;以及接收响应模块,具体可以用于接收针对目标文件中的目标附图的展示操作;响应所述展示操作,根据所述目标文件的附图数据,展示包含有目标文件中的目标附图的页面。
在一个实施例中,为了能够接收并响应针对所述目标附图中图像标识的触发操作,确定出被触发的图像标识,所述接收响应模块具体可以包括以下结构单元:
接收单元,具体可以用于接收预设触发操作,其中,所述预设触发操作包括以下至少之一:按压操作、选中操作、点击操作;
第一确定单元,具体可以用于确定接收到所述预设触发操作的接收位置;
第二确定单元,具体可以用于将与所述接收位置的距离小于等于预设触发距离阈值的图像标识确定为所述被触发的图像标识。
在一个实施例中,为了能够根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,所述显示模块具体可以包括以下结构单元:
第三确定单元,具体可以用于根据所述目标文件的附图数据,确定出所述被触发的图像标识的身份序号;
第一检索单元,具体可以用于从所述目标文件的文本数据所包含的关键词组中,检索出身份序号与所述被触发的图像标识的身份序号匹配的关键词组,作为关联的关键词组;
第四确定单元,具体可以用于根据所述关联的关键词组,确定所述被触发的图像标识的匹配文本;
第一显示单元,具体可以用于在所述页面中的预设显示位置处,显示所述被触发的图像标识的匹配文本。
在一个实施例中,所述第一显示单元具体可以按照以下程序确定预设显示位置:确定被触发的图像标识的在目标附图中的位置,以及目标附图的中心点的位置;根据被触发的图像标识的在目标附图中的位置,以及目标附图的中心点的位置,确定所述预设显示位置,其中,所述预设显示位置包括沿远离所述目标附图的中心点方向,且与所述被触发的图像标识的距离小于等于预设的第一距离阈值的位置。
在一个实施例中,为了能够在所述页面中的预设显示位置处,显示所述被触发的图像标识的匹配文本,所述第一显示单元具体可以包括以下结构子单元:
第一显示子单元,具体可以用于在所述页面中预设显示位置处,显示所述被触发的图像标识的匹配文本中的少于预设长度的字符;
第一接收子单元,具体可以用于接收针对所述被触发的图像标识的匹配文本中的少于预设长度的字符的触发操作;
第二显示子单元,具体可以用于响应所述触发操作,显示所述被触发的图像标识的匹配文本。
在一个实施例中,所述第二显示子单元具体实施时,可以按照以下程序显示所述被触发的图像标识的匹配文本:通过预设的显示方式显示被触发的图像标识的匹配文本,其中,所述预设的显示方式包括以下至少之一:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以悬浮框的方式显示;以字体为斜体的方式显示;以字体的字号变大的方式显示等。当然,需要说明的是,上述所列举的预设的显示方式只是一种示意性说明。本说明书对上述预设的显示方式不作限定。
在一个实施例中,所述装置还包括数据处理模块,所述数据处理模块具体可以用于对目标文件的文本数据和附图数据分别进行相应处理,得到所述获取模块所得到的包含有识别确定出的图像标识的附图数据和包含有识别确定出的关键词组的文本数据。
其中,所述数据处理模块具体包括附图数据处理单元和文本数据处理单元。所述附图数据处理单元具体可以用于识别确定出附图数据中所包含的图像标识。所述文本数据处理单元具体可以用于识别确定出文本数据中所包含的关键词组。
在一个实施例中,所述附图数据处理单元具体实施时,可以按照以下方式确定出所述附图数据中的图像标识:从所述附图数据中筛选出包含有预设页头的附图数据;对所述包含有预设页头的附图数据中的第一范围区域进行图像数据识别,将识别到的第一字符组作为所述图像标识,其中,所述第一字符组包括:数字和表征图的特征字符的组合;根据所识别到的第一字符组,在所述包含有预设页头的附图数据中确定出第二范围区域;对所述包含有预设页头的附图数据中的第二范围区域进行图像数据识别,将识别到的第二字符组作为所述图像标识,其中,所述第二字符组包括:与所述包含有预设页头的附图数据中的特征线的距离小于预设的第二距离阈值的数字和/或字母的组合,所述特征线包括:携带有箭头的直线,和/或,携带有箭头的曲线。
在一个实施例中,所述附图数据处理单元,在确定出所述附图数据中所包含的图像标识后,具体还可以用于根据所述图像标识,生成对应的图像标识的身份序号,并将所述图像标识的身份序号保存于所述附图数据中。
在一个实施例中,所述文本数据处理单元具体实施时,可以按照以下方式确定所述文本数据中的关键词组,包括:检索所述文本数据中的小节标题;根据所述小节标题,将所述文本数据划分为多个文本区域;从所述多个文本区域中筛选出符合预设要求的文本区域,作为第三范围区域,其中,所述第三范围区域至少包括小节标题为附图说明或者BRIFE DESCRIPTION OF DRAWINGS的文本区域;根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组。
在一个实施例中,在检索所述文本数据,确定所述文本数据不包含有小节标题的情况下,所述文本数据处理单元具体实施时,还可以用于利用预设的区域划分模型对所述文本数据进行处理,以从所述文本数据中确定出第三范围区域,其中,所述预设的区域划分模型为预先利用样本文本数据对神经网络进行模型训练得到的。
在一个实施例中,为了能够根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组,所述文本数据处理单元具体实施时,可以对所述文本数据中的第三范围区域进行文本数据识别,识别出第 三字符组,其中,所述第三字符组包括以下至少之一:数字和名词的组合、字母和名词的组合、数字和表征图的字符的组合;将所识别出的第三字符组作为所述关键词组。
在一个实施例中,在识别出多个语义近似的第三字符组的情况下,所述文本数据处理单元具体实施时,还可以用于:统计所述多个语义近似的第三字符组中的各个第三字符组在文本数据中的出现频次;将出现频次最高的第三字符组确定为所述关键词组。
在一个实施例中,在识别出多个第三字符组的情况下,所述文本数据处理单元具体实施时,还可以用于:统计所述多个第三字符组中的各个第三字符组所在文本段落所包含的字符数;将所在的文本段落所包含的字符数最少的第三字符组作为所述关键词组。
在一个实施例中,所述第三字符组还包括:多个数字和分隔符号的组合,所述文本数据处理单元在对所述文本数据中的第三范围区域进行数据处理,识别出包含有多个数字和分隔符号的组合的第三字符组后,具体实施时,所述文本数据处理单元具体还可以按照以下程序执行:识别所述分隔符号的类型;根据与所述分隔符号的类型匹配的拆分规则,将所述多个数字拆分为多个待拼接的数字字符;将所述多个待拼接的数字字符与所述第三字符组中的名词、字母或表征图的字符分别进行拼接,得到多个关键词组。
在一个实施例中,在确定出所述文本数据中的关键词组后,所述文本数据处理单元具体实施时,还可用于根据所述关键词组,生成对应的关键词组的身份序号,并将所述关键词组的身份序号保存于所述文本数据中。
需要说明的是,上述实施例阐明的单元、装置或模块等,具体可以由计算机芯片或实体实现,或者由具有某种功能的产品来实现。为了描述的方便,描述以上装置时以功能分为各种模块分别描述。当然,在实施本说明书时可以把各模块的功能在同一个或多个软件和/或硬件中实现,也可以将实现同一功能的模块由多个子模块或子单元的组合实现等。以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。
由上可见,本申请提供的附图页面的展示装置由于通过接收并响应用户针对展示模块所展示的附图页面中图像标识的触发操作,确定出被触发的图像标识;再通过显示模块根据所获取的目标文件的附图数据和文本数据,在所述附图页面中显示出被触发的图像标识的匹配文本,以对图像标识所指示的附图或者附图中的内容对象进行描述说明,使得用户不再需要自己来回地在目标文件的文本中翻找与目标附图中的所感兴趣的图像标识相关的文本语句,就能准确地获取到与该图像标识相关的其他信息,方便用户高效地阅读、理解附图页面中的附图内容,从而简化了用户操作,提高了用户的使用体验。
本申请还提供了一种具体的客户端设备,可以参阅图13所示,其中,所述客户端设备包括网络通信接口、处理器以及存储器,上述结构通过内部线缆相连,以便各个结构可以进行具体的数据交互。
其中,所述网络通信接口,具体可以用于获取目标文件的附图数据和文本数据,并接收针对目标文件中的目标附图的展示操作。
所述处理器,具体可以用于展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图中包含有图像标识;接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识;根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标 识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
所述存储器,具体可以用于存储所述处理器所基于的相应的指令程序。
在本实施例中,所述网络通信接口可以是与不同的通信协议进行绑定,从而可以发送或接收不同数据的虚拟端口。例如,所述网络通信接口可以是负责进行web数据通信的80号端口,也可以是负责进行FTP数据通信的21号端口,还可以是负责进行邮件数据通信的25号端口。此外,所述网络通信接口还可以是实体的通信接口或者通信芯片。例如,其可以为无线移动网络通信芯片,如GSM、CDMA等;其还可以为Wifi芯片;其还可以为蓝牙芯片。
在本实施例中,所述处理器可以按任何适当的方式实现。例如,处理器可以采取例如微处理器或处理器以及存储可由该(微)处理器执行的计算机可读程序代码(例如软件或固件)的计算机可读介质、逻辑门、开关、专用集成电路(Application Specific Integrated Circuit,ASIC)、可编程逻辑控制器和嵌入微控制器的形式等等。本说明书并不作限定。
在本实施例中,所述存储器可以包括多个层次,在数字系统中,只要能保存二进制数据的都可以是存储器;在集成电路中,一个没有实物形式的具有存储功能的电路也叫存储器,如RAM、FIFO等;在系统中,具有实物形式的存储设备也叫存储器,如内存条、TF卡等。
本说明书实施例还提供了一种基于上述附图页面的展示方法的计算机存储介质,所述计算机存储介质存储有计算机程序指令,在所述计算机程序指令被执行时实现:展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图中包含有图像标识;接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识;根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
在本实施例中,上述存储介质包括但不限于随机存取存储器(Random Access Memory,RAM)、只读存储器(Read-Only Memory,ROM)、缓存(Cache)、硬盘(Hard Disk Drive,HDD)或者存储卡(Memory Card)。所述存储器可以用于存储计算机程序指令。网络通信单元可以是依照通信协议规定的标准设置的,用于进行网络连接通信的接口。
在本实施例中,该计算机存储介质存储的程序指令具体实现的功能和效果,可以与其它实施方式对照解释,在此不再赘述。
为了满足不同用户的多样化要求,针对另一种应用场景,参阅图14所示,本申请实施例还提供了另一种附图页面的展示方法,具体实施时,可以包括以下内容:
S1401:展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述目标附图中包含有图像标识。
在本实施例中,在客户端设备展示包含有目标文件中的目标附图的页面前,所述方法具体实施时还可以包括接收用户发送的针对目标文件中的目标附图的页面展示请求。
在本实施例中,上述页面展示请求具体可以理解为一种携带有能够指示用户所关注或者想要查询的目标文件中的目标附图的请求数据。客户端设备可以根据该页面展示请求,确定出用户所关注或者想要查询的目标文件,以及相应选中的目标附图,进而可以根据目标文件的附图数据和文本数据生成并向用户展示 包含有上述目标附图的附图页面。
在本实施例中,具体实施时,用户可以在客户端设备所展示的查询框内输入自己想要查询的文件的名称或编号,来指示目标文件。这时,客户端设备可以根据用户输入的目标文件的名称或编号,展示出针对该目标文件的目录页面。进一步,用户可以在该目录页面中通过点击或勾选等操作选择出想要查询的附图作为目标附图,从而客户端设备可以接收到用户针对目标文件中的目标附图的页面展示请求。当然,上述所列举的接收针对目标文件中的目标附图的页面展示请求的方式只是一种示意性说明。具体实施时,根据具体情况,用户也可以直接在所展示的查询框内按照预设的输入规则,同时输入想要查询的文件的名称或编号,以及想要查询的附图的具体编号,并用预设的连接符(例如“&”)进行分隔,客户端设备可以通过接收用户按照预设的输入规则输入的上述参数数据接收到针对目标文件中的目标附图的页面展示请求等。对于上述接收针对目标文件中的目标附图的页面展示请求的具体实现方式,本说明书不作限定。
在本实施例中,在展示包含有目标文件中的目标附图的页面前,所述方法具体实施时,还包括:获取目标文件的附图数据和文本数据,其中,所述附图数据中包含有已经预先识别确定出的图像标识,所述文本数据中包含有预先识别确定出的关键词组。
在本实施例中,上述展示包含有目标文件中的目标附图的页面,具体实施时,可以包括:客户端设备根据用户发送的针对目标文件中的目标附图的页面展示请求,确定用户所选中的目标文件中的目标附图;进而可以获取目标文件的附图数据中包含有目标附图的附图数据;再根据上述包含有目标附图的附图数据生成并向用户展示出包含有目标文件的目标附图的附图页面。其中,目标附图中包含有图像标识。
在本实施例中,在客户端设备根据上述包含有目标附图的附图数据生成并展示包含有目标附图的附图页面的同时,还会对上述包含有目标附图的附图数据进行检索,确定上述包含有目标附图的附图数据所包含的图像标识,再获取目标文件的文本数据,并根据上述包含有目标附图的附图数据所包含的图像标识,检索目标文件的文本数据,确定出上述图像标识对应的关键词组,进而可以根据上述关键词组确定出上述图像标识的匹配文本。需要说明的是,在未接收进一步的操作指示之前,客户端设备暂时不会在上述附图页面上式显示出目标附图所包含的图像标识的匹配文本。
S1403:接收第一类触发操作,其中,所述第一类触发操作用于控制显示所述目标附图中所包含的图像标识的匹配文本。
在本实施例中,上述第一类触发操作具体可以理解为一种用于控制显示所述目标附图中所包含的图像标识的匹配文本的预设触发操作。具体实施时,可以根据预先协商好的规则,确定以下所列举的多个预设触发操作中的任意一个作为第一类触发操作:对页面中某个特定位置的按压操作、选中操作、点击操作等。当然,需要说明的是,上述所列举的第一类触发操作只是为了更好地说明本说明书实施例。具体实施时,根据具体情况和客户需要,还可以引入其他合适的操作作为上述第一类触发操作。对此,本说明书不作限定。
在本实施例中,为了配合支持上述第一类触发操作,具体实施时,可以参阅图9所示,可以预先在所展示的附图页面上设置有控制图标。例如图9中位于页面右上角的“匹配文本显示”图标。用户可以通过对控制图标的操作设置,根据自己的需求,来控制图像标识的匹配文本在所展示的页面中显示。例如,如果用户想要阅读图像标识的匹配文本时,参阅图9所示,用户可以将上述“匹配文本显示”图标向左拨动至“on”的位置处,即可以视为发出了第一类触发操作。相应的,客户端设备可以接收上述第一类触发操 作。当然,上述所列举的控制图标只是为了更好地说明本说明书实施例。具体实施时,根据具体情况和使用情景还可以提供其他形式类型的开关控制按钮,以便用户可以根据自己的需要,灵活地控制页面中的匹配文本的显示。对此,本说明书实施例不作限定。
S1405:响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,其中,所述图像标识的匹配文本包括用于描述图像标识的语句或词组。
在本实施例中,客户端设备在接收到上述第一类触发操作,进一步可以响应上述第一类触发操作,将之前根据目标文件的附图数据和文本数据所确定出的目标附图中所包含的各个图像标识的匹配文本,在当前的附图页面上一起显示出来,供用户阅读使用。
具体的,例如,可以参阅图9所示,当用户将上述“匹配文本显示”图标向左拨动至“on”的位置处时,客户端设备会响应上述触发操作,在该页面上显示出当前所展示的附图中所包含的所有图像标识的匹配文本。
在本实施例中,具体实施时,客户端设备可以响应所述第一类触发操作,根据已经确定出目标附图所包含的图像标识的匹配文本,在当前附图页面的基础上,在预设显示位置处增添显示出各个图像标识的匹配文本。
在本实施例中,具体实施时,客户端设备还可以响应所述第一类触发操作,根据上述包含有目标附图的附图数据,以及已经确定出的目标附图中所包含的图像标识的匹配文本,通过HTML Canvas重新绘制出一个展示页面展示给用户阅读;也可以通过专利文字排算法,显示变更后的数据,例如各个图像标识的匹配文本,从而向用户展示出包含的图像标识的匹配文本的页面;还可以通过在浏览器等阅读工具的设置开启的操作,展示包含的图像标识的匹配文本的页面给用户阅读等。
本申请提供的一种附图页面的展示方法,由于通过客户端设备先向用户展示包含有目标文件的目标附图页面;在用户需要时,接收到用户发出的第一类触发操作的情况下,再在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,使得用户不再需要自己来回地在目标文件的文本中翻找与目标附图中的所感兴趣的图像标识相关的文本语句,就能准确地获取到与该图像标识相关的其他信息,方便用户高效地阅读、理解附图页面中的附图内容,简化了用户操作,同时也不会对用户阅读目标附图本身内容造成影响,能够满足用户个性化的需求,从而进一步提高了用户的使用体验。
在一个实施例中,当用户只想看目标附图原本的内容,不想被页面所显示的匹配文本影响时,所述方法具体实施时,还可以包括以下内容:
S4:接收第二类触发操作,其中,所述第二类触发操作用于控制隐藏所述目标附图中所包含的图像标识的匹配文本。
在本实施例中,上述第二类触发操作具体可以理解为一种与第一类触发操作相对的,用于控制隐藏所述目标附图中所包含的图像标识的匹配文本的预设触发操作。具体实施时,类似于第一类触发操作,可以根据预先协商好的规则,确定以下所列举的多个预设触发操作中的任意一个区别于第一类触发操作的触发操作作为第二类触发操作:对页面中某个特定位置的按压操作、选中操作、点击操作等。当然,需要说明的是,上述所列举第二类触发操作只是为了更好地说明本说明书实施例。具体实施时,根据具体情况和客户需要,还可以引入其他合适的操作作为上述第二类触发操作。对此,本说明书不作限定。
在本实施例中,为了配合支持上述第二类触发操作,具体实施时,可以参阅图9所示,可以预先在所 展示的附图页面上设置有控制图标。例如图9中位于页面右上角的“匹配文本显示”图标。用户可以通过对控制图标的操作设置,根据自己的需求,来控制图像标识的匹配文本不在所展示的页面中显示。例如,如果用户不想要阅读图像标识的匹配文本时,参阅图10所示,用户可以将上述“匹配文本显示”图标向右拨动至“off”的位置处,即可以视为发出了第二类触发操作。相应的,客户端设备可以接收上述第二类触发操作。当然,上述所列举的控制图标只是为了更好地说明本说明书实施例。具体实施时,根据具体情况和使用情景还可以提供其他形式类型的开关控制按钮,以便用户可以根据自己的需要,灵活地控制页面中的匹配文本的不显示。对此,本说明书实施例不作限定。
S5:响应所述第二类触发操作,在所述页面中隐藏所述目标附图中所包含的图像标识的匹配文本。
在本实施例中,客户端设备在接收到上述第二类触发操作后,可以响应上述第二类触发操作通过多种隐藏方式隐藏所述页面中显示的图像标识的匹配文本。例如,客户端设备可以通过利用颜色为目标附图的背景色的图块覆盖所述图像标识的匹配文本,来隐藏所述图像标识的匹配文本。还可以通过将图像标识的匹配文本的字符调整设置为透明的字符,来隐藏所述图像标识的匹配文明。或者还可以只根据上述包含有目标附图的附图数据,通过HTML Canvas重新绘制出一个不携带有图像标识的匹配文本的展示页面,替换当前展示页面展示给用户达到隐藏图像标识的匹配文本的效果;也可以通过专利文字排算法,显示变更后的数据,进而展示出不包含的图像标识的匹配文本的页面;还可以通过在浏览器等阅读工具的设置关闭的操作,展示不携带的图像标识的匹配文本的页面给用户阅读等。当然上述所列举的隐藏匹配文本的方式只是为了更好地说明本说明书实施例。具体实施时,根据具体情况,还可以采用其他合适方式来隐藏图像标识的匹配文本。对此,本说明书不作限定。
在一个实施例中,考虑到为了满足不同用户的附图阅读习惯,也为了方便用户能够更加集中的了解目标附图中的图像标识的匹配文本,减少所显示的匹配文本对目标附图本身内容的影响,以及避免不同图像标识的匹配文本之间出现遮挡影响用户阅读,参阅图15所示,上述响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,具体实施时,还可以包括以下内容:响应所述第一类触发操作,在所述页面中显示出目标附图的数据列表,其中,所述数据列表中包含有所述目标附图中的图像标识,以及图像标识的匹配文本。
在本实施例中,上述数据列表具有可以理解为一种按照一定的设置顺序设置有目标附图中所包含的各个图像标识,以及与各个图像标识对应的匹配文本的数据。上述数据列表可以是包含有分隔线框的列表,也可以是不包含有分隔线框,但相同的图像标识与匹配文本的对应关系清晰的文本。具体的,所述数据列表可以位于目标附图中的空白区域内,也可以位于所展示的附图页面中,目标附图外部,与目标附图接近的空白区域内。当然,上述所列举的数据列表只是一种示意性说明。具体实施时,根据具体情况还可以选择其他形式的数据列表。对于数据列表的具体形式本说明书不作限定。
基于页面所显示的上述数据列表,用户可以在阅读页面中的目标附图的信息时,很方便地对照参照目标附图旁边的数据列表,从而能够准确、高效地读取了解到各个图像标识所对应的相关文本信息,又不会由于页面中匹配文本显示影响对目标附图的阅读和理解。
在本实施例中,具体实施时,客户端设备可以在接收到第一类触发操作时,可以响应第一类触发操作,在当前所展示附图页面中显示出目标附图的数据列表,以便用户可以方便地了解目标附图中各个图像标识的匹配文本;在接收到第二类触发操作时,可以响应第二类触发操作,在当前所展示附图页面中隐藏所显 示的目标附图数据的数据列表,以便用户可以清晰不受干扰地阅读目标附图本身内容,满足用户的多样化使用需求。
在本实施例中,上述数据列表具体可以是根据与目标附图数据对应的匹配的文本数据生成的。其中,上述目标附图数据具体可以理解为目标文件的附图数据中用于生成目标附图的附图数据,其中,上述目标附图数据中包含有该目标附图所包含的已经预先识别确定出的图像标识。上述匹配的文本数据具体可以理解为目标文件的文本数据中用于生成目标附图中所包含的各个图像标识的匹配文本的文本数据,其中,上述匹配文本数据中包含有预先被确定出的与目标附图中的各个图像标识分别对应的关键词组。
在一个实施例中,在响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本后,所述方法具体实施时,还可以包括以下内容:接收并响应针对所述目标附图中的图像标识的触发操作,确定出目标附图中被触发的图像标识;在所述数据列表中以预设的显示方式显示所述被触发的图像标识和被触发的图像标识的匹配文本。
在本实施例中,具体实施时,上述附图页面所展示的数据列表中的图像标识和匹配文本,与目标附图中的图像标识可以设置相互对应关联。
在本实施例中,具体实施时,可以参阅图15所示,为了方便用户阅读,进一步提高用户的使用体验,具体实施时,在附图页面所显示的目标附图中的图像标识没有被触发的情况下,数据列表中的图像标识和图像标识的匹配文本可以先以一种正常未被触发的显示方式显示(例如可以以灰度较浅或者较透明的字体形式显示)。当用户对目标附图中的某个图像标识感兴趣,并针对该图像标识发出了触发操作时,客户端设备可以接收并响应上述触发操作,确定出目标附图中的被触发的图像标识。进而可以根据目标附图中触发的图像标识的身份序号,确定出在数据列表中相同图像标识,和该相同图像标识的匹配文本作为数据列表中被触发的图像标识,和被触发的图像标识的匹配文本。再通过预设的显示方式(即区别上述正常未被触发的显示方式,例如通过灰度更深或者不透明的字体,或者悬浮框的形式进行显示)显示出数据列表中被触发的图像标识,和被触发的图像标识的匹配文本。这样可以为用户从包含有多个图像标识的匹配文本中凸显出用户所选择的图像标识的匹配文本,进一步提高用户体验。
在一个实施例中,对应的,在响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本后,所述方法具体实施时,还可以包括以下内容:接收并响应针对所述数据列表中的图像标识的触发操作,确定出数据列表中被触发的图像标识;在所述目标附图中以预设的显示方式显示所述被触发的图像标识。
在本实施例中,具体实施时,可以参阅图15所示,具体实施时,在附图页面所显示的数据列表中的图像标识(或者图像标识的匹配文本)没有被触发的情况下,目标附图中的图像标识可以先以一种正常未被触发的显示方式显示(例如可以以灰度较浅或者较透明的字体形式显示)。当用户对数据列表中的某个图像标识感兴趣,并针对该图像标识发出了触发操作时,客户端设备可以接收并响应上述触发操作,确定出数据列表中的被触发的图像标识。进而可以根据数据列表中被触发的图像标识的身份序号(或者被触发的匹配文本所对应的图像标识的身份序号),确定出在目标附图中相同图像标识作为目标附图中被触发的图像标识。再通过预设的显示方式(即区别上述正常未被触发的显示方式,例如通过灰度更深或者不透明的字体,或者悬浮框的形式进行显示)显示出目标附图中被触发的图像标识。这样可以更加清楚地为用户从包含有多个图像标识的目标附图中凸显出用户所选择的图像标识,进一步提高用户体验。
在一个实施例中,如果用户对数据列表中的多个图像标识感兴趣,可以对数据列表中的多个图像标识进行触发操作;相应的,在附图页面中会通过预设的显示方式显示出多个被触发的图像标识,和/或多个被触发的图像标识的匹配文本。
具体的,例如,可以参阅图29所示。用户同时对图像标识280、282和292都感兴趣。这时,用户可以在数据列表中通过分别点击数据列表中的图像标识280、282和292,发出触发操作。相应的,客户端设备可以接收并响应上述操作,在数据列表中,以标亮的方式同时显示出被触发的图像标识280、282和292,以及对应的匹配文本,即:“280.locking arm”、“282.central section”和“292.central section”。同时,还可以在附图页面的目标附图中,以标亮的方式对应显示出原本被隐藏的280、282和292的匹配文本。
当然,用户也可以在目标附图中通过分别点击图像标识280、282和292,发出触发操作。相应的,客户端设备可以接收并响应上述操作,在附图页面的目标附图中,以标亮的方式显示出原本被隐藏的280、282和292的匹配文本。同时,还可以在数据列表中,以标亮的方式对应显示出被触发的图像标识280、282和292,以及对应的匹配文本。
在本实施例中,在数据列表中还可以在被触发的图像标识的附近显示出预设的标识标记,例如,图29中的眼睛形状的图像标记,来标识出该图像标识,以及图像标识的匹配文本当前处于被触发的显示状态。
在一个实施例中,在展示包含有目标文件中的目标附图的页面前,所述方法具体实施时,还可以包括以下内容:接收针对所述目标文件中的目标附图的展示操作;响应针对所述目标文件中的目标附图的展示操作,生成并发送针对目标文件中的目标附图的数据获取请求;获取目标附图数据和匹配的文本数据,其中,所述目标附图数据包含有目标附图中的图像标识,所述匹配的文本数据包含有目标附图中的图像标识的匹配文本,所述目标附图数据和所述匹配的文本数据根据所述目标文件的附图数据和文本数据确定。
在本实施例中,具体实施时,客户端设备可以通过接收用户针对目标文件的选择操作,以及用户针对目标文件中的目标附图的选择操作,接收到上述针对目标文件中的目标附图的展示操作。
在本实施例中,客户端设备在接收到上述针对目标文件中的目标附图的展示操作,可以生成并发送针对目标文件中的目标附图的数据获取请求,以请求获取上述目标附图数据,和匹配的文本数据。
在本实施例中,上述获取目标附图数据和匹配的文本数据,具体实施时,所获取的上述目标附图数据,和匹配的文本数据具体可以是服务器根据目标文件的附图数据和文本数据进行相应处理后得到并提供给客户端设备的,也可以是客户端设备自行对目标文件的附图数据和文本数据进行相应处理所得到的。
在一个实施例中,在响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本后,所述方法具体实施时,还可以包括以下内容:接收第三类触发操作,其中,所述第三类触发操作用于控制所述目标附图中所包含的图像标识的匹配文本的显示位置;响应所述第三类触发操作,在所述页面中的预设显示区域,显示所述目标附图中所包含的图像标识的匹配文本,其中,所述预设显示区域包括页面中的空白区域。
在本实施例中,上述第三类触发操作具体可以理解为一种与第一类触发操作、第二类触发操作不同的,用于控制目标附图中所包含的图像标识的匹配文本的显示位置的预设触发操作。具体实施时,类似于第一类触发操作、第二类触发操作,可以根据预先协商好的规则,确定以下所列举的多个预设触发操作中的任意一个区别于第一类触发操作、第二类触发操作的触发操作作为第三类触发操作:对页面中某个特定位置的按压操作、选中操作、点击操作等。当然,需要说明的是,上述所列举第三类触发操作只是为了更好地 说明本说明书实施例。具体实施时,根据具体情况和客户需要,还可以引入其他合适的操作作为上述第二类触发操作。对此,本说明书不作限定。
在本实施例中,在用户觉得图像标识的匹配文本的显示位置不符合用户的阅读习惯,或者对用户的阅读造成影响,例如,所显示的匹配文本遮挡了目标附图原本的图像内容,则可以通过在客户端设备上发出第三类预设触发操作以对匹配文本的显示位置进行调整。
在本实施例中,客户端设备在接收到用户发出的第三类触发操作后,可以响应该第三类触发操作,在所述页面中的预设显示区域,显示所述目标附图中所包含的图像标识的匹配文本。其中,所述预设显示区域具体可以理解为页面中用于显示目标附图中所包含的各个图像标识的匹配文本的区域。具体可以包括页面中的空白区域等等。当然,上述所列举的预设显示区域只是一种示意性说明。具体实施时,根据具体情况,上述预设显示区域还可以包括多个不同类型的区域供用户灵活选择。例如,所述预设显示区域具体可以包括:目标附图内部的空白区域、页面中目标附图外部的空白区域、图像标识的左侧区域、图像标识的上方区域等等。此外,还可以提供用户自定义设置预设显示区域。例如,用户还可以通过触发第三类触发操作,拖动匹配文本到页面中任意区域作为预设显示区域,以显示该匹配文本。从而可以满足不同用户的多样化使用需求。
在一个实施例中,在响应所述第三类触发操作,在所述页面中的预设显示区域,显示所述目标附图中所包含的图像标识的匹配文本后,所述方法具体实施时,还可以包括以下内容:接收并响应针对所述目标附图中的图像标识的触发操作,确定出目标附图中被触发的图像标识;在所述预设显示区域中以预设的显示方式显示被触发的图像标识的匹配文本。
在本实施例中,具体实施时,为了方便用户辨识,凸显用选中的图像标识,上述也是显示区域中的图像标识与目标附图中的图像标识可以设置成相互关联对应。这样,当用户对预设显示区域中某个图像标识的匹配文本感兴趣,可以针对该匹配文本发起触发操作。客户端设备可以接收并响应针对该匹配文本的触发操作,确定出预设显示区域中被触发的图像标识的匹配文本;进而可以在目标附图中确定并以区别于其他图像标识的显示方式显示出目标附图中与预设显示区域中被触发的匹配文本对应的图像标识。
在本实施例中,上述预设的显示方式具体可以包括以下至少之一:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以悬浮框的方式显示;以字体为斜体的方式显示;以字体的字号变大的方式显示等。当然,具体实施时,根据具体情况还可以采用其他合适的显示方式作为上述预设的显示方式。对此,本说明书不作限定。
在一个实施例中,在响应所述第三类触发操作,在所述页面中的预设显示区域,显示所述目标附图中所包含的图像标识的匹配文本后,所述方法具体实施时,还可以包括以下内容:接收并响应针对所述预设的显示区域中的匹配文本的触发操作,确定出预设的显示区域中被触发的匹配文本;在所述目标附图中以预设的显示方式显示被触发的匹配文本所对应的图像标识。
在本实施例中,具体实施时,当用户对目标附图中某个图像标识感兴趣,也可以针对该图像标识发起触发操作。客户端设备可以接收并响应针对该图像标识的触发操作,确定出目标附图中被触发的图像标识;进而可以在预设显示区域中确定并以区别与其他匹配文本的显示方式显示出预设显示区域中与目标附图中被触发的图像标识对应的匹配文本。
在一个实施例中,所述附图数据中所包含的图像标识具体可以按照以下方式识别确定:从所述附图数 据中筛选出包含有预设页头的附图数据;对所述包含有预设页头的附图数据中的第一范围区域进行图像数据识别,将识别到的第一字符组作为所述图像标识,其中,所述第一字符组包括:数字和表征图的特征字符的组合;根据所识别到的第一字符组,在所述包含有预设页头的附图数据中确定出第二范围区域;对所述包含有预设页头的附图数据中的第二范围区域进行图像数据识别,将识别到的第二字符组作为所述图像标识,其中,所述第二字符组包括:与所述包含有预设页头的附图数据中的特征线的距离小于预设的第二距离阈值的数字和/或字母的组合,所述特征线包括:携带有箭头的直线,和/或,携带有箭头的曲线。
在一个实施例中,在确定出所述附图数据中所包含的图像标识后,所述方法具体实施时还可以包括以下内容:根据所述图像标识,生成对应的图像标识的身份序号,并将所述图像标识的身份序号保存于所述附图数据中。
在一个实施例中,上述确定所述文本数据中的关键词组,具体实施时,可以包括以下内容:检索所述文本数据中的小节标题;根据所述小节标题,将所述文本数据划分为多个文本区域;从所述多个文本区域中筛选出符合预设要求的文本区域,作为第三范围区域,其中,所述第三范围区域至少包括小节标题为附图说明或者BRIFE DESCRIPTION OF DRAWINGS的文本区域;根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组。
在一个实施例中,在检索所述文本数据,确定所述文本数据不包含有小节标题的情况下,所述方法具体实施时,还可以包括以下内容:利用预设的区域划分模型对所述文本数据进行处理,以从所述文本数据中确定出第三范围区域,其中,所述预设的区域划分模型为预先利用样本文本数据对神经网络进行模型训练得到的。
在一个实施例中,上述根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组,具体实施时,还可以包括以下内容:对所述文本数据中的第三范围区域进行文本数据识别,识别出第三字符组,其中,所述第三字符组包括以下至少之一:数字和名词的组合、字母和名词的组合、数字和表征图的字符的组合;将所识别出的第三字符组作为所述关键词组。
在一个实施例中,在识别出多个语义近似的第三字符组的情况下,所述方法具体实施时,还可以包括以下内容:统计所述多个语义近似的第三字符组中的各个第三字符组在文本数据中的出现频次;将出现频次最高的第三字符组确定为所述关键词组。
在一个实施例中,在识别出多个第三字符组的情况下,所述方法具体实施时,还可以包括以下内容:统计所述多个第三字符组中的各个第三字符组所在文本段落所包含的字符数;将所在的文本段落所包含的字符数最少的第三字符组作为所述关键词组。
在一个实施例中,所述第三字符组还可以包括:多个数字和分隔符号的组合,在对所述文本数据中的第三范围区域进行数据处理,识别出包含有多个数字和分隔符号的组合的第三字符组后,所述方法具体实施时,还可以包括以下内容:识别所述分隔符号的类型;根据与所述分隔符号的类型匹配的拆分规则,将所述多个数字拆分为多个待拼接的数字字符;将所述多个待拼接的数字字符与所述第三字符组中的名词、字母或表征图的字符分别进行拼接,得到多个关键词组。
在一个实施例中,在确定出所述文本数据中的关键词组后,所述方法具体实施时,还可以包括以下内容:根据所述关键词组,生成对应的关键词组的身份序号,并将所述关键词组的身份序号保存于所述文本数据中。
由上可见,本申请提供的一种附图页面的展示方法,由于先通过客户端设备向用户展示包含有目标文件的目标附图页面;在用户需要时,接收到用户发出的第一类触发操作的情况下,再在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,使得用户不再需要自己来回地在目标文件的文本中翻找与目标附图中的所感兴趣的图像标识相关的文本语句,就能准确地获取到与该图像标识相关的其他信息,方便用户高效地阅读、理解附图页面中的附图内容,简化了用户操作,同时也不会对用户阅读目标附图本身内容造成影响,能够满足用户个性化的需求,从而进一步提高了用户的使用体验;还通过以数据列表的方式在页面中显示出包含有目标附图中的图像标识的匹配文本,并通过将数据列表中的图像标识与目标附图中的图像标识进行对应关联,方便了用户阅读附图内容和文本内容,进一步提高了用户的使用体验。
基于上述附图页面的展示方法,本申请还提供了一种对应的附图页面的展示装置,具体可以包括以下的结构模块:
展示模块,具体可以用于展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述目标附图中包含有图像标识;
确定模块,具体可以用于接收第一类触发操作,其中,所述第一类触发操作用于控制显示所述目标附图中所包含的图像标识的匹配文本;
显示模块,具体可以用于响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,其中,所述图像标识的匹配文本包括用于描述图像标识的语句或词组。
在一个实施例中,所述确定模块,具体还可以用于接收第二类触发操作,其中,所述第二类触发操作用于控制隐藏所述目标附图中所包含的图像标识的匹配文本;
所述显示模块,具体还可以用于响应所述第二类触发操作,在所述页面中隐藏所述目标附图中所包含的图像标识的匹配文本。
在一个实施例中,为了能够响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,所述显示模块具体可以用于响应所述第一类触发操作,在所述页面中显示出目标附图的数据列表,其中,所述数据列表中包含有所述目标附图中的图像标识,以及图像标识的匹配文本。
在一个实施例中,所述确定模块,具体还可以用于接收并响应针对所述目标附图中的图像标识的触发操作,确定出目标附图中被触发的图像标识;
所述显示模块,具体还可以用于在所述数据列表中以预设的显示方式显示所述被触发的图像标识,和被触发的图像标识的匹配文本。
在一个实施例中,所述确定模块,具体还可以用于接收并响应针对所述数据列表中的图像标识的触发操作,确定出数据列表中被触发的图像标识;
所述显示模块,具体还可以用于在所述目标附图中以预设的显示方式显示所述被触发的图像标识。
在一个实施例中,所述装置还包括获取模块,所述获取模块具体可以用于在展示包含有目标文件中的目标附图的页面前,接收针对所述目标文件中的目标附图的展示操作;响应针对所述目标文件中的目标附图的展示操作,生成并发送针对目标文件中的目标附图的数据获取请求;获取目标附图数据和匹配的文本数据,其中,所述目标附图数据包含有目标附图中的图像标识,所述匹配的文本数据包含有目标附图中的图像标识的匹配文本,所述目标附图数据和所述匹配的文本数据根据所述目标文件的附图数据和文本数据 确定。
在一个实施例中,所述确定模块,具体还可以用于接收第三类触发操作,其中,所述第三类触发操作用于控制所述目标附图中所包含的图像标识的匹配文本的显示位置;
所述显示模块,具体还可以用于响应所述第三类触发操作,在所述页面中的预设显示区域,显示所述目标附图中所包含的图像标识的匹配文本,其中,所述预设显示区域包括页面中的空白区域。
在一个实施例中,所述确定模块,具体还可以用于接收并响应针对所述目标附图中的图像标识的触发操作,确定出目标附图中被触发的图像标识;
所述显示模块,具体还可以用于在所述预设显示区域中以预设的显示方式显示被触发的图像标识的匹配文本。
在一个实施例中,所述确定模块,具体还可以用于接收并响应针对所述预设的显示区域中的匹配文本的触发操作,确定出预设的显示区域中被触发的匹配文本;
所述显示模块,具体还可以用于在所述目标附图中以预设的显示方式显示被触发的匹配文本所对应的图像标识。
在一个实施例中,所述装置还包括数据处理模块,所述数据处理模块具体可以用于对目标文件的文本数据和附图数据分别进行相应处理,得到所述获取模块所得到的包含有识别确定出的图像标识的附图数据和包含有识别确定出的关键词组的文本数据。
其中,所述数据处理模块具体包括附图数据处理单元和文本数据处理单元。所述附图数据处理单元具体可以用于识别确定出附图数据中所包含的图像标识。所述文本数据处理单元具体可以用于识别确定出文本数据中所包含的关键词组。
在一个实施例中,所述附图数据处理单元具体实施时,可以按照以下方式确定出所述附图数据中的图像标识:从所述附图数据中筛选出包含有预设页头的附图数据;对所述包含有预设页头的附图数据中的第一范围区域进行图像数据识别,将识别到的第一字符组作为所述图像标识,其中,所述第一字符组包括:数字和表征图的特征字符的组合;根据所识别到的第一字符组,在所述包含有预设页头的附图数据中确定出第二范围区域;对所述包含有预设页头的附图数据中的第二范围区域进行图像数据识别,将识别到的第二字符组作为所述图像标识,其中,所述第二字符组包括:与所述包含有预设页头的附图数据中的特征线的距离小于预设的第二距离阈值的数字和/或字母的组合,所述特征线包括:携带有箭头的直线,和/或,携带有箭头的曲线。
在一个实施例中,所述附图数据处理单元,在确定出所述附图数据中所包含的图像标识后,具体还可以用于根据所述图像标识,生成对应的图像标识的身份序号,并将所述图像标识的身份序号保存于所述附图数据中。
在一个实施例中,所述文本数据处理单元具体实施时,可以按照以下方式确定所述文本数据中的关键词组,包括:检索所述文本数据中的小节标题;根据所述小节标题,将所述文本数据划分为多个文本区域;从所述多个文本区域中筛选出符合预设要求的文本区域,作为第三范围区域,其中,所述第三范围区域至少包括小节标题为附图说明或者BRIFE DESCRIPTION OF DRAWINGS的文本区域;根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组。
在一个实施例中,在检索所述文本数据,确定所述文本数据不包含有小节标题的情况下,所述文本数 据处理单元具体实施时,还可以用于利用预设的区域划分模型对所述文本数据进行处理,以从所述文本数据中确定出第三范围区域,其中,所述预设的区域划分模型为预先利用样本文本数据对神经网络进行模型训练得到的。
在一个实施例中,为了能够根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组,所述文本数据处理单元具体实施时,可以对所述文本数据中的第三范围区域进行文本数据识别,识别出第三字符组,其中,所述第三字符组包括以下至少之一:数字和名词的组合、字母和名词的组合、数字和表征图的字符的组合;将所识别出的第三字符组作为所述关键词组。
在一个实施例中,在识别出多个语义近似的第三字符组的情况下,所述文本数据处理单元具体实施时,还可以用于:统计所述多个语义近似的第三字符组中的各个第三字符组在文本数据中的出现频次;将出现频次最高的第三字符组确定为所述关键词组。
在一个实施例中,在识别出多个第三字符组的情况下,所述文本数据处理单元具体实施时,还可以用于:统计所述多个第三字符组中的各个第三字符组所在文本段落所包含的字符数;将所在的文本段落所包含的字符数最少的第三字符组作为所述关键词组。
在一个实施例中,所述第三字符组还包括:多个数字和分隔符号的组合,所述文本数据处理单元在对所述文本数据中的第三范围区域进行数据处理,识别出包含有多个数字和分隔符号的组合的第三字符组后,具体实施时,所述文本数据处理单元具体还可以按照以下程序执行:识别所述分隔符号的类型;根据与所述分隔符号的类型匹配的拆分规则,将所述多个数字拆分为多个待拼接的数字字符;将所述多个待拼接的数字字符与所述第三字符组中的名词、字母或表征图的字符分别进行拼接,得到多个关键词组。
在一个实施例中,在确定出所述文本数据中的关键词组后,所述文本数据处理单元具体实施时,还可用于根据所述关键词组,生成对应的关键词组的身份序号,并将所述关键词组的身份序号保存于所述文本数据中。
由上可见,本申请提供的附图页面的展示装置由于先通过展示模块向用户展示包含有目标文件的目标附图页面;通过确定模块和显示模块在用户需要时,接收到用户发出的第一类触发操作的情况下,再在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,使得用户不再需要自己来回地在目标文件的文本中翻找与目标附图中的所感兴趣的图像标识相关的文本语句,就能准确地获取到与该图像标识相关的其他信息,方便用户高效地阅读、理解附图页面中的附图内容,简化了用户操作,同时也不会对用户阅读目标附图本身内容造成影响,能够满足用户个性化的需求,从而进一步提高了用户的使用体验。
本申请还提供了一种具体的客户端设备,其中,所述客户端设备包括网络通信接口、处理器以及存储器,上述结构通过内部线缆相连,以便各个结构可以进行具体的数据交互。
其中,所述网络通信接口,具体可以用于获取目标文件的附图数据和文本数据,并接收针对目标文件中的目标附图的展示操作。
所述处理器,具体可以用于展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述目标附图中包含有图像标识;接收第一类触发操作,其中,所述第一类触发操作用于控制显示所述目标附图中所包含的图像标识的匹配文本;响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,其中,所述图像标识的匹配文本包括用于描述图像标识的语句或词组。
所述存储器,具体可以用于存储所述处理器所基于的相应的指令程序。
本说明书实施例还提供了一种基于上述附图页面的展示方法的计算机存储介质,所述计算机存储介质存储有计算机程序指令,在所述计算机程序指令被执行时实现:展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述目标附图中包含有图像标识;接收第一类触发操作,其中,所述第一类触发操作用于控制显示所述目标附图中所包含的图像标识的匹配文本;响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,其中,所述图像标识的匹配文本包括用于描述图像标识的语句或词组。
在本实施例中,该计算机存储介质存储的程序指令具体实现的功能和效果,可以与其它实施方式对照解释,在此不再赘述。
本申请还提供了一种文本页面的展示方法,该方法具体实施时,可以参阅图16所示,可以包括以下内容:
S1601:展示包含有目标文件中的目标文本的页面,其中,所述目标文件包括附图数据和文本数据,所述目标文本中包含有关键词组;
S1603:接收并响应针对所述目标文本中的关键词组的触发操作,确定出被触发的关键词组;
S1605:在所述页面中显示出被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
在本实施例中,上述针对关键词组的触发操作具体可以是用户通过客户端设备对关键词组的点击触发,也可以是用户通过客户端设备输入针对关键词组的命令语句的输入触发等。对于上述针对关键词组的触发操作的具体形式或内容,本说明书不作限定。
在本实施例中,上述匹配的附图具体可以理解为包含有与关键词组匹配的图像标识的附图,即包含有关键词组所指示的附图或者附图中的内容对象的附图。
在本实施例中,用户在客户端设备根据目标文件的文本数据所展示的文本页面中阅读文本内容时,如果对文本页面中的某个或某几个关键词组感兴趣,想要了解该关键词组对应的附图信息,可以参阅图17所示,通过鼠标点击感兴趣的关键词组发起触发操作,这时在该文本页面上会显示出对应关键词组的匹配附图,即包含有关键词组所指示的附图或者附图中的内容对象的附图。
在本实施例中,具体显示时,可以以悬浮框的形式展示出包含有与该关键词组匹配的图像标识的匹配附图。当然,上述所列举的显示匹配附图的方式只是一种示意性说明。具体实施时,根据具体情况和用户需求,还可以采用其他合适的方式展示上述匹配附图。
在本实施例中,所述目标文件具体可以包括以下至少之一:专利文件、论文文件、合同文件、商标文件等。当然,上述所列举的目标文件只是一种示意性说明。具体实施时,根据具体情况和用户需求,还可以引入其他类型的文件作为上述目标文件。对此,本说明书不作限定。
在本实施例中,在展示包含有目标文件中的目标文本的页面之前,所述方法,具体实施时还可以包括以下内容:发送针对目标文件的获取请求;获取目标文件的附图数据和文本数据,其中,所述文本数据中至少包括关键词组,以及例如关键词组的身份序号、位置信息等相关信息,所述附图数据至少包括图像标识,以及图像标识的身份序号、位置信息等相关信息。
在本实施例中,具体实施时,可以先根据被触发的关键词组,结合目标文件的附图数据和文本数据,确定出被触发的关键词组的匹配附图,再在所述页面中显示出被触发的关键词组的匹配附图。
在本实施例中,上述根据被触发的关键词组,结合目标文件的附图数据和文本数据,确定出被触发的关键词组的匹配附图,具体实施时,可以包括以下内容:根据所述文本数据,确定所述被触发的关键词组的身份序号;确定出身份序号与所述被触发的关键词组的身份序号匹配的图像标识,作为关联的图像标识;根据所述关联的图像标识的位置信息,从所述附图数据中,获取包含有关联的图像标识的附图数据,作为匹配的附图数据;再根据上述匹配的附图数据生成被触发的关键词组的匹配附图。
在本实施例中,客户端设备可以从服务器处先获取服务器预先处理好的包含有已经识别确定出图像标识和关键词组的目标文件的附图数据和文本数据。也可以是客户端设备先获取服务器提供的目标文件的附图数据和文本数据,再对上述附图数据和文本数据分别进行相应处理,识别确定出附图数据中的图像标识,以及文本数据中的关键词组。
在本实施例中,还需要说明的是,具体实施时,考虑到客户端设备的运算能力、运算资源可能有限,上述根据被触发的关键词组,结合目标文件的附图数据和文本数据,确定出被触发的关键词组的匹配附图的数据处理过程也可以由服务器负责。即由客户端设备接收用户针对关键词组的触发操作,并将用户触发的关键词组的身份序号发送给服务器。再由服务器根据上述关键词组,确定出与被触发的关键词组对应的匹配附图,反馈给客户端设备。由客户端设备在向用户所展示的文本页面中显示出匹配附图。这样可以降低客户端的计算负担,提高处理效率。
由上可见,本申请提供的文本页面的展示方法,通过客户端设备接收并响应用户针对所展示的文本页面中关键词组的触发操作,确定出被触发的关键词组;再根据所获取的目标文件的附图数据和文本数据,在所附文本页面中显示出被触发的关键词组的匹配附图,以对关键词组所指示的附图或者附图中的内容对象进行对照显示,使得用户不再需要自己来回地在目标文件的附图中翻找与目标文本中的所感兴趣的关键词组相关的附图,方便用户高效地阅读、理解附文本面中的文本内容,从而简化了用户操作,提高了用户的使用体验。
基于上述文本页面的展示方法,本申请还提供了一种对应的文本页面的展示装置,可以参阅图18所示,该装置具体可以包括以下的结构模块:
展示模块,具体可以用于展示包含有目标文件中的目标文本的页面,其中,所述目标文件包括附图数据和文本数据,所述目标文本中包含有关键词组;
接收模块,具体可以用于接收并响应针对所述目标文本中的关键词组的触发操作,确定出被触发的关键词组;
显示模块,具体可以用于在所述页面中显示出被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
本申请还提供了一种具体的客户端设备,其中,所述客户端设备包括网络通信接口、处理器以及存储器,上述结构通过内部线缆相连,以便各个结构可以进行具体的数据交互。
其中,所述网络通信接口,具体可以用于获取目标文件的附图数据和文本数据。
所述处理器,具体可以用于展示包含有目标文件中的目标文本的页面,其中,所述目标文件包括附图数据和文本数据,所述目标文本中包含有关键词组;接收并响应针对所述目标文本中的关键词组的触发操 作,确定出被触发的关键词组;在所述页面中显示出被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
所述存储器,具体可以用于存储所述处理器所基于的相应的指令程序。
本说明书实施例还提供了一种基于上述附图页面的展示方法的计算机存储介质,所述计算机存储介质存储有计算机程序指令,在所述计算机程序指令被执行时实现:展示包含有目标文件中的目标文本的页面,其中,所述目标文件包括附图数据和文本数据,所述目标文本中包含有关键词组;接收并响应针对所述目标文本中的关键词组的触发操作,确定出被触发的关键词组;在所述页面中显示出被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
在本实施例中,该计算机存储介质存储的程序指令具体实现的功能和效果,可以与其它实施方式对照解释,在此不再赘述。
基于上述方案,本申请还提供了一种目标附图的生成方法,其中,上述目标附图的生成方法具体实施时,可以包括以下内容:
S1:获取目标文件的附图数据和文本数据,其中,所述附图数据包含有图像标识,所述文本数据包含有关键词组;
S2:接收并响应针对目标文件中附图的选择操作,根据所述附图数据和所述文本数据生成目标附图,其中,目标附图为所述选择操作选中的附图,所述目标附图包含有图像标识,以及图像标识的匹配文本,所述图像标识的匹配文本根据所述附图数据和文本数据生成。
在本实施例中,通过上述方法,客户端设备可以响应不同用户在不同应用场景下的使用需求,生成针对用户阅读、使用等交互更加友好、更加智能的目标附图。即,在目标文件所包含的原本附图的基础上,对附图进行了改进。具体的,可以通过上述包含有图像标识的附图数据和包含有关键词组的文本数据,自动在所生成的目标附图中增加了对目标附图所包含的图像标识的对应文本描述或解释,即匹配文本。这样可以使得所生成并提供给用户的目标附图,相对于原本的附图,具有更高的可读性,包含了更多的相关信息,具有更高的数据价值。
在一个实施例中,在生成目标附图后,所述方法具体实施时,还可以包括以下内容:接收展示请求;响应所述展示请求,展示所述目标附图。
在本实施例中,具体实施时,客户端设备在生成了目标附图后可以先将目标附图存储在客户端设备的存储设备上,当用户想要阅读,发出展示请求后,例如,用户点击了客户端设备所展示的该目标附图的图标后;再响应用户的展示请求,向用户展示携带有匹配文本的,具有相对更高的可读性和数据价值的目标附图。当然,需要说明的是,上述所列举的目标附图的应用只是一种示意性说明。具体实施时,根据具体的应用场景和用户需求,还可以利用所生成的目标附图进行其他类型的应用。对此,本说明书不作限定。
由上可见,本申请提供的目标附图的生成方法,通过根据目标文件的附图数据和文本数据,生成包含有目标附图中出现的图像标识的匹配文本的附图,从而使得所生成的目标附图更具有可读性,具有更高的数据价值,能够满足用户更加广泛的使用需求,提高了用户的使用体验。
本申请还提供了一种具体的客户端设备,其中,所述客户端设备包括网络通信接口、处理器以及存储器,上述结构通过内部线缆相连,以便各个结构可以进行具体的数据交互。
其中,所述网络通信接口,具体可以用于获取目标文件的附图数据和文本数据,其中,所述附图数据包含有图像标识,所述文本数据包含有关键词组。
所述处理器,具体可以用于接收并响应针对目标文件中附图的选择操作,根据所述附图数据和所述文本数据生成目标附图,其中,目标附图为所述选择操作选中的附图,所述目标附图包含有图像标识,以及图像标识的匹配文本。
所述存储器,具体可以用于存储所述处理器所基于的相应的指令程序。
本说明书实施例还提供了一种基于上述附图页面的展示方法的计算机存储介质,所述计算机存储介质存储有计算机程序指令,在所述计算机程序指令被执行时实现:获取目标文件的附图数据和文本数据,其中,所述附图数据包含有图像标识,所述文本数据包含有关键词组;接收并响应针对目标文件中附图的选择操作,根据所述附图数据和所述文本数据生成目标附图,其中,目标附图为所述选择操作选中的附图,所述目标附图包含有图像标识,以及图像标识的匹配文本。
在本实施例中,该计算机存储介质存储的程序指令具体实现的功能和效果,可以与其它实施方式对照解释,在此不再赘述。
本申请实施例还提供了一种针对所涉及的客户端设备比较简单的情形下的附图页面的展示方法,该方法可以应用于上述业务系统(即包含有客户端设备,和用于为用户提供目标文件查询服务的平台的服务器)中的客户端设备一侧,具体实施时,可以包括以下内容。
S1:发送针对目标文件中的目标附图的页面展示请求。
S2:接收目标附图数据,和匹配的文本数据;其中,所述匹配的文本数据为根据与所述目标附图中的图像标识和所述目标文件的文本数据生成的。
在本实施例中,上述目标附图具体可以理解为目标文件的附图数据中用于生成并展示用户所请求展示的目标文件中的目标附图的图像数据。
在本实施例中,上述匹配的文本数据具体可以理解为一种在目标文件的文本数据中所含有的,包含有与目标附图中的图像标识对应的关键词组的,可以生成图像标识的匹配文本的文本数据。
具体的,上述图像标识的匹配文本可以是一种文本词组,例如,用于描述图像标识所指示的附图或附图中的内容对象名称的词组。也可以是一种文本语句,例如,从目标文件的文本数据中摘取的对图像标识所指示的附图或附图中的内容对象的属性、使用、功能等特性的描述性语句,或者基于上述描述性语句通过语义识别得到的包含有上述描述性语句主要内容的具有一定概括性的文本识别结果等。当然,需要说明的是上述所列举的匹配文本只是一种示意性说明。对于上述匹配文本的具体形式和内容,本说明书不作限定。
在本实施例中,具体实施时,客户端设备可以通过有线或无线的方式,接收到由服务器响应针对目标文件中的目标附图的页面展示请求所确定并反馈回来的上述目标附图数据,和匹配的文本数据。
S3:根据所述目标附图,和匹配的文本数据,展示包含有目标附图的页面,并在所述页面中展示出目标附图中图像标识的匹配文本。
在本实施例中,客户端设备在接收到上述目标附图,和匹配的文本数据后,可以根据上述目标附图数据,和匹配的文本数据,向用户生成并展示出符合用户要求能够更加智能、清楚地显示目标附图的附图页面,即在所展示的附图页面中,不但会根据目标附图数据展示出对应的用户选中的目标附图,同时,还会 根据匹配的文本数据,自动在该页面中显示与目标附图所包含的图像标识的匹配文本。
本申请还提供了另一种附图页面的展示方法,该方法具体可以应用于上述业务系统中的服务器一侧,具体实施时,该方法具体可以包括以下内容。
S1:接收针对目标文件中的目标附图的页面展示请求。
在本实施例中,上述服务器可以通过有线或无线的方式与客户端设备耦合,以便进行数据交互。具体的,服务器可以根据相应的传输协议,通过有线或无线的方式接收由客户端设备发送的针对目标文件中的目标附图的页面展示请求。
在本实施例中,上述目标文件具体可以包括以下至少之一:专利文件、论文文件、合同文件、商标文件等等。当然,需要说明的是,上述所列举的目标文件只是为了更好地说明本说明书实施例。具体实施时,根据具体情况,上述目标文件还可以包括其他类型的文件。对此,本说明书不作限定。
在本实施例中,在服务器接收上述针对目标文件中的目标附图的页面展示请求之前,服务器预先可以有针对性地对目标文件所包含的文本数据和附图数据分别进行相应的处理,以先分别识别确定出目标文件的附图数据中的图像标识,以及文本数据中的关键词组,得到处理好的目标文件的附图数据和文本数据(即包含有已经识别确定出图像标识,及图像标识的相关信息的目标文件的附图数据,和包含有已经识别确定出关键词组,及关键词组的相关信息的目标文件的文本数据)。
S2:响应所述页面展示请求,从目标文件的附图数据中,获取目标附图数据,并确定所述目标附图中的图像标识。
在本实施例中,具体实施时,服务器可以响应上述页面展示请求,通过对页面展示请求进行数据解析等处理,确定出客户端设备所请求展示的目标文件中的目标附图,进而可以从目标文件的附图数据中检索出用于生成目标附图的目标附图数据。
在本实施例中,具体实施时,服务器在检索出目标附图数据后,进一步可以先对目标附图数据进行相应的识别处理,例如检索目标附图数据中是否存在图像标识的身份序号等,以确定出目标附图数据中所包含的图像标识。
S3:根据所述目标附图数据中所包含的图像标识,获取匹配的文本数据。
在本实施例中,上述匹配的文本数据具体可以理解为一种根据目标文件的文本数据中与目标附图数据中所包含的图像标识匹配的数据所生成的数据。
在本实施例中,上述根据所述图像标识,获取匹配的文本数据,具体实施时,可以包括以下内容:确定所述图像标识的身份序号;从所述目标文件的文本数据中检索出身份序号与所述图像标识的身份序号匹配的关键词组;从所述文本数据中获取包含有所述关键词组的文本语句;根据包含有所述关键词组的文本语句,确定所述匹配的文本数据。
在本实施例中,具体实施时可以根据图像标识的身份序号,从文本数据中检索出身份序号与所述图像标识的身份序号相同或者差异值小于预设的差异阈值的关键词组,作为匹配的关键词组。
例如,图像标识“图1”的身份序号为“f1”,可以根据该身份序号,从目标文件的文本数据中检索出一个相同的身份序号“f1”,作为匹配的身份序号,并将该身份序号所对应的关键词组“图1”确定为与图像标识“图1”匹配的关键词组。又例如,图像标识“101”的身份序号为“label2_101”,可以根据该身份序号,从目标文件的文本数据出检索出一个相近的身份序号“101”,作为匹配的身份序号,并将 该身份序号所对应的关键词组“部件101”确定为与图像标识“101”匹配的关键词组。当然,上述所列举的确定匹配的关键词组的方式只是一种示意性说明。具体实施时,根据具体情况也可以采用其他合适的方式确定出图像标识的匹配的关键词组。
在本实施例中,在确定出与图像标识匹配的关键词组的身份序号后,可以根据与该身份序号对应的关键词组的位置信息,高效地文本数据中找到该匹配关键词组。进而可以在文本数据中通过对该匹配的关键词组向前、向后分别检索语句分隔符(例如句号、分号等),找到从所述文本数据中确定包含有所述关键词组的文本语句。
在本实施例中,考虑到专利文件撰写规律,也为了能够高效、快速地找到有价值的匹配文本,在从文本数据中找到多个包含有关键词组的文本语句时,可以分别统计多个文本语句中的各个文本语句所包含的字符数,将字符数最少的文本语句作为用于后续处理的包含有关键词组的文本语句。从而可以有效降低数据处理量,提高处理效率。
在本实施例中,在确定出包含有所述关键词组的文本语句后,可以直接将上述包含有关键词组的文本语句作为针对该图像标识的匹配的文本数据。也可以仅将上述文本语句中与关键词组相连的名词和关键词组的组合作为上述匹配的文本数据。如果上述包含有关键词组的文本语句较为复杂,也可以对上述包含有关键词组的文本语句进行语义识别,例如输入已经训练好的语义识别模型,得到对应语义识别结果。进而可以将语义识别结果作为匹配的文本数据等等。当然,上述所列举的确定匹配的文本数据的方式只是一种示意性说明。具体实施时,根据具体的应用场景和处理要求,还可以采用其他合适的方式来获取上述匹配的文本数据。对此,本说明书不作限定。
在本实施例中,在确定出包含有关键词组的文本语句后,进一步还可以根据组成关键词组的非数字的字符,确定出与该关键词组对应的图像标识指示的是附图整体(例如附图序号),还是附图中的某个内容对象(例如附图中的某个部件或模块)。具体的,例如,如果组成关键词组的非数字的字符为“图”、“fig.”或者“附图”等,则可以确定与该关键词组对应的图像标识为用于指示附图序号的标识信息;否则,则为用于指示附图中的内容对象的标识信息。
在确定出关键词组为用于指示附图序号的标识信息后,进一步还可以检索并根据该关键词组所在的文本语句中的特征词,来确定与关键词组对应的图像标识所指示的附图类型。其中,上述附图类型具体可以包括以下所列举的附图类型中的至少一种:平面图、立体图、斜视图(或透视图)、断面图、概略图、分解图、放大图(或部分图)、概念图或示例图等等。当然,上述所列举的附图类型只是一种示意性说明。具体实施时,根据具体情况和应用场景还可以包含有其他的附图类型。对此,本说明书不作限定。
在本实施例中,在确定出关键词组所对应的图像标识所指示的附图类型后,后续可以将该附图的附图类型作为一种匹配文本显示给用户。
在一个实施例中,在确定出关键词组为用于指示附图序号的标识信息后,进一步还可以检索并根据该关键词组所在的文本语句中的方位词,确定与关键词组所对应的图像标识所指示的附图的基于视角的视图类型。其中,上述视图类型具体可以包括以下至少之一:左视图、右视图、前视图、后视图、上视图、下视图等等。当然,上述所列举的视图类型只是一种示意性说明。具体实施时,根据具体情况和应用场景还可以包含有其他的视图类型。对此,本说明书不作限定。
在本实施例中,在确定出关键词组所对应的图像标识所指示的附图的视图类型后,后续可以将该附图 的视图类型作为一种匹配文本显示给用户。还可以根据不同附图的视图类型之间的相互关系,建立相应的触发链接。例如,当用户正在看前视图时,如果用户对左视图感兴趣,可以通过点击前视图的左侧,发出触发,这时客户端可以响应用户的触发,跳转到与该前视图对应的视图类型为左视图的附图展示给用户。还可以根据上述不同附图的视图类型之间的相互关系,将多个附图按照视图类型的相互关系进行3维重构,以建立一个可切换视角显示的三维立体图。进而可以进一步提高用户使用体验。
S4:发送所述目标附图数据,和匹配的文本数据,其中,所述目标附图数据,和匹配的文本数据用于客户端设备生成并展示包含有目标附图的附图页面,所述页面中还包含有所述目标附图中所包含的图像标识的匹配文本,所述目标附图中所包含的图像标识的匹配文本根据匹配的文本数据生成。
在本实施例中,服务器在获取得到了针对页面展示请求的目标附图数据,和匹配的文本数据后,可以将上述目标附图数据,和匹配的文本数据通过有线或无线的方式发送客户端设备。这样客户端设备可以根据上述目标附图数据,向用户生成并展示出包含有用户想要查询了解的目标附图的附图页面,同时还会根据匹配的文本数据,在上述页面中显示出与目标附图中的图像标识匹配的匹配文本,使得用户能够更加方便、高效地获取得到目标附图中的相关信息,提高用户的实用体验。
在本实施例中,具体实施时,客户端设备可以是根据针对页面展示请求的目标附图,和匹配的文本数据中,通过HTML Canvas重新绘制出一个展示页面展示给用户阅读。也可以是服务器先根据目标附图,和匹配的文本数据中,通过HTML Canvas重新绘制出一个展示页面,再将该展示页面发送给客户端设备,由客户端设备向用户展示;也可以通过专利文字排算法,显示变更后的数据,进而展示包含的图像标识的匹配文本的页面;还可以通过在浏览器等阅读工具的设置开启的操作,展示包含的图像标识的匹配文本的页面给用户阅读等等。
基于上述分别应用于客户端设备和服务器上的附图页面的展示方法,本申请还提供了一种应用于包含有服务器和客户端设备的业务系统上的附图页面的展示方法。参阅图19所示,该方法具体可以包括以下内容。
S1:客户端设备向服务器发送针对目标文件中的目标附图的页面展示请求;
S2:服务器响应所述页面展示请求,从目标文件的附图数据中,获取目标附图数据;确定并根据所述目标附图数据中的图像标识,从目标文件的文本数据中获取匹配的文本数据;向客户端设备发送所述目标附图数据,和匹配的文本数据;
S3:客户端设备根据所述目标附图数据,和匹配的文本数据,展示包含有目标附图的页面,并在所述页面中显示出所述目标附图所包含的图像标识的匹配文本。
在本实施例中,基于上述业务系统,服务器和客户端设备可以响应用户的附图页面展示请求,为用户智能地展示出包含有匹配的文本内容的,符合用户要求的目标附图的附图页面,方便用户阅读、理解目标附图,使得用户不再需要自己再来回地在目标文件的文本内容中翻找与图像标识对应的文本语句,简化了用户操作,提高了用户使用体验。
进一步联系更多的应用情景,又考虑到许多用户习惯使用双页面(或者称双窗口)来显示想要阅读的目标文件中的附图和/文本。
其中,上述双页面可以参阅图20所示,具体可以理解为一种分栏页面(例如,对应的图文对照的展示页面),相当于在同一个展示页面上设置有两个相互独立的窗口:第一窗口和第二窗口,其中,每一个窗 口所展示的页面与另一个窗口所展示的页面可以互不相关,也可以彼此联动。通过上述双页面显示,用户可以同时阅读目标文件的不同位置处的文本内容或附图内容,或者同时阅读目标文件的附图内容和文本内容。
具体的,参阅图20所示,可以使用客户端设备所展示的页面中的左侧分栏页面(即第二窗口),来阅读目标文件的目标文本;使用客户端设备所展示的页面中的右侧分栏页面(即第一窗口),来阅读目标文件的目标附图。即,在客户端设备所展示的同一个完整的页面中,可以通过该展示页面中相互独立的第一窗口和第二窗口同时向用户显示目标文件的目标附图和目标文本。
基于这种应用情景,用户往往希望在第一窗口阅读到目标附图中的某个图像标识时,或者对第一窗口所显示的目标附图中的某个图像标识感兴趣时,对应的可以在第二窗口所显的目标文本中快速、智能地跳转或显示出目标文件的文本中与该图像标识相关的文本内容,达到能够图文联动,以便可以结合文本和附图来更好地理解目标文件中的具体内容。然而现有的方法往往无法实现上述需求。
为了满足用户的上述需求,进一步提高用户的使用体验,本申请还提供了一种页面的展示方法,该方法具体可以应用于客户端设备一侧。具体实施时,可以参阅图21所示,该方法可以包括以下内容:
S2101:展示包含有目标文件中的目标文本和目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图在所述页面中的第一窗口中展示,所述目标文本在所述页面中的第二窗口中展示,所述目标附图中包含有图像标识,所述目标文本中包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;
S2102:接收针对所述第一窗口中的图像标识的触发操作,和/或,针对所述第二窗口中的关键词组的触发操作;
S2103:在接收针对所述第一窗口中的图像标识的触发操作的情况下,响应针对第一窗口中的图像标识的触发操作,确定出被触发的图像标识;
S2104:在所述第二窗口中显示被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组;
S2105:在接收针对所述第二窗口中的关键词组的触发操作的情况下,响应针对第二窗口中的关键词组的触发操作,确定被触发的关键词组;
S2106:在所述第一窗口中显示被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
在本实施例中,客户端设备可以根据由服务器处理提供的或者自行处理得到的目标文件的附图数据和文本数据,并根据上述目标文件的附图数据和文本数据,以及用户的选择,分别在第一窗口显示出目标附图,在第二窗口显示出目标文本。
在本实施例中,具体实施时,参阅图19所示,客户端设备可以接收并响应用户针对第一窗口中的图像标识的触发操作,将第二窗口跳转到包含有匹配文本的页面,并在第二窗口中以区别于其他文本的显示方式,显示出与第一窗口中被触发的图像标识对应的匹配文本。其中,上述匹配文本具体可以是与第一窗口中被触发的图像标识对应的关键词组(即与被触发的图像标识的身份序号相同的关联的关键词组),或者包含有对应的关键词组的文本语句。
参阅图22所示,客户端设备还可以接收并响应用户针对第二窗口中的关键词组的触发操作,将第一窗 口跳转到包含有匹配附图的页面。并在第一窗口中以区别于其他图像标识的显示方式,显示出与第二窗口中被触发的关键词组对应的图像标识。其中,上述匹配附图具体可以是包含有与第二窗口中被触发的关键词组对应的图像标识(即与被触发的关键词组的身份序号相同的图像标识)的附图。
还可以参阅图23所示。例如,当用户在客户端设备所展示的页面中的右侧的第一窗口所展示的包含有目标附图的附图页面中点击触发了自己感兴趣的图像标识9后,同一页面中的左侧的第二窗口所展示的文本页面会响应上述触发,并自动跳转到文本中与图像标识9对应的位置区域的文本页面的,即目标文本页面,并以不同颜色标识出对应的匹配文本,例如,与图像标识9对应的关键词组。当然,具体实施时还可以采用标亮、字体加粗等多种标识方式在文本页面中显示出与被触发的图像标识对应的关键词组。对此,本说明书不作限定。
类似的,参阅图24所示,当用户在左侧的第二窗口所展示的包含目标文本的文本页面中点击触发了感兴趣的关键词组“电机外壳”后,同一页面中右侧的第一窗口所展示的附图页面也会同样响应上述触发,并自动跳转到附图中与该关键词组对应的图像标识“9、电机外壳”的附图页面,即目标附图页面,并以不同颜色标识出与被触发的关键词组对应的图像标识。
在一个实施例中,所述目标文件具体可以包括以下至少之一:专利文件、论文文件、合同文件、商标文件等。
在一个实施例中,在所述第二窗口中显示被触发的图像标识的匹配文本,具体实施时,可以包括以下内容:根据目标文件的附图数据,确定被触发的图像标识的身份序号;根据目标文件的文本数据,检索出身份序号与被触发的图像标识的身份序号匹配的关键词组,作为关联的关键词组;在所述第二窗口中展示包含有所述关联的关键词组的文本作为被触发的图像标识的匹配文本。
在一个实施例中,在所述第二窗口中展示包含有所述关联的关键词组的文本作为被触发的图像标识的匹配文本的同时,所述方法具体实施时,还可以包括以下内容:通过预设的显示方式,在所述被触发的图像标识的匹配文本中显示出所述关联的关键词组,其中,所述预设的显示方式以下至少之一:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以字体为斜体的方式显示;以字体的字号变大的方式显示。
在一个实施例中,在所述第二窗口中显示被触发的图像标识的匹配文本,具体实施时,还可以包括:在检索到多个关联的关键词组的情况下,根据关联的关键词组的位置信息,在所述第二窗口中展示包含有位置信息最靠前的关联的关键词组的文本作为被触发的图像标识的匹配文本。
在一个实施例中,在检索到多个关联的关键词组的情况下,所述方法具体实施时,还可以包括:展示多个关联的关键词组;接收并根据用户针对所述多个关联的关键词组的选择操作,确定出用户选中的关联的关键词组;在所述第二窗口展示出包含有用户选中的关联的关键词组的文本作为被触发的图像标识的匹配文本。
在本实施例中,在展示多个关联的关键词组时,还可以同时展示出例如关联的关键词组的位置信息等相关信息,供用户参考,便于用户更好地选择想要展示的关联的关键词组。
在一个实施例中,在所述第二窗口中展示包含有所述关联的关键词组的文本作为被触发的图像标识的匹配文本,具体实施时,可以包括以下内容:控制第二窗口跳转到包含有被触发的图像标识的匹配文本的页面;并在第二窗口中,通过预设的显示方式,在所述被触发的图像标识的匹配文本中显示出所述关联的 关键词组。
在一个实施例中,所述预设的显示方式包括以下至少之一:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以字体为斜体的方式显示;以字体的字号变大的方式显示等等。
具体的,例如,可以参阅图30所示,当用户在第一窗口中点击图像标识“100”发起触发操作后,除了在第一窗口中以悬浮框等预设的方式显示出以下文本信息“100.天线”外;客户端设备还可以响应上述触发操作,在目标文本中检索到多个被触发的图像标识“100”的匹配文本,进而可以在第二窗口所展示的目标文本中,以标亮等预设的显示方式,显示出多个被触发的图像标识“100”的匹配文本。当然,根据用户的具体指令要求,具体实施时,也可以在第二窗口所展示的目标文本中,只以标亮等预设的显示方式显示出多个被触发的图像标识“100”的匹配文本中在目标文本中的位置最靠前的一个被触发的图像标识“100”的匹配文本。
在一个实施例中,在所述第二窗口中展示包含有所述关联的关键词组的文本作为被触发的图像标识的匹配文本的同时,所述方法具体实施时,还可以包括以下内容:在第一窗口中的预设显示位置处,显示所述被触发的图像标识的关联的关键词组,其中,所述预设显示位置包括沿远离所述目标附图的中心点方向,且与所述被触发的图像标识的距离小于等于预设的第一距离阈值的位置。
在一个实施例中,上述显示所述被触发的图像标识的关联的关键词组,具体实施时,可以包括以下内容:显示所述被触发的图像标识的匹配文本中的少于预设长度的字符;接收针对所述被触发的图像标识的匹配文本中的少于预设长度的字符的触发操作;响应所述触发操作,显示所述被触发的图像标识的匹配文本。
在一个实施例中,在所述第一窗口中显示被触发的关键词组的匹配附图,具体实施时,可以包括以下内容:根据目标文件的文本数据,确定被触发的关键词组的身份序号;根据目标文件的附图数据,检索出身份序号与所述被触发的关键词组的身份序号匹配的图像标识,作为关联的图像标识;在所述第一窗口中展示包含有所述关联的图像标识的附图作为被触发的关键词组的匹配附图。
在一个实施例中,在所述第一窗口中展示包含有所述关联的图像标识的附图作为被触发的关键词组的匹配附图的同时,所述方法具体实施时,还可以包括以下内容:通过预设的显示方式,在所述被触发的关键词组的匹配附图中显示出所述关联的图像标识,其中,所述预设的显示方式以下至少之一:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以字体为斜体的方式显示;以字体的字号变大的方式显示等等。
在一个实施例中,在展示包含有目标文件中的目标文本和目标附图的页面前,所述方法具体实施时,还可以包括以下内容:获取目标文件的附图数据和文本数据,其中,所述附图数据包含有图像标识,所述文本数据包含有关键词组;接收针对目标文件中的目标附图的展示操作,以及针对目标文件中的目标文本的展示操作;响应针对目标附图的展示操作,根据所述目标文件的附图数据,在所述页面中的第一窗口展示出目标附图;响应针对目标文本的展示操作,根据所述目标文件的文本数据,在所述页面中的第二窗口展示出目标文本。
在一个实施例中,所述附图数据中所包含的图像标识具体可以按照以下方式确定:从所述附图数据中筛选出包含有预设页头的附图数据;对所述包含有预设页头的附图数据中的第一范围区域进行图像数据识别,将识别到的第一字符组作为所述图像标识,其中,所述第一字符组包括:数字和表征图的特征字符的 组合;根据所识别到的第一字符组,在所述包含有预设页头的附图数据中确定出第二范围区域;对所述包含有预设页头的附图数据中的第二范围区域进行图像数据识别,将识别到的第二字符组作为所述图像标识,其中,所述第二字符组包括:与所述包含有预设页头的附图数据中的特征线的距离小于预设的第二距离阈值的数字和/或字母的组合,所述特征线包括:携带有箭头的直线,和/或,携带有箭头的曲线。
在一个实施例中,在确定出所述附图数据中所包含的图像标识后,所述方法具体实施时,还可以包括以下内容:根据所述图像标识,生成对应的图像标识的身份序号,并将所述图像标识的身份序号保存于所述附图数据中。
在一个实施例中,上述确定所述文本数据中的关键词组,具体实施时,可以包括以下内容:检索所述文本数据中的小节标题;根据所述小节标题,将所述文本数据划分为多个文本区域;从所述多个文本区域中筛选出符合预设要求的文本区域,作为第三范围区域,其中,所述第三范围区域至少包括小节标题为附图说明或者BRIFE DESCRIPTION OF DRAWINGS的文本区域;根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组。
在一个实施例中,在检索所述文本数据,确定所述文本数据不包含有小节标题的情况下,所述方法具体实施时,还可以包括以下内容:利用预设的区域划分模型对所述文本数据进行处理,以从所述文本数据中确定出第三范围区域,其中,所述预设的区域划分模型为预先利用样本文本数据对神经网络进行模型训练得到的。
在一个实施例中,根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组,具体实施时,可以包括以下内容:对所述文本数据中的第三范围区域进行文本数据识别,识别出第三字符组,其中,所述第三字符组包括以下至少之一:数字和名词的组合、字母和名词的组合、数字和表征图的字符的组合;将所识别出的第三字符组作为所述关键词组。
在一个实施例中,在识别出多个语义近似的第三字符组的情况下,所述方法具体实施时,还可以包括以下内容:统计所述多个语义近似的第三字符组中的各个第三字符组在文本数据中的出现频次;将出现频次最高的第三字符组确定为所述关键词组。
在一个实施例中,在识别出多个第三字符组的情况下,所述方法具体实施时,还可以包括以下内容:统计所述多个第三字符组中的各个第三字符组所在文本段落所包含的字符数;将所在的文本段落所包含的字符数最少的第三字符组作为所述关键词组。
在一个实施例中,所述第三字符组具体还可以包括:多个数字和分隔符号的组合,在对所述文本数据中的第三范围区域进行数据处理,识别出包含有多个数字和分隔符号的组合的第三字符组后,所述方法具体实施时,还可以包括以下内容:识别所述分隔符号的类型;根据与所述分隔符号的类型匹配的拆分规则,将所述多个数字拆分为多个待拼接的数字字符;将所述多个待拼接的数字字符与所述第三字符组中的名词、字母或表征图的字符分别进行拼接,得到多个关键词组。
在一个实施例中,在确定出所述文本数据中的关键词组后,所述方法具体实施时,还可以包括以下内容:根据所述关键词组,生成对应的关键词组的身份序号,并将所述关键词组的身份序号保存于所述文本数据中。
由上可见,本申请提供的页面的展示方法,通过客户端设备先根据目标文件的附图数据和文本数据,分别在同一页面的第一窗口和第二窗口分别展示出用户选择的目标附图和目标文本;再通过接收并响应用 户针对第一窗口中图像标识的触发操作,在同一页面中的第二窗口所展示的文本中显示出与被触发的图像标识对应的匹配文本;或者,再接收并响应用户针对第二窗口中的关键词组的触发操作,在同一页面中的第一窗口所展示的附图中显示出与被触发的关键词组对应的匹配附图;从而智能、高效地实现图文的联动,便于用户能够更方便地结合使用目标文件的文本数据和附图数据,对目标文件的具体内容进行阅读、理解,进一步提高了用户的使用体验。
基于上述方法,本申请还提供了一种页面的展示装置,可以参阅图25所示,具体实施时,所述装置具体可以包括以下结构模块:
展示模块,具体可以用于展示包含有目标文件中的目标文本和目标附图的页面,其中,所述目标文件包含有附图数据和文本数据,所述目标附图在所述页面中的第一窗口中展示,所述目标文本在所述页面中的第二窗口中展示,所述目标附图中包含有图像标识,所述目标文本中包含有关键词组;
接收模块,具体可以用于接收针对所述第一窗口中的图像标识的触发操作;
确定模块,具体可以用于响应针对第一窗口中的图像标识的触发操作,确定出被触发的图像标识;
显示模块,具体可以用于在所述第二窗口中显示被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
本说明书实施例还提供了一种基于上述页面的展示方法的计算机存储介质,所述计算机存储介质存储有计算机程序指令,在所述计算机程序指令被执行时实现:展示包含有目标文件中的目标文本和目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图在所述页面中的第一窗口中展示,所述目标文本在所述页面中的第二窗口中展示,所述目标附图中包含有图像标识,所述目标文本中包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;接收针对所述第一窗口中的图像标识的触发操作,和/或,针对所述第二窗口中的关键词组的触发操作;在接收针对所述第一窗口中的图像标识的触发操作的情况下,响应针对第一窗口中的图像标识的触发操作,确定出被触发的图像标识;在所述第二窗口中显示被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组;在接收针对所述第二窗口中的关键词组的触发操作的情况下,响应针对第二窗口中的关键词组的触发操作,确定被触发的关键词组;在所述第一窗口中显示被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
本说明书实施例还提供了一种基于上述页面的展示方法的客户端设备,其中,该客户端设备包括处理器以及用于存储处理器可执行指令的存储器,所述处理器执行所述指令时实现展示包含有目标文件中的目标文本和目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图在所述页面中的第一窗口中展示,所述目标文本在所述页面中的第二窗口中展示,所述目标附图中包含有图像标识,所述目标文本中包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;接收针对所述第一窗口中的图像标识的触发操作,和/或,针对所述第二窗口中的关键词组的触发操作;在接收针对所述第一窗口中的图像标识的触发操作的情况下,响应针对第一窗口中的图像标识的触发操作,确定出被触发的图像标识;在所述第二窗口中显示被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组;在接收针对所述第二窗口中的关键词组的触发操作的情况下,响应针对第二窗口中的关键词组的触发操作,确定被触发的关键词 组;在所述第一窗口中显示被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
进一步又考虑到目前为用户提供目标文件查询访问的平台,例如,莫某专利检索网站,为了满足用户多样化使用需求,往往会提供多种不同的标签页面供用户自行选择。用户可以通过点击页面上方的标签选择适合自己的页面类型来显示所要查阅的目标文件。参阅图15所示,例如用户可以选择“专利详情”标签,客户端设备会采用与专利详情标签对应的标签页面向用户展示出专利文件中的附图和文本。参阅图24所示,用户也可以选择“图文对照”,客户端设备会采用与图文对照标签对应的标签页面向用户展示出专利文件的附图和文本。
在这种应用情景下,有时用户往往还会需要在不同的标签页面中进行切换,以更好地理解目标文件中的相关内容。
例如,可能用户一开始只想粗略看下专利文件中的附图内容,因此先选择在专利详情的标签页面中阅读专利文件的附图。当在客户端设备所展示的专利详情的标签页面中阅读到附图中的某个图像标识时,对这个图像标识突然感兴趣,临时想切换使用图文对照的标签页面,来利用文本和附图对照的方式来更仔细研读下该图像标识所指示的内容对象的具体特点。这时可以通过客户端设备选择图文对照的标签,使得客户端设备切换为通过图文对照的标签页面展示专利文件。但是,基于现有方法,客户端设备在切换展示新的标签页面时,往往会不法保留在之前的标签页面中的显示记录。即,在客户端设备响应用户选择,所展示给用户的初始的图文对照的标签页面中,在图文对照的标签页面的第一窗口初始时会默认显示专利文件中附图的起始部分(例如附图1),在第二窗口初始是会默认显示专利文件中文本的起始部分,而不会只能地智能地直接在所切换的图文对照的标签页面的第一窗口中显示出在之前的专利详情的标签页面中刚浏览过的包含有感兴趣的图像标识的附图,更不会直接在第二窗口中显示出包含有与上述感兴趣的图像标识对应的关键词组的文本。这时用户必须自行分别在新切换的图文对照的标签页面中的第一窗口和第二窗口翻找找到包含有自己感兴趣的图像标识的附图,以及包含有与该图像标识对应的关键词组的文本来对照阅读,操作比较繁琐、麻烦,用户体验较差。
为了解决上述问题,本说明书实施例还提供了另一种页面的展示方法,可以参阅图26,以及图27所示的相关内容,该方法具体实施时,可以包括以下内容:
S2601:展示包含有目标文件中的目标附图的第一标签页面,其中,所述目标附图中包含有图像标识,所述目标文件包含有附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;
S2603:接收并响应针对所述第一标签页面中的图像标识的触发操作,确定并记录第一标签页面中被触发的图像标识;
S2605:响应标签页面的切换操作,展示包含有目标文件中的目标文本的第二标签页面;
S2607:在所述第二标签页面中显示出目标文件中包含有与第一标签页面中被触发的图像标识的匹配的关键词组的文本作为目标文本。
在本实施例中,客户端设备接收并响应针对所述第一标签页面中的图像标识的触发操作,确定第一标签页面中被触发的图像标识时,会通过客户端设备的存储资源自动记录存储下最新被触发的图像标识。具体记录时,客户端设备可以利用最新记录的被触发的图像标识替换掉之前记录存储的被触发的图像标识, 以备后续使用。
在本实施例中,上述第一标签页面和第二标签页面具体可以理解是为用户提供目标文件查询阅读服务的平台所提供给用户选择的不同类型的用于展示目标文件的标签页面。具体的,上述标签页面可以包括:专利详情的标签页面、图文对照的标签页面、PDF原文标签页面等等。对于上述标签页面的具体类型,本说明书不作限定。
在本实施例中,在所述第二标签页面中显示出目标文件中包含有与第一标签页面中被触发的图像标识的匹配的关键词组的文本作为目标文本,具体实施,可以理解为客户端设备在展示给用户的新的第二标签页面时,会将包含有与第一标签页面中被触发的图像标识的匹配的关键词组的文本作为目标文本展示在初始的第二标签页面中,这样就能够更好地贴近用户的行为习惯,避免用户在新的第二标签页面中再重新翻找自己感兴趣的文本或附图内容,提高了用户的使用体验。
在一个实施例中,所述第一标签页面具体可以包括附图详情页面等,所述第二标签页面具体可以包括图文对照页面等。当然,需要说明的是,上述所列举的第一标签页面和第二标签页面只是一种示意性说明。具体实施时,根据具体情况和场景要求,还可以是其他类型的标签页面。对此,本说明书不作限定。
在一个实施例中,在所述第二标签页面为图文对照页面的情况下,所述第二标签页面包含有第一窗口和第二窗口,其中,所述第一窗口用于显示目标附图,所述第二窗口用于显示目标文本。
在一个实施例中,在所述第二标签页面中显示出目标文件中包含有与第一标签页面中被触发的图像标识的匹配的关键词组的文本作为目标文本后,所述方法具体实施时,还可以包括以下内容:在所述第二标签页面的第二窗口中显示所述目标文本。
在一个实施例中,在所述第二标签页面中显示出目标文件中包含有与第一标签页面中被触发的图像标识的匹配的关键词组的文本作为目标文本的同时,所述方法具体实施时,还可以包括以下内容:在所述第二标签页面的第一窗口中显示出目标文件中包含有第一标签页面中被触发的图像标识的附图作为目标附图。
由上可见,本申请提供的页面的展示方法,通过客户端设备记录下用户在第一标签页面中被触发的图像标识,在接收并响应用户的标签页面的切换操作,展示第二标签页面时,可以根据所记录的用户在第一标签页面中被触发的图像标识,确定出对应的目标文本和/或目标附图作为初始展示的页面内容在第二标签页面进行展示,从而能够更加符合用户的行为使用习惯,进一步提高用户的使用体验。
本申请还提供了一种页面的展示装置,具体实施时,可以包括以下结构模块:
第一展示模块,具体可以用于展示包含有目标文件中的目标附图的第一标签页面,其中,所述目标附图中包含有图像标识;
确定模块,具体可以用于接收并响应针对所述第一标签页面中的图像标识的触发操作,确定并记录第一标签页面中被触发的图像标识;
第二展示模块,具体可以用于响应标签页面的切换操作,展示包含有目标文件中的目标文本的第二标签页面;
显示模块,具体可以用于在所述第二标签页面中显示出目标文件中包含有与第一标签页面中被触发的图像标识的匹配的关键词组的文本作为目标文本。
本说明书实施例还提供了一种基于上述页面的展示方法的一种客户端设备,包括处理器以及用于存储 处理器可执行指令的存储器,所述处理器执行所述指令时实现展示包含有目标文件中的目标附图的第一标签页面,其中,所述目标附图中包含有图像标识,所述目标文件包含有附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;接收并响应针对所述第一标签页面中的图像标识的触发操作,确定并记录第一标签页面中被触发的图像标识;响应标签页面的切换操作,展示包含有目标文件中的目标文本的第二标签页面;在所述第二标签页面中显示出目标文件中包含有与第一标签页面中被触发的图像标识的匹配的关键词组的文本作为目标文本。
本说明书实施例还提供了一种基于上述页面的展示方法的计算机存储介质,所述计算机存储介质存储有计算机程序指令,在所述计算机程序指令被执行时实现:展示包含有目标文件中的目标附图的第一标签页面,其中,所述目标附图中包含有图像标识,所述目标文件包含有附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;接收并响应针对所述第一标签页面中的图像标识的触发操作,确定并记录第一标签页面中被触发的图像标识;响应标签页面的切换操作,展示包含有目标文件中的目标文本的第二标签页面;在所述第二标签页面中显示出目标文件中包含有与第一标签页面中被触发的图像标识的匹配的关键词组的文本作为目标文本。
虽然本说明书提供了如实施例或流程图所述的方法操作步骤,但基于常规或者无创造性的手段可以包括更多或者更少的操作步骤。实施例中列举的步骤顺序仅仅为众多步骤执行顺序中的一种方式,不代表唯一的执行顺序。在实际中的装置或客户端产品执行时,可以按照实施例或者附图所示的方法顺序执行或者并行执行(例如并行处理器或者多线程处理的环境,甚至为分布式数据处理环境)。术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、产品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、产品或者设备所固有的要素。在没有更多限制的情况下,并不排除在包括所述要素的过程、方法、产品或者设备中还存在另外的相同或等同要素。第一,第二等词语用来表示名称,而并不表示任何特定的顺序。
本领域技术人员也知道,除了以纯计算机可读程序代码方式实现控制器以外,完全可以通过将方法步骤进行逻辑编程来使得控制器以逻辑门、开关、专用集成电路、可编程逻辑控制器和嵌入微控制器等的形式来实现相同功能。因此这种控制器可以被认为是一种硬件部件,而对其内部包括的用于实现各种功能的装置也可以视为硬件部件内的结构。或者甚至,可以将用于实现各种功能的装置视为既可以是实现方法的软件模块又可以是硬件部件内的结构。
本说明书可以在由计算机执行的计算机可执行指令的一般上下文中描述,例如程序模块。一般地,程序模块包括执行特定任务或实现特定抽象数据类型的例程、程序、对象、组件、数据结构、类等等。也可以在分布式计算环境中实践本说明书,在这些分布式计算环境中,由通过通信网络而被连接的远程处理设备来执行任务。在分布式计算环境中,程序模块可以位于包括存储设备在内的本地和远程计算机存储介质中。
通过以上的实施方式的描述可知,本领域的技术人员可以清楚地了解到本说明书可借助软件加必需的通用硬件平台的方式来实现。基于这样的理解,本说明书的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在存储介质中,如ROM/RAM、磁碟、 光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,移动终端,服务器,或者网络设备等)执行本说明书各个实施例或者实施例的某些部分所述的方法。
本说明书中的各个实施例采用递进的方式描述,各个实施例之间相同或相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。本说明书可用于众多通用或专用的计算机系统环境或配置中。例如:个人计算机、服务器计算机、手持设备或便携式设备、平板型设备、多处理器系统、基于微处理器的系统、置顶盒、可编程的电子设备、网络PC、小型计算机、大型计算机、包括以上任何系统或设备的分布式计算环境等等。
虽然通过实施例描绘了本说明书,本领域普通技术人员知道,本说明书有许多变形和变化而不脱离本说明书的精神,希望所附的权利要求包括这些变形和变化而不脱离本说明书的精神。

Claims (15)

  1. 一种附图页面的展示方法,其特征在于,包括:
    展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图中包含有图像标识;
    接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识;
    根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
  2. 根据权利要求1所述的附图页面的展示方法,其特征在于,所述目标文件包括以下至少之一:专利文件、论文文件、合同文件、商标文件。
  3. 根据权利要求2所述的附图页面的展示方法,其特征在于,在展示包含有目标文件中的目标附图的页面前,所述方法还包括:
    获取目标文件的附图数据和文本数据,其中,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述关键词组为文本数据中用于指示附图,和/或,附图中的内容对象的字符组合;
    接收针对目标文件中的目标附图的展示操作;
    响应所述展示操作,根据所述目标文件的附图数据,展示包含有目标文件中的目标附图的页面。
  4. 根据权利要求3所述的附图页面的展示方法,其特征在于,根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,包括:
    根据所述目标文件的附图数据,确定出所述被触发的图像标识的身份序号;
    从所述目标文件的文本数据所包含的关键词组中,检索出身份序号与所述被触发的图像标识的身份序号匹配的关键词组,作为关联的关键词组;
    根据所述关联的关键词组,确定所述被触发的图像标识的匹配文本;
    在所述页面中的预设显示位置处,显示所述被触发的图像标识的匹配文本。
  5. 根据权利要求1所述的附图页面的展示方法,其特征在于,所述在所述页面中显示出被触发的图像标识的匹配文本,包括:
    在所述页面中,以与页面的主体结构相独立的形式显示出被触发的图像标识的匹配文本;
    相应的,所述方法还包括:接收并根据用户的操作指令,在所述页面中将所显示的被触发的图像标识的匹配文本移动至指定位置;
    在所述页面中将所显示的被触发的图像标识的匹配文本移动至指定位置后,所述方法还包括:记录所述指定位置。
  6. 根据权利要求1所述的附图页面的展示方法,其特征在于,所述方法还包括:在所述页面中生成并展示数据列表,其中,所述数据列表包括目标附图中所包括的图像标识,以及图像标识的匹配文本;其中,所述数据列表与目标附图中的图像标识相互关联。
  7. 根据权利要求6所述的附图页面的展示方法,其特征在于,所述在所述页面中显示出被触发的图像标识的匹配文本,还包括:
    在所述数据列表中以预设的显示方式显示出被触发的图像标识,以及被触发的图像标识的匹配文本。
  8. 根据权利要求7所述的附图页面的展示方法,其特征在于,所述预设的显示方式包括以下至少之一:以标亮的方式显示;以加粗字体的方式显示;以预设的字体颜色显示;以悬浮框的方式显示;以字体为斜体的方式显示。
  9. 根据权利要求1所述的附图页面的展示方法,其特征在于,在根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本后,所述方法还包括:
    接收并响应用户的保存操作,保存显示了被触发的图像标识的匹配文本的目标附图页面。
  10. 一种附图页面的展示方法,其特征在于,包括:
    展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述附图数据包含有图像标识,所述文本数据包含有关键词组,所述目标附图中包含有图像标识;
    接收第一类触发操作,其中,所述第一类触发操作用于控制显示所述目标附图中所包含的图像标识的匹配文本;
    响应所述第一类触发操作,在所述页面中显示出所述目标附图中所包含的图像标识的匹配文本,其中,所述图像标识的匹配文本包括用于描述图像标识的语句或词组。
  11. 根据权利要求3或权利要求10所述的附图页面的展示方法,其特征在于,所述附图数据中所包含的图像标识按照以下方式确定:
    从所述附图数据中筛选出包含有预设页头的附图数据;
    对所述包含有预设页头的附图数据中的第一范围区域进行图像数据识别,将识别到的第一字符组作为所述图像标识,其中,所述第一字符组包括:数字和表征图的特征字符的组合;
    根据所识别到的第一字符组,在所述包含有预设页头的附图数据中确定出第二范围区域;
    对所述包含有预设页头的附图数据中的第二范围区域进行图像数据识别,将识别到的第二字符组作为所述图像标识,其中,所述第二字符组包括:与所述包含有预设页头的附图数据中的特征线的距离小于预设的第二距离阈值的数字和/或字母的组合,所述特征线包括:携带有箭头的直线,和/或,携带有箭头的曲线。
  12. 根据权利要求3或权利要求10所述的附图页面的展示方法,其特征在于,确定所述文 本数据中的关键词组,包括:
    检索所述文本数据中的小节标题;
    根据所述小节标题,将所述文本数据划分为多个文本区域;
    从所述多个文本区域中筛选出符合预设要求的文本区域,作为第三范围区域,其中,所述第三范围区域至少包括小节标题为附图说明或者BRIFE DESCRIPTION OF DRAWINGS的文本区域;
    根据所述文本数据中的第三范围区域,确定所述文本数据中的关键词组。
  13. 一种文本页面的展示方法,其特征在于,包括:
    展示包含有目标文件中的目标文本的页面,其中,所述目标文件包括附图数据和文本数据,所述目标文本中包含有关键词组;
    接收并响应针对所述目标文本中的关键词组的触发操作,确定出被触发的关键词组;
    在所述页面中显示出被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
  14. 一种附图页面的展示装置,其特征在于,包括:
    展示模块,用于展示包含有目标文件中的目标附图的页面,其中,所述目标文件包括附图数据和文本数据,所述目标附图中包含有图像标识;
    确定模块,用于接收并响应针对所述目标附图中的图像标识的触发操作,确定出被触发的图像标识;
    显示模块,用于根据所述附图数据和所述文本数据,在所述页面中显示出被触发的图像标识的匹配文本,其中,所述被触发的图像标识的匹配文本至少包括用于描述所述被触发的图像标识的语句或词组。
  15. 一种文本页面的展示装置,其特征在于,包括:
    展示模块,用于展示包含有目标文件中的目标文本的页面,其中,所述目标文件包括附图数据和文本数据,所述目标文本中包含有关键词组;
    接收模块,用于接收并响应针对所述目标文本中的关键词组的触发操作,确定出被触发的关键词组;
    显示模块,用于在所述页面中显示出被触发的关键词组的匹配附图,其中,所述被触发的关键词组的匹配附图至少包括被触发的关键词组所描述的附图或者附图中的内容对象。
PCT/CN2020/078922 2019-03-18 2020-03-12 附图页面的展示方法和装置、文本页面的展示方法和装置 WO2020187117A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910203855.7 2019-03-18
CN201910203855.7A CN110008426A (zh) 2019-03-18 2019-03-18 附图页面的展示方法和装置、文本页面的展示方法和装置

Publications (1)

Publication Number Publication Date
WO2020187117A1 true WO2020187117A1 (zh) 2020-09-24

Family

ID=67167260

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/078922 WO2020187117A1 (zh) 2019-03-18 2020-03-12 附图页面的展示方法和装置、文本页面的展示方法和装置

Country Status (2)

Country Link
CN (1) CN110008426A (zh)
WO (1) WO2020187117A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110008426A (zh) * 2019-03-18 2019-07-12 智慧芽信息科技(苏州)有限公司 附图页面的展示方法和装置、文本页面的展示方法和装置
CN111580907A (zh) * 2020-04-28 2020-08-25 浙江大华技术股份有限公司 一种ai属性显示方法、显示装置及计算机可读存储介质
CN112001312A (zh) * 2020-08-21 2020-11-27 深圳传音控股股份有限公司 文档拼接方法、设备及存储介质
CN112633343B (zh) * 2020-12-16 2024-04-19 国网江苏省电力有限公司检修分公司 一种电力设备端子排接线校核方法及装置
CN114625472A (zh) * 2022-05-16 2022-06-14 天津中新智冠信息技术有限公司 一种页面展示方法、装置、电子设备及存储介质

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102737030A (zh) * 2011-04-06 2012-10-17 上海量明科技发展有限公司 专利文档的数据输出方法、终端及系统
US20140122993A1 (en) * 2012-04-03 2014-05-01 Salesforce.Com, Inc. Mechanism for facilitating user-controlled management of webpage elements for dynamic customization of information
CN107491488A (zh) * 2017-07-18 2017-12-19 北京京东尚科信息技术有限公司 页面数据采集的方法和装置
CN108763445A (zh) * 2018-05-25 2018-11-06 厦门智融合科技有限公司 专利知识库的构建方法、装置、计算机设备和存储介质
CN109299424A (zh) * 2018-11-29 2019-02-01 北京字节跳动网络技术有限公司 一种基于富文本提供响应事件的方法、装置和计算机记录介质
CN109445900A (zh) * 2018-11-13 2019-03-08 江苏省舜禹信息技术有限公司 用于图片显示的翻译方法和装置
CN110008426A (zh) * 2019-03-18 2019-07-12 智慧芽信息科技(苏州)有限公司 附图页面的展示方法和装置、文本页面的展示方法和装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102650998A (zh) * 2011-02-28 2012-08-29 鸿富锦精密工业(深圳)有限公司 外观设计专利展示系统及方法
CN105893498B (zh) * 2016-03-30 2019-05-21 努比亚技术有限公司 一种实现截屏的方法和装置、查找图片的方法和装置

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102737030A (zh) * 2011-04-06 2012-10-17 上海量明科技发展有限公司 专利文档的数据输出方法、终端及系统
US20140122993A1 (en) * 2012-04-03 2014-05-01 Salesforce.Com, Inc. Mechanism for facilitating user-controlled management of webpage elements for dynamic customization of information
CN107491488A (zh) * 2017-07-18 2017-12-19 北京京东尚科信息技术有限公司 页面数据采集的方法和装置
CN108763445A (zh) * 2018-05-25 2018-11-06 厦门智融合科技有限公司 专利知识库的构建方法、装置、计算机设备和存储介质
CN109445900A (zh) * 2018-11-13 2019-03-08 江苏省舜禹信息技术有限公司 用于图片显示的翻译方法和装置
CN109299424A (zh) * 2018-11-29 2019-02-01 北京字节跳动网络技术有限公司 一种基于富文本提供响应事件的方法、装置和计算机记录介质
CN110008426A (zh) * 2019-03-18 2019-07-12 智慧芽信息科技(苏州)有限公司 附图页面的展示方法和装置、文本页面的展示方法和装置

Also Published As

Publication number Publication date
CN110008426A (zh) 2019-07-12

Similar Documents

Publication Publication Date Title
WO2020187118A1 (zh) 页面的展示方法和装置
WO2020187117A1 (zh) 附图页面的展示方法和装置、文本页面的展示方法和装置
US10503797B2 (en) Apparatus and method for sharing introduction information
EP3246796A1 (en) Device and method for inputting note information into image of photographed object
WO2022166579A1 (zh) 信息展示的方法、装置以及计算机存储介质
US9342233B1 (en) Dynamic dictionary based on context
CN111381751A (zh) 一种文本处理方法及装置
US20150234938A1 (en) Method and electronic terminal for searching for contact in directory
KR102369604B1 (ko) 고정 포맷 문서를 리플로우 포맷으로 프리젠트하는 기법
US20150339348A1 (en) Search method and device
CN104808903B (zh) 文本选择方法和装置
CN111859856A (zh) 信息显示方法、装置、电子设备及存储介质
US20150149428A1 (en) Search Inquiry Method Using Contextual Annotation
US20070136348A1 (en) Screen-wise presentation of search results
WO2023087934A1 (zh) 一种语音控制方法、装置、设备以及计算机存储介质
DE102014101026A1 (de) Eingabestiftkurzschrift
US9141867B1 (en) Determining word segment boundaries
WO2017211202A1 (zh) 数据的提取方法、装置及终端设备
EP2947584A1 (en) Multimodal search method and device
JP2016066115A (ja) デジタルコンテンツの閲覧支援装置、閲覧支援方法及びプログラム
JP2015114955A (ja) 情報処理装置、情報処理方法、およびプログラム
KR20150097250A (ko) 태그 정보를 이용한 스케치 검색 시스템, 사용자 장치, 서비스 제공 장치, 그 서비스 방법 및 컴퓨터 프로그램이 기록된 기록매체
US11010978B2 (en) Method and system for generating augmented reality interactive content
CN104850316A (zh) 电子图书字体调整方法及装置
JP5703244B2 (ja) トレース支援装置、トレース支援システム、トレース支援方法、およびトレース支援プログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20773450

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20773450

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 20773450

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 07.06.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 20773450

Country of ref document: EP

Kind code of ref document: A1