CN116522876B - Method and device for realizing PDF text labeling of webpages of fire fox browser - Google Patents

Method and device for realizing PDF text labeling of webpages of fire fox browser Download PDF

Info

Publication number
CN116522876B
CN116522876B CN202310513079.7A CN202310513079A CN116522876B CN 116522876 B CN116522876 B CN 116522876B CN 202310513079 A CN202310513079 A CN 202310513079A CN 116522876 B CN116522876 B CN 116522876B
Authority
CN
China
Prior art keywords
drag
mouse
dom
node
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310513079.7A
Other languages
Chinese (zh)
Other versions
CN116522876A (en
Inventor
张林辉
何凡
计雪莉
王彬彬
王云川
潘海勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hubei Haichuang Zhiyun Technology Co ltd
Beijing Leadal Technology Development Co ltd
Original Assignee
Hubei Haichuang Zhiyun Technology Co ltd
Beijing Leadal Technology Development Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hubei Haichuang Zhiyun Technology Co ltd, Beijing Leadal Technology Development Co ltd filed Critical Hubei Haichuang Zhiyun Technology Co ltd
Priority to CN202310513079.7A priority Critical patent/CN116522876B/en
Publication of CN116522876A publication Critical patent/CN116522876A/en
Application granted granted Critical
Publication of CN116522876B publication Critical patent/CN116522876B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/169Annotation, e.g. comment data or footnotes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/0486Drag-and-drop
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a method and a device for realizing PDF text labeling of a fire fox browser webpage version, belonging to the technical field of computers, wherein the method comprises the following steps: according to a window.getselection method, obtaining DOM nodes when a mouse starts to drag and when the mouse ends to drag and text coordinates of selected contents in the DOM nodes; judging whether the DOM node when the mouse starts to drag and the DOM node when the mouse finishes to drag are the same, and determining a labeling mode according to a judging result; if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style; and if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm. The method can set the marked text across pages, the marked text is reproducible, the flexibility, the controllability and the expansibility are higher, and convenience is provided for PDF operation of users.

Description

Method and device for realizing PDF text labeling of webpages of fire fox browser
Technical Field
The invention relates to the technical field of computers, in particular to a method, a device, equipment and a storage medium for realizing PDF text labeling of a web page version of a fire fox browser.
Background
In the fire fox browser, only a preview function is provided in a PDF tool originally provided by the fire fox browser, the prior art on the market at present has a core solution method for solving the PDF text annotation of a webpage version, namely, the display effect of the annotation is achieved by switching canvas layers, the method only solves the annotation function visually from a user, and the PDF text annotation is not solved in a core in a real sense.
The prior art has the following defects: the selected marking information cannot be copied, and the canvas cannot realize page-crossing text marking aiming at PDFs of multiple pages.
Disclosure of Invention
The method, the device, the equipment and the storage medium for realizing the PDF text marking of the fire fox browser webpage version can set marking texts in a page-crossing manner, the marked texts are reproducible, the flexibility, the controllability and the expansibility are higher, and convenience is provided for PDF operation of users.
In a first aspect, an embodiment of the present invention provides a method for implementing PDF text labeling of a web page version of a fire fox browser, where the method includes:
according to a window.getselection method, obtaining DOM nodes when a mouse starts to drag and DOM nodes when the mouse ends to drag and text coordinates of selected contents in the DOM nodes;
judging whether the DOM node when the mouse starts to drag and the DOM node when the mouse finishes to drag are the same, and determining a labeling mode according to a judging result;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm.
The technical scheme is completely different from that of directly calculating canvas, the method directly calculates the selected DOM and text coordinate information to label, the original PDF analysis data structure can be greatly optimized and reserved, the text can be extracted, the text labeling color can be customized at will, a convenient API is provided for a secondary developer, and repeated labeling and assignment can be conveniently carried out.
Optionally, if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, the method for labeling the selected text according to the preset labeling algorithm includes:
setting a DOM node between the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag through the style, and selecting the background color of the text corresponding to the DOM node;
marking the DOM node when the mouse starts to drag and the selected text corresponding to the DOM node when the mouse ends to drag according to the head-tail cutting algorithm.
Optionally, the method for labeling the selected text corresponding to the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag according to the head-tail cutting algorithm comprises the following steps:
cutting an HTML character string into slices according to the DOM node and the text coordinates of the selected content in the node when the mouse starts to drag, the DOM node and the text coordinates of the selected content in the node when the mouse ends to drag, and the slice method of javascript;
adding DOM nodes at the cutting points in a splicing way, and setting background colors at the DOM nodes added in the splicing way;
the cutting point selects the content from the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag and the unselected content.
Optionally, the method further comprises:
acquiring node data corresponding to the selected text;
wherein the node data includes: selecting DOM nodes corresponding to the text and text coordinates of the selected text;
calculating the position information of DOM nodes corresponding to the selected text according to the node data;
wherein the location information includes: page number and node number;
the position information is used for positioning DOM nodes when the marking patterns are set and displayed back.
In a second aspect, an embodiment of the present invention provides a device for implementing PDF text labeling of a web page version of a fire fox browser, where the device includes:
the acquisition module is used for acquiring DOM nodes when the mouse starts to drag and when the mouse finishes dragging and text coordinates of selected contents in the DOM nodes according to a window. Getselection method;
the determining module is used for judging whether the DOM node when the mouse starts to drag and the DOM node when the mouse finishes dragging are the same or not, and determining a labeling mode according to a judging result;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm.
Optionally, the apparatus further comprises:
the computing module is used for acquiring node data corresponding to the selected text; calculating the position information of DOM nodes corresponding to the selected text according to the node data;
wherein the node data includes: selecting DOM nodes corresponding to the text and text coordinates of the selected text; the location information includes: page number and node number;
the position information is used for positioning DOM nodes when the marking patterns are set and displayed back.
In a third aspect, an embodiment of the present invention provides an electronic device, including a memory and a processor, where the memory stores a computer program, and the processor implements the method according to any implementation manner of the first aspect when executing the program.
In a fourth aspect, embodiments of the present invention provide a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements a method according to any of the implementations of the first aspect.
The invention provides a method and a device for realizing PDF text labeling of a fire fox browser webpage version, wherein the method comprises the following steps: according to a window.getselection method, obtaining DOM nodes when a mouse starts to drag and when the mouse ends to drag and text coordinates of selected contents in the DOM nodes; judging whether the DOM node when the mouse starts to drag and the DOM node when the mouse finishes to drag are the same, and determining a labeling mode according to a judging result; if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style; and if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm. The method can set the marked text across pages, the marked text is reproducible, the flexibility, the controllability and the expansibility are higher, and convenience is provided for PDF operation of users.
It should be understood that the description in this summary is not intended to limit the critical or essential features of the embodiments of the invention, nor is it intended to limit the scope of the invention. Other features of the present invention will become apparent from the description that follows.
Drawings
The above and other features, advantages and aspects of embodiments of the present invention will become more apparent by reference to the following detailed description when taken in conjunction with the accompanying drawings. In the drawings, the same or similar reference numerals denote the same or similar elements.
FIG. 1 is a flowchart of a method for realizing PDF text labeling of a web page version of a fire fox browser according to an embodiment of the invention;
fig. 2 is a schematic structural diagram of a device for implementing PDF text labeling of a web page version of a fire fox browser according to an embodiment of the present invention;
fig. 3 is a block diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to enable a person skilled in the art to better understand the technical solutions in one or more embodiments of the present specification, the technical solutions in one or more embodiments of the present specification will be clearly and completely described below with reference to the drawings in one or more embodiments of the present specification, and it is obvious that the described embodiments are only some embodiments of the present specification, not all embodiments. All other embodiments, which can be made by one or more embodiments of the present disclosure without inventive faculty, are intended to be within the scope of the present disclosure.
It should be noted that, the description of the embodiment of the present invention is only for the purpose of more clearly describing the technical solution of the embodiment of the present invention, and does not constitute a limitation on the technical solution provided by the embodiment of the present invention.
Fig. 1 is a flowchart of a method for implementing a fire fox browser web page PDF text labeling according to an embodiment of the present invention. As shown in fig. 1, includes:
s101, according to a window. Getselection method, obtaining DOM nodes when the mouse starts to drag and when the mouse ends to drag and text coordinates of selected contents in the DOM nodes.
Optionally, a Selection object is returned after using the window. Getselection method, which indicates the text range or the current position of the cursor selected by the user.
S102, judging whether the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, and determining the labeling mode according to the judging result.
Optionally, if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm.
For example, the background color of the text or the text color may be set when the text is annotated. However, the use of this method is not limited to setting the background color and the text color, and may be annotating or labeling.
Optionally, if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, the method for labeling the selected text according to the preset labeling algorithm includes:
setting a DOM node between the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag through the style, and selecting the background color of the text corresponding to the DOM node;
marking the DOM node when the mouse starts to drag and the selected text corresponding to the DOM node when the mouse ends to drag according to the head-tail cutting algorithm.
Optionally, the method for labeling the selected text corresponding to the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag according to the head-tail cutting algorithm comprises the following steps:
cutting an HTML character string into slices according to the DOM node and the text coordinates of the selected content in the node when the mouse starts to drag, the DOM node and the text coordinates of the selected content in the node when the mouse ends to drag, and the slice method of javascript;
adding DOM nodes at the cutting points in a splicing way, and setting background colors at the DOM nodes added in the splicing way;
the cutting point selects the content from the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag and the unselected content.
Alternatively, the slice method may extract a certain portion of the string and return the extracted portion with a new string; designating a portion of string extraction using start (contained) and end (not contained) parameters; wherein, the first character position in the character string is 0, the second character position is 1, and the like.
Alternatively, if negative, the parameter specifies a starting position from the end of the string.
Illustratively, -1 refers to the last character of the string, -2 refers to the penultimate character, and so on.
Illustratively, the text content may be as follows: "a test document, I are the start node, I are the first intermediate node, I are the second intermediate node, I are the third intermediate node, I are the fourth intermediate node, I are the fifth intermediate node, I are the sixth intermediate node, I are the seventh intermediate node, I are the end node. "
The content selected during the labeling of the user is as follows: "Start node, I are the first intermediate node, I are the second intermediate node, I are the third intermediate node, I are the fourth intermediate node, I are the fifth intermediate node, I are the sixth intermediate node, I are the seventh intermediate node, I are the nodes".
Wherein, "i am the first intermediate node, i am the second intermediate node, i am the third intermediate node, i am the fourth intermediate node, i am the fifth intermediate node, i am the sixth intermediate node, i am the seventh intermediate node" the background color of the text can be set by style.
Where "start node" and "I are nodes" are not all content in DOM node, so the HTML string is cut into pieces by the slice method of javascript, and the text after cutting is as follows: i are, start node, i are node, end node.
Then adding DOM nodes in a splicing way between I 'M is' and 'start node', 'I' M is 'knot' and 'bundle node', and setting background color at the DOM nodes added in the splicing way.
Optionally, the method for realizing the fire fox browser webpage PDF text labeling further comprises the following steps:
acquiring node data corresponding to the selected text;
wherein the node data includes: selecting DOM nodes corresponding to the text and text coordinates of the selected text;
calculating the position information of DOM nodes corresponding to the selected text according to the node data;
wherein the location information includes: page number and node number; the position information is used for positioning DOM nodes when the marking patterns are set and displayed back.
The embodiment of the invention provides a method for realizing PDF text labeling of a web page version of a fire fox browser, which comprises the following steps: according to a window.getselection method, obtaining DOM nodes when a mouse starts to drag and when the mouse ends to drag and text coordinates of selected contents in the DOM nodes; judging whether the DOM node when the mouse starts to drag and the DOM node when the mouse finishes to drag are the same, and determining a labeling mode according to a judging result; if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style; and if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm. The method can set the marked text across pages, the marked text is reproducible, the flexibility, the controllability and the expansibility are higher, and convenience is provided for PDF operation of users.
The following describes in detail the device provided in the embodiment of the present application and capable of executing the method for implementing PDF text labeling of web page version of firefox browser.
Fig. 2 is a schematic structural diagram of a device for implementing PDF text labeling of a web page version of a fire fox browser according to an embodiment of the present invention; as shown in fig. 2, the labeling device 20 includes:
the obtaining module 201 is configured to obtain, according to a window. Getselection method, a DOM node when the mouse starts to drag and when the mouse ends to drag, and text coordinates of selected content in the DOM node;
the determining module 202 is configured to determine whether the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, and determine a labeling mode according to a determination result;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm.
Optionally, the labeling device 20 further comprises:
the computing module 203 is configured to obtain node data corresponding to the selected text, where the computing module 203 is configured to obtain the node data; calculating the position information of DOM nodes corresponding to the selected text according to the node data;
wherein the node data includes: selecting DOM nodes corresponding to the text and text coordinates of the selected text; the location information includes: page number and node number;
the position information is used for positioning DOM nodes when the marking patterns are set and displayed back.
Optionally, the determining module 202 is further configured to set, through style, a background color of the selected text corresponding to a DOM node between a DOM node when the mouse starts to drag and a DOM node when the mouse ends to drag; marking the DOM node when the mouse starts to drag and the selected text corresponding to the DOM node when the mouse ends to drag according to the head-tail cutting algorithm.
Optionally, the determining module 202 is further configured to cut the HTML string into slices according to a DOM node when the mouse starts to drag and a text coordinate of the selected content in the DOM node, a DOM node when the mouse ends to drag and a text coordinate of the selected content in the DOM node, and a slice method of javascript; and splicing and adding DOM nodes at the cutting points, and setting background colors at the DOM nodes added by splicing.
The cutting point selects the content from the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag and the unselected content.
The embodiment of the present invention also provides a computer electronic device, fig. 3 shows a schematic diagram of a structure of an electronic device to which the embodiment of the present invention can be applied, and as shown in fig. 3, the computer electronic device includes a central processing module (CPU) 301 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 302 or a program loaded from a storage section 308 into a Random Access Memory (RAM) 303. In the RAM 303, various programs and data required for the system operation are also stored. The CPU 301, ROM 302, and RAM 303 are connected to each other through a bus 304. An input/output (I/O) interface 305 is also connected to bus 304.
The following components are connected to the I/O interface 305: an input section 306 including a keyboard, a mouse, and the like; an output portion 307 including a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, a speaker, and the like; a storage section 308 including a hard disk or the like; and a communication section 309 including a network interface card such as a LAN card, a modem, or the like. The communication section 309 performs communication processing via a network such as the internet. The drive 310 is also connected to the I/O interface 305 as needed. A removable medium 311 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is installed on the drive 310 as needed, so that a computer program read out therefrom is installed into the storage section 308 as needed.
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The modules or modules involved in the embodiments of the present invention may be implemented in software or in hardware. The described modules or modules may also be provided in a processor, for example, as: a processor includes an acquisition module 201 and a determination module 202, where the names of these modules do not in some cases define the module itself, for example, the acquisition module 201 may also be described as "an acquisition module 201 for acquiring the DOM node at the beginning of a mouse drag and the literal coordinates of the selected content within the DOM node at the end of the drag" according to the window.
As another aspect, the present invention further provides a computer readable storage medium, where the computer readable storage medium may be a computer readable storage medium included in a device for implementing PDF text labeling of a web page version of a fire fox browser in the above embodiment; or may be a computer-readable storage medium, alone, that is not incorporated into an electronic device. The computer readable storage medium stores one or more programs for use by one or more processors in performing a method for implementing a fire fox browser web page PDF text annotation described in the present invention.
The above description is only illustrative of the preferred embodiments of the present invention and of the principles of the technology employed. It will be appreciated by persons skilled in the art that the scope of the invention referred to in the present invention is not limited to the specific combinations of the technical features described above, but also covers other technical features formed by any combination of the technical features described above or their equivalents without departing from the inventive concept. Such as the above-mentioned features and the technical features disclosed in the present invention (but not limited to) having similar functions are replaced with each other.

Claims (6)

1. A method for realizing PDF text labeling of a fire fox browser webpage version is characterized by comprising the following steps:
according to a window.getselection method, obtaining DOM nodes when a mouse starts to drag and when the mouse ends to drag and text coordinates of selected contents in the DOM nodes;
judging whether the DOM node when the mouse starts to drag and the DOM node when the mouse finishes to drag are the same, and determining a labeling mode according to a judging result;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, the method for marking the selected text according to the preset marking algorithm comprises the following steps:
setting a background color of a selected text corresponding to a DOM node between the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag through a style;
marking a DOM node when the mouse starts to drag and a selected text corresponding to the DOM node when the mouse ends to drag according to a head-tail cutting algorithm;
the method for marking the selected text corresponding to the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag according to the head-tail cutting algorithm comprises the following steps:
cutting an HTML character string into slices according to DOM nodes and text coordinates of selected contents in the nodes when the mouse starts to drag, DOM nodes and text coordinates of selected contents in the nodes when the mouse ends to drag and a slice method of javascript;
adding DOM nodes at the cutting points in a splicing way, and setting background colors at the DOM nodes added in the splicing way; and the cutting point selects the content from the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag and the unselected content.
2. The method for realizing the fire fox browser webpage PDF text labeling according to claim 1, wherein the method further comprises the steps of:
acquiring node data corresponding to the selected text; the node data includes: the DOM node corresponding to the selected text and the text coordinates of the selected text;
calculating the position information of the DOM node corresponding to the selected text according to the node data; the location information includes: page number and node number;
the position information is used for positioning DOM nodes when the marking patterns are set and displayed back.
3. The device for realizing the PDF text labeling of the web page version of the fire fox browser is characterized by comprising the following components:
the acquisition module is used for acquiring DOM nodes when the mouse starts to drag and when the mouse ends to drag and text coordinates of selected contents in the DOM nodes according to a window. Getselection method;
the determining module is used for judging whether the DOM node when the mouse starts to drag and the DOM node when the mouse finishes dragging are the same or not, and determining a labeling mode according to a judging result;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are the same, setting the background color of the selected text directly through style;
if the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag are not the same, marking the selected text according to a preset marking algorithm;
the determining module is further used for setting background colors of the selected texts corresponding to DOM nodes between the DOM nodes when the mouse starts to drag and the DOM nodes when the mouse ends to drag through the style; marking a DOM node when the mouse starts to drag and a selected text corresponding to the DOM node when the mouse ends to drag according to a head-tail cutting algorithm;
the determining module is further used for cutting the HTML character string into slices according to the DOM node when the mouse starts to drag and the text coordinates of the selected content in the node, the DOM node when the mouse ends to drag and the text coordinates of the selected content in the node, and a slice method of javascript; adding DOM nodes at the cutting points in a splicing way, and setting background colors at the DOM nodes added in the splicing way; and the cutting point selects the content from the DOM node when the mouse starts to drag and the DOM node when the mouse ends to drag and the unselected content.
4. A device for realizing fire fox browser web page PDF text labeling according to claim 3, further comprising:
the computing module is used for acquiring node data corresponding to the selected text; calculating the position information of the DOM node corresponding to the selected text according to the node data; the node data includes: the DOM node corresponding to the selected text and the text coordinates of the selected text; the location information includes: page number and node number;
the position information is used for positioning DOM nodes when the marking patterns are set and displayed back.
5. An electronic device comprising a memory and a processor, the memory having stored thereon a computer program, the processor implementing the method of claim 1 or 2 when executing the computer program.
6. A computer-readable storage medium, characterized in that a computer program is stored, which, when being executed by a processor, implements the method according to claim 1 or 2.
CN202310513079.7A 2023-05-08 2023-05-08 Method and device for realizing PDF text labeling of webpages of fire fox browser Active CN116522876B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310513079.7A CN116522876B (en) 2023-05-08 2023-05-08 Method and device for realizing PDF text labeling of webpages of fire fox browser

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310513079.7A CN116522876B (en) 2023-05-08 2023-05-08 Method and device for realizing PDF text labeling of webpages of fire fox browser

Publications (2)

Publication Number Publication Date
CN116522876A CN116522876A (en) 2023-08-01
CN116522876B true CN116522876B (en) 2024-01-09

Family

ID=87393858

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310513079.7A Active CN116522876B (en) 2023-05-08 2023-05-08 Method and device for realizing PDF text labeling of webpages of fire fox browser

Country Status (1)

Country Link
CN (1) CN116522876B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016101755A1 (en) * 2014-12-26 2016-06-30 北京奇虎科技有限公司 Drag-and-drop operation-based method and device for adjusting location of object in cross pages
WO2019169205A1 (en) * 2018-02-28 2019-09-06 Rocky Kahn Document viewer aligning pdf and xml
CN112560403A (en) * 2019-09-26 2021-03-26 北京国双科技有限公司 Text processing method and device and electronic equipment
CN112733497A (en) * 2020-12-31 2021-04-30 达而观数据(成都)有限公司 Cross-platform high-performance marking engine
CN113807049A (en) * 2020-06-15 2021-12-17 阿里巴巴集团控股有限公司 Labeling component, labeling method, device, equipment and storage medium
CN114021522A (en) * 2021-11-09 2022-02-08 南京中孚信息技术有限公司 Research based on WEB document marking technology
CN114780881A (en) * 2022-03-01 2022-07-22 深圳开源互联网安全技术有限公司 Method and device for labeling text content of browser and electronic equipment

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9990343B2 (en) * 2016-06-27 2018-06-05 Synergy Platform Pty Ltd System and method for in-browser editing

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016101755A1 (en) * 2014-12-26 2016-06-30 北京奇虎科技有限公司 Drag-and-drop operation-based method and device for adjusting location of object in cross pages
WO2019169205A1 (en) * 2018-02-28 2019-09-06 Rocky Kahn Document viewer aligning pdf and xml
CN112560403A (en) * 2019-09-26 2021-03-26 北京国双科技有限公司 Text processing method and device and electronic equipment
CN113807049A (en) * 2020-06-15 2021-12-17 阿里巴巴集团控股有限公司 Labeling component, labeling method, device, equipment and storage medium
CN112733497A (en) * 2020-12-31 2021-04-30 达而观数据(成都)有限公司 Cross-platform high-performance marking engine
CN114021522A (en) * 2021-11-09 2022-02-08 南京中孚信息技术有限公司 Research based on WEB document marking technology
CN114780881A (en) * 2022-03-01 2022-07-22 深圳开源互联网安全技术有限公司 Method and device for labeling text content of browser and electronic equipment

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
基于Chrome浏览器的HTML批注技术研究;沈健;;电脑编程技巧与维护(第20期);全文 *
基于节点类型标注的网页主题信息抽取方法;谢方立;周国民;王健;;计算机科学(第S2期);全文 *
用画线对Web网页进行注释的协同浏览系统;方勃;贺贵明;;计算机工程(第22期);全文 *

Also Published As

Publication number Publication date
CN116522876A (en) 2023-08-01

Similar Documents

Publication Publication Date Title
US20190243848A1 (en) Generating a structured document guiding view
US7685514B1 (en) Method and system for incorporation of graphical print techniques in a web browser
CN109857388B (en) Code generation method, device, server and computer readable medium
JP2007122708A (en) System and method for text legibility enhancement
US7793224B1 (en) Methods and apparatus for formatting identified content
US20180321805A1 (en) Method for automatically applying page labels using extracted label contents from selected pages
CN112527291A (en) Webpage generation method and device, electronic equipment and storage medium
US20120072828A1 (en) Complex input to image transformation for distribution
US20150309966A1 (en) Method and apparatus for preserving fidelity of bounded rich text appearance by maintaining reflow when converting between interactive and flat documents across different environments
MX2008003413A (en) Script markup.
CN113536182A (en) Method and device for generating long text webpage, electronic equipment and storage medium
CN107451163B (en) Animation display method and device
CN109558123B (en) Method for converting webpage into electronic book, electronic equipment and storage medium
CN111562911A (en) Webpage editing method and device and storage medium
CN113360106B (en) Webpage printing method and device
CN116522876B (en) Method and device for realizing PDF text labeling of webpages of fire fox browser
JP5255483B2 (en) Method, apparatus and program for facilitating object selection
CN108694043B (en) Page decoration method and system
CN112328246A (en) Page component generation method and device, computer equipment and storage medium
US11126410B2 (en) Method and apparatus for building pages, apparatus and non-volatile computer storage medium
US20130254157A1 (en) Computer-implemented methods and systems for associating files with cells of a collaborative spreadsheet
JP5097117B2 (en) Positioning screen elements
US7849193B1 (en) Multiple hyperlinks in a uniform resource locator
CN108664511B (en) Method and device for acquiring webpage information
KR20130021482A (en) Method of editing html tables by cell unit

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant