WO2022184079A1 - 一种文档处理方法、装置、设备和介质 - Google Patents

一种文档处理方法、装置、设备和介质 Download PDF

Info

Publication number
WO2022184079A1
WO2022184079A1 PCT/CN2022/078732 CN2022078732W WO2022184079A1 WO 2022184079 A1 WO2022184079 A1 WO 2022184079A1 CN 2022078732 W CN2022078732 W CN 2022078732W WO 2022184079 A1 WO2022184079 A1 WO 2022184079A1
Authority
WO
WIPO (PCT)
Prior art keywords
language
content
target
response
universal
Prior art date
Application number
PCT/CN2022/078732
Other languages
English (en)
French (fr)
Inventor
彭龙腾
Original Assignee
北京字跳网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京字跳网络技术有限公司 filed Critical 北京字跳网络技术有限公司
Priority to JP2023553218A priority Critical patent/JP2024509149A/ja
Publication of WO2022184079A1 publication Critical patent/WO2022184079A1/zh
Priority to US18/458,704 priority patent/US20230409815A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/14Details of searching files based on file metadata
    • G06F16/148File search processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04842Selection of displayed objects or displayed text elements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/253Grammatical analysis; Style critique
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/263Language identification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Definitions

  • Embodiments of the present disclosure relate to information processing technologies, and in particular, to a document processing method, apparatus, device, and medium.
  • Embodiments of the present disclosure provide a document processing method, apparatus, device, and medium to effectively manage common languages in online documents.
  • an embodiment of the present disclosure provides a document processing method, including:
  • the target content in the edited content is updated to the universal language, and the universal language identifier is determined based on the edited content.
  • an embodiment of the present disclosure further provides a document processing method, the method being applied to a server, including:
  • an embodiment of the present disclosure further provides a document processing apparatus, including:
  • a content obtaining module configured to obtain editing content in response to receiving an editing operation of the target document by the first user
  • a request sending module configured to send a first data request to the server based on the edited content
  • a universal language update module configured to update the target content in the edited content to a universal language in response to receiving the universal language identifier sent by the server, where the universal language identifier is determined based on the edited content.
  • an embodiment of the present disclosure further provides a document processing apparatus, where the apparatus is configured on a server and includes:
  • a request receiving module configured to receive a first data request sent by the client, where the first data request includes editing content
  • a universal language identifier acquisition module configured to acquire the universal language identifier corresponding to the edited content based on the edited content
  • a universal language identifier sending module configured to send the universal language identifier corresponding to the edited content to the client.
  • an embodiment of the present disclosure further provides a document processing device, including:
  • processors one or more processors
  • memory for storing one or more programs
  • the one or more processors When the one or more programs are executed by the one or more processors, the one or more processors implement the document processing method according to any one of the embodiments of the present disclosure.
  • an embodiment of the present disclosure further provides a medium, where the medium stores a computer program, and when the computer program is executed by a processor, implements the document processing method according to any one of the embodiments of the present disclosure.
  • Embodiments of the present disclosure provide a document processing method, apparatus, device, and medium.
  • the above-mentioned document processing method includes: in response to receiving an editing operation of a target document by a first user, obtaining editing content; and sending a first editing content to a server based on the editing content A data request; in response to receiving the universal language identifier sent by the server, updating the target content in the edited content to the universal language, wherein the universal language identifier is determined based on the edited content.
  • the embodiment of the present disclosure automatically updates the target content in the editing content to the common language if it is recognized that the input editing content includes a common language, thereby realizing effective management of the common language in the online document.
  • FIG. 1 is a flowchart of a document processing method provided by an embodiment of the present disclosure
  • FIG. 2 is a schematic diagram of a first interaction area provided by an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of a second interaction area provided by an embodiment of the present disclosure.
  • FIG. 4 is a schematic diagram of a document operation page provided by an embodiment of the present disclosure.
  • FIG. 5 is a flowchart of a document processing method provided by an embodiment of the present disclosure.
  • FIG. 6 is a flowchart of an interactive method for document processing provided by an embodiment of the present disclosure.
  • FIG. 7 is a flowchart of an interactive method for document processing provided by an embodiment of the present disclosure.
  • FIG. 8 is a flowchart of an interactive method for document processing provided by an embodiment of the present disclosure.
  • FIG. 9 is a schematic structural diagram of a document processing apparatus provided by an embodiment of the present disclosure.
  • FIG. 10 is a schematic structural diagram of a document processing apparatus provided by an embodiment of the present disclosure.
  • FIG. 11 is a structural diagram of an electronic device provided by an embodiment of the present disclosure.
  • the term “including” and variations thereof are open-ended inclusions, ie, "including but not limited to”.
  • the term “based on” is “based at least in part on.”
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; the term “some embodiments” means “at least some embodiments”. Relevant definitions of other terms will be given in the description below.
  • FIG. 1 is a flowchart of a document processing method provided by an embodiment of the present disclosure, and this embodiment is applicable to the case of managing a common language in a document.
  • the method may be performed by a document processing apparatus, which may be implemented by means of software and/or hardware.
  • the document processing method is applied in the client.
  • the document processing method provided in this embodiment can be used on the client, and can be considered to be executed by the document processing apparatus integrated on the client.
  • the client may specifically be a computer device including a processor, a memory, an input device, and an output device, such as a notebook computer, a desktop computer, a tablet computer, and an intelligent terminal.
  • the document processing method provided by this embodiment mainly includes steps S11 , S12 and S13 .
  • the above-mentioned target document may be an editable document displayed in the display area of the client, for example, may be a common document such as a word document, an excel document, a pdf document, and a txt document, or may be other editable documents, which is not performed in this embodiment. limited.
  • the above-mentioned editing content refers to the content input by the staff in the current document page, which may be the content input by the user through the pasting method, or the content input by the user through the input method. In this embodiment, only the editing content is exemplified. rather than limiting.
  • the above server may be a backend server or a backend server, which provides computing or application services for clients in the network. It has the ability to respond to service requests, undertake services, and guarantee services, with high-speed CPU computing power, long-term reliable operation, strong I/O external data throughput, and better scalability.
  • the first data request includes editing content, and the first data request is used to request the server to identify whether there is a common language in the editing content. If the server recognizes the universal language identifier corresponding to the edited content and sends the universal language identifier to the client.
  • the common language refers to the language used by a work team to describe the same concept or knowledge point.
  • the work team maintains the common language without introducing other ambiguities and facilitates summary and clear explanation to facilitate subsequent knowledge transfer and information alignment.
  • the common language is widely used by all participants within the team in the process of software development based on DDD (program-driven design), so as to accurately transmit the description of business rules, facilitate team members to reach agreement on terminology identification, and facilitate understanding and dissemination.
  • the universal language identifier means that each universal language has a unique identification ID, through which the name and corresponding definition of the corresponding universal language can be found.
  • the target content is part or all of the edited content.
  • the client receives the editing content input by the first user on the target document page, generates a first data request based on the editing content, and sends it to the server, and the server identifies whether the editing content has a corresponding common language based on the editing content logo. If present, the above-mentioned universal language identifier is returned to the client. The client automatically updates the target content in the edited content to the common language according to the common language identifier sent by the server. If the edited content does not have a corresponding universal language identifier, the server does not perform any processing.
  • the universal language identifier corresponding to the edited content means that part or all of the edited content has a corresponding universal language identifier.
  • the target content in the edited content is automatically updated to the common language, which can be understood as directly updating the target content to the common language after receiving the common language identifier sent by the server, without manual intervention, which is convenient and fast .
  • updating the target content in the edited content to the universal language includes: displaying a first interaction area, the first interaction area and the The target content in the edited content is associated; in response to receiving the first operation by the first user on the first interaction area, the target content in the edited content is updated to a common language.
  • the interaction area may be understood as a display area displayed in the target document for the first user to interact with the client.
  • the interactive area may be a preset area in the target document page, or may also be the area where the cursor is located in the target document.
  • the preset area may be a preset fixed position, or may be changed following the change of the position of the edited content.
  • the predetermined position can be the bottom, top, left or right side of the document page.
  • the change can be the lower right, the upper right or the lower left of the editing content according to the change of the position of the editing content.
  • the server returns the universal language identifier corresponding to the target content to the client, and the client displays information associated with the target content in the first interaction area.
  • the above-mentioned associated information is used to prompt the first user whether to update the target content to the common language.
  • FIG. 2 is a schematic diagram of a first interaction area provided by an embodiment of the present disclosure. As shown in FIG. 2 , “Set as a common language?” is displayed in the first interaction area 11, and two selection buttons are displayed, the “Yes” button and "No” button.
  • the target content in the edited content is updated to a common language.
  • no processing is performed on the edited content.
  • the above-mentioned first operation may be a triggering operation of the "Yes" button in the first interaction area 11 to update the target content in the edited content to a common language.
  • the triggering operation may also be a triggering operation for the “No” button in the first interaction area 11 .
  • the above triggering operation on the button may be a single-click, double-click or touch operation on the button, which is not limited in this embodiment.
  • the operation is responded to, and the target content is updated to the common language.
  • the target content is still conventional editing content.
  • the first interaction area and the operation mode shown in FIG. 2 are only exemplary descriptions, not limitations, and other settings may also be performed according to actual page display scenarios.
  • updating the target content in the edited content to the universal language includes: displaying a second interaction area, where the second interaction area includes a general language identifier associated with the target content in the edited content; in response to receiving a second operation by the first user on the second interaction area, acquiring the general language identifier corresponding to the target content; responding After receiving the third operation of the first user on the second interaction area, the target content in the edited content is updated to the general language corresponding to the general language identifier.
  • the server recognizes that the target content in the edited content corresponds to multiple universal language identifiers, the server returns the multiple universal language identifiers corresponding to the target content to the client, and the client displays the target content in the second interaction area. Associated multiple common language identifiers.
  • FIG. 3 is a schematic diagram of a second interaction area provided by an embodiment of the present disclosure.
  • a first interaction area 21 is displayed in the lower right corner of the edited content, and the first interaction area 21 includes a plurality of common language identifiers, such as: Common language A, common language B, common language C, common language D, and, if not set as common language and other operable options, the staff can select the common language identifier corresponding to the target content in the above-mentioned second interaction area.
  • the target content is not processed in any way, but is only stored as ordinary text.
  • the matching degree of each universal language identifier and the target content may also be received, and the order of the universal language identifier in the second interaction area is determined by the above matching degree.
  • the top-to-bottom order of the universal language identifier in the second interaction area is the order of the matching degree of the corresponding target content on the universal language identifier from high to low. Impressions with the highest match are first and then decrease with match.
  • a general language identifier corresponding to the target content is obtained, and the target content is directly set as the general language corresponding to the general language identifier.
  • the general language identifier A is obtained, and the target content is directly updated to the general language corresponding to the general language identifier A.
  • the general language identifier corresponding to the target content is acquired, and the displayed content in the second interaction area is updated.
  • the first Information associated with the target content is displayed in the second interactive area.
  • the target content is updated to the general language corresponding to the general language identifier.
  • the third operation performed by the first user on the second interaction area is the same as the first operation in the first interaction area, for details, reference may be made to the description of the first operation in the first interaction area in the foregoing embodiment. Repeat.
  • the above-mentioned associated information is used to prompt the first user whether to set the target content as a common language.
  • updating the target content in the edited content to the universal language includes: when there are multiple target contents in the edited content , showing a third interaction area, where the third interaction area includes universal language identifiers associated with a plurality of the target contents; in response to receiving the fourth operation by the first user on the third interaction area, for The target content matches the corresponding universal language identifier; in response to receiving the fifth operation by the first user on the third interaction area, the target content in the edited content is updated to correspond to the matched universal language identifier common language.
  • the server recognizes that multiple target contents in the edited content have corresponding universal language identifiers respectively, the server returns the multiple universal language identifiers corresponding to the target contents to the client, and the client displays them in the third interactive area A common language identifier associated with multiple target content.
  • the client updates the target content in the edited content to a common language corresponding to the matched common language identifier.
  • target content a corresponds to universal language identifier A
  • target content b corresponds to universal language identifier B
  • target content c corresponds to universal language identifier C
  • target content d corresponds to universal language identifier C.
  • target content D corresponds to the universal language identifier D.
  • the method in response to receiving the editing operation of the target document by the first user, before acquiring the edited content, the method further includes: in response to the operation of the mode switch by the first user, enabling the universal language identified by the user Identify patterns.
  • a mode switch can be set in the document.
  • the above mode switch is used to turn on or off the general language recognition mode.
  • the general language recognition mode means that after the user enters the editing content, it will automatically recognize whether the editing content is the general language.
  • the general language recognition mode If the general language recognition mode is on, after the user enters the edited content, it will automatically identify whether the edited content is a common language, that is, execute the document processing method provided by the embodiment of the present disclosure, as described above in steps S11-S13. If the general language recognition mode is in the off state, after the user enters the editing content, the general language is not recognized, and is directly stored as a common editing text.
  • the general language recognition mode is turned on in response to an operation on the mode switch, where the operation on the mode switch includes single-clicking, double-clicking or touching the mode switch.
  • the mode switch is in the switch state, the general language recognition mode is turned off in response to the operation of the mode switch, and the operation of the mode switch includes single-clicking, double-clicking or touching the mode switch.
  • the document processing method provided by the embodiment of the present disclosure includes: in response to receiving an editing operation on a target document by a first user, obtaining editing content; sending a first data request to a server based on the editing content; The language identifier, which updates the target content in the edited content to a common language, and the common language identifier is determined based on the edited content.
  • the embodiments of the present disclosure realize effective management of the common language in the online document by automatically updating the target content in the editing content to the common language if the inputted editing content includes the common language during the editing of the online document.
  • the embodiments of the present disclosure provide another document processing method.
  • the document processing method includes: in response to an insertion operation of a general language, displaying the general language in the target document.
  • the operation of inserting the universal language can be understood as the operation of inserting the universal language in the target current document.
  • the above insert operation can be performed by right-clicking, or by the insert button in the function box at the top of the document. It is not limited in this embodiment.
  • the displaying the universal language in the target document in response to the insertion operation for the universal language includes: in response to the insertion operation for the universal language, displaying a first operation entry, the first The operation entry is used to acquire the target universal language; in response to acquiring the target universal language, display a second operation entry; in response to receiving an operation instruction for the second operation entry, display the object in the target document common language.
  • the above-mentioned second operation entry includes one or more operation entries of the target common language.
  • the operation instruction for the second operation entry may include any one option of single click, double click or touch of the second operation entry.
  • the insertion operation of the general language can be operated through the insert button at the top of the document, showing all options that can insert content in the document, including the first operation entry, in response to the general language option corresponding to the first operation entry. operate.
  • An insertion selection box can also be displayed in the target document, and the insertion selection box displays all options that can add content in the document, including the first operation entry, in response to the operation of the general language option corresponding to the first operation entry.
  • the target universal language is acquired, and in response to the acquisition of the target universal language, the second operation entry is displayed.
  • one target ubiquitous language is selected from the plurality of target ubiquitous languages.
  • the target content input by the first user in the target document is received, and the target content is updated to the common language.
  • FIG. 4 is a schematic diagram of a document operation page provided by an embodiment of the present disclosure.
  • click the right mouse button anywhere in the target document or perform a selection operation in the function bar at the top of the document the function list box 31 shown in Figure 4 is displayed in the target document.
  • Operate on the "Add" option 22 below, the current document displays the add selection box 23 shown in Figure 4, and operate the "common language” option of the first operation entry in the add selection box 23, the second displayed in the target document Check box 24.
  • FIG. 4 is a schematic diagram of a document operation page provided by an embodiment of the present disclosure.
  • the second selection box 24 includes a second operation entry, for example: common language A, common language B, common language C, and common language D, the staff can arbitrarily select a common language ID as the next The general language corresponding to the editing content of the document input.
  • FIG. 4 is only an exemplary description, and the method of inserting the common language is not limited.
  • displaying the universal language in the target document in response to the insertion operation for the universal language includes: acquiring target location information in response to the insertion operation for the universal language; based on the target location information to display the common language in the target document.
  • the target location information refers to the location where the universal language is inserted, and the location information is realized by the first user's insertion operation of the universal language.
  • the first user can insert the common language anywhere in the target document.
  • the target content subsequently input by the first user at the target location is updated to the target common language.
  • the method further includes: establishing an association relationship between the target content in the edited content and the universal language identifier; and storing the association relationship in a mapping relationship table.
  • the above-mentioned general language identifier may be the general language identifier selected by the first user in the first interactive area, the second interactive area or the third interactive area in the above-mentioned embodiment.
  • the target content and the selected common language identifier may be in a one-to-one relationship, or may be in a many-to-one relationship. That is, different target contents can be identified by the same common language.
  • the mapping relationship table stores the corresponding relationship between the target content and the general language identifier, so that the staff can conveniently obtain the general language ID corresponding to the target content in the subsequent document use process.
  • the method further includes: in response to the current editing mode being the first mode, changing the display mode of the target content in the editing content from the first mode to the second mode; or, in response to the current editing mode being The third mode is to change the display mode of the target content in the edited content from the third mode to the second mode associated with the third mode.
  • the first mode can be understood as a display mode of regular content in the target document.
  • the second mode refers to an exclusive display of target content in a common language.
  • the first display mode is the Song Dynasty, the fourth font.
  • the second display mode may be a display mode such as adding an underline, adding a text background, bolding, or slashing.
  • the third mode can be understood as a display mode in which rendering has been added on the basis of the display mode of the conventional content in the target document.
  • the second mode refers to the exclusive display of the target content in the presence of a common language.
  • the difference between the third display mode and the first display mode is that the first mode is only the basic display of the document content, and the third display mode adds a rendering display mode on the basis of the first mode.
  • the second display mode associated with the third mode is a rendering display mode added on the basis of the third mode.
  • the first display mode is the Song Dynasty, the fourth font.
  • the third mode is Arial, four-point font, bold.
  • the second display mode associated with the third mode is Arial, four-point font, bold, and underlined.
  • the second display mode associated with the third mode is Arial, four-point font, bold, add text background.
  • the above document processing method further includes: in response to a triggering operation on the updated target content, sending a second data request to the server, wherein the second data request includes a common language The identifier and the identifier of the target document; in response to receiving the second data sent by the server, displaying the general language description information corresponding to the general language identifier.
  • the triggering operation on the updated target content may be a selection operation on the updated target content, a click operation on the updated target content, or a mouse cursor indicating to Updated target content.
  • the general language description information refers to the detailed explanation information of the general language corresponding to the target content.
  • the second data request is mainly used for the client to request the server for detailed explanation information of the general language corresponding to the target content.
  • a second data request is sent to the server.
  • the server determines a corresponding preset mapping relationship table based on the identifier of the target document, and performs a query based on the target content in the preset mapping relationship table. If the general language identifier corresponding to the target content is queried in the preset mapping relationship table, it means that the target content currently operated is a general language, and the general language description information corresponding to the target content is obtained based on the general language identifier, and sent to Client displays.
  • the client can display the general language description information in the paraphrase display box.
  • the definition display box may be displayed in a preset area on the target document page, or the definition display box may be displayed on the area where the cursor is located in the document area.
  • a paraphrase frame is a frame that hovers over a document.
  • the preset area can be a preset fixed position, or a position that changes with the content of the document.
  • the predetermined position can be the bottom, top, left or right side of the document page.
  • the position that follows the change of the document content can be the lower right, upper right or lower left of the editing content.
  • the paraphrase display box is set at the bottom of the document.
  • the staff needs to check the specific content of the common language corresponding to the target content "ABCD”, and can place the mouse cursor on the target content "ABCD", or click the target content "ABCD” with the mouse
  • the client sends a second data request to the server after detecting the operator's operation.
  • the server returns the general language description information corresponding to the target content, and displays the general language description information in the definition display box in the target document.
  • the paraphrase display box can be closed through the close button of the paraphrase display box; the paraphrase display box can also be closed by operating the document content "ABCD" again.
  • the above-mentioned document processing method further includes: in response to the adding operation of the common language, receiving the name of the common language and the description information of the common language; sending the name of the common language and the description information of the common language to the server so that the server creates a corresponding universal language record.
  • the common language name may be the name specified by the staff for the common language.
  • the client can receive an instruction to establish or update the universal language library entered by the user in the document, and in response to the addition operation of the universal language, receive the universal language name and universal language description information input by the user, and request the server's The interface of the universal language library, pass in the universal language name and the corresponding universal language description information.
  • the server After the server receives the request, it determines that there is no duplicate common language, and if there is no duplicate common language, it generates a common language record, updates the common language library, and returns the processing result to the client.
  • FIG. 5 is a flowchart of a document processing method provided by an embodiment of the present disclosure. This embodiment can be applied to the case of managing a common language in a document.
  • the method may be performed by a document processing apparatus, which may be implemented by means of software and/or hardware.
  • the document processing method is applied in a server.
  • the document processing method provided by this embodiment mainly includes steps S21, S22 and S23.
  • the above server may be a backend server or a backend server, which provides computing or application services for clients in the network. It has the ability to respond to service requests, undertake services, and guarantee services, with high-speed CPU computing power, long-term reliable operation, strong I/O external data throughput, and better scalability.
  • the server serves the client.
  • a communication connection is established between the client and the server, and data can be exchanged and transmitted.
  • the client receives the editing content input by the staff on the target document page, generates a first data request based on the editing content, sends the first data request to the server, and the server receives the first data request sent by the client.
  • the obtaining the universal language identifier corresponding to the editing content based on the editing content includes: performing aggregation processing on at least one piece of data information corresponding to the editing content, and determining the total data information after the aggregation processing. When the total amount of data information meets the threshold condition, obtain the general language identifier corresponding to the edited content.
  • the threshold condition refers to the condition corresponding to the common language identifier based on the editing content query.
  • the above-mentioned threshold condition may be an attribute feature of the edited content, for example, it may be the number of text characters of the edited content, the text format, the font size of the edited content, and so on.
  • the binlog logs in the MySQL database are monitored; and multiple binlog logs corresponding to the target document are acquired.
  • the modifications corresponding to the multiple binlog logs are aggregated into one modification, and the specific modification content is extracted.
  • the word count of the specific modified content reaches a threshold condition, the general language identifier corresponding to the edited content is acquired.
  • MySQL database is an open source row-column database that can store user data.
  • Binlog log is a binary log maintained by the MySQL database service layer. It is mainly used to record SQL statements that update or potentially update MySQL data, and save them in the disk in the form of "transactions". Simply put, it records the action of each operation of the database, and the action of operating MySQL can be repeated like a movie through binlog.
  • the common language module asynchronously monitors the Binlog log of the MySQL database and parses the binlog log.
  • delayed queue consumption is adopted, and multiple binlog records of a certain document are consumed each time, and then the corresponding modifications are aggregated into one modification, and then the specific modified content is extracted. If the number of text words added by the specifically modified content exceeds the threshold condition, the general language identifier corresponding to the edited content is acquired.
  • the threshold condition in this embodiment is 10 characters. It should be noted that the threshold condition is that 10 characters are obtained according to experience, and the threshold condition can also be set according to the actual application situation, which is only described in this embodiment, but not limited.
  • the obtaining, based on the editing content, the general language identifier corresponding to the editing content includes: performing word segmentation on the editing content to obtain a plurality of word elements corresponding to the editing content; The predicates are matched in the universal language database, and the universal language identifier corresponding to each word is determined.
  • word segmentation is performed on the edited content locally, the corresponding lexical elements are extracted from the edited content, and the lexical elements are searched in the general language ElasticSearch to accurately match the lexical elements.
  • Any word segmentation strategy can be used in the above word segmentation processing, for example, the Chinese word segmentation strategy of Jieba. In this embodiment, only the word segmentation strategy is described, but not limited.
  • querying the general language identifier corresponding to the edited content includes: determining a query result by performing fuzzy matching on the edited content in a general language database.
  • ElasticSearch is a search engine based on Lucene library. It provides a distributed, multi-tenant full-text search engine.
  • the word element is used to go to the general language ElasticSearch search database to accurately match the word element
  • the unsegmented edited content is used to go to the ElasticSearch search database for fuzzy search
  • the query result is determined according to the results of the two searches.
  • the search speed can be improved by going to the general language ElasticSearch search database to precisely match the word elements
  • the unsegmented edited content can be used to go to the ElasticSearch search database for fuzzy search, which can improve the recall rate of the query.
  • no content may be returned.
  • the general language identifiers returned to the client may be general language identifiers sorted according to the matching degree from high to low.
  • L is equal to the preset threshold.
  • the above preset thresholds can be set according to actual application conditions. For example: it can be 3 or 5.
  • the sorting process may not be performed, and all the queried universal language IDs are returned to the client.
  • the specific operation of automatically recognizing the common language can be performed only after the current user ID has turned on the common language recognition mode. If the current user ID does not have the universal language recognition mode turned on, you can do nothing.
  • the above document processing method further includes: receiving a second data request sent by the client; wherein the second data request includes a universal language identifier and an identifier of the target document; querying the universal language database for all The general language description information corresponding to the general language identifier is sent to the client, and the second data includes the general language description information corresponding to the general language identifier.
  • the server after receiving the second data request, the server needs to determine the general language description information corresponding to the target content based on the general language identifier in the second data request. Specifically, the server determines a corresponding preset mapping relationship table based on the identifier of the target document, and performs a query based on the target content in the preset mapping relationship table. If the general language identifier corresponding to the target content is queried in the preset mapping relationship table, it means that the target content currently operated is a general language, and the general language description information corresponding to the target content is obtained based on the general language identifier, and sent to Client displays.
  • the second data request further includes: a user identification; the method further includes: determining whether the user has the authority of the target document based on the user identification and the target document identification; in response to the The user has the authority of the target document, and performs the operation of querying the general language description information corresponding to the general language identifier in the general language library.
  • the above rights may refer to the rights to modify, input, delete, and edit.
  • the authority in this embodiment mainly refers to the authority to query the universal language identifier.
  • the operation of querying the general language description information corresponding to the target content under the general language identifier in the general language library is performed.
  • the corresponding relationship between the current user ID and the target document ID is not found in the document ID and user ID correspondence table, it is determined that the current user ID does not have the current document authority, and the user is directly prompted that it does not have the authority to query the common language ID. In this way, the security and confidentiality of the document can be improved.
  • the above document processing method further includes: receiving the name of the common language and description information of the common language sent by the client; querying a common language library based on the name of the common language; in response to not finding the common language name of the language, the universal language library is updated based on the name of the universal language and the description information of the universal language.
  • the client can receive an instruction to establish or update the universal language library entered by the user in the document, and in response to the addition operation of the universal language, receive the universal language name and universal language description information input by the user, and request the server's The interface of the universal language library, pass in the universal language name and the corresponding universal language description information.
  • the server After the server receives the request, it determines that there is no duplicate common language, and if there is no duplicate common language, it generates a common language record, updates the common language library, and returns the processing result to the client.
  • FIG. 6 is a flowchart of an interaction method for document processing provided by an embodiment of the present disclosure. As shown in FIG. 6 , the interaction method provided in this embodiment mainly includes the following steps:
  • the client receives the edited content in the target document.
  • the client sends the edited content to the server.
  • the server queries the general language identifier corresponding to the edited content.
  • the server determines the general language identifier corresponding to the edited content based on the query result.
  • the server sends the universal language identifier to the client.
  • the client displays a second interaction area in the target document, where the second interaction area includes one or more general language identifiers.
  • the client updates the target content in the edited content to a common language in response to the operation on the second interaction area.
  • FIG. 7 is a flowchart of an interaction method for document processing provided by an embodiment of the present disclosure. As shown in FIG. 7 , the interaction method provided in this embodiment mainly includes the following steps:
  • the client responds to an operation on the document content in the target document.
  • the client sends the server a second data request.
  • the server determines whether the current user ID has the current document authority based on the current user ID and the target document ID.
  • the server searches the general language database for the general language description information corresponding to the document content under the general language identifier.
  • the server returns the general language description information to the client.
  • the client displays the general language description information.
  • FIG. 8 is a flowchart of an interaction method for document processing provided by an embodiment of the present disclosure. As shown in FIG. 8 , the interaction method provided in this embodiment mainly includes the following steps:
  • the client receives the general language name and the general language description information in response to the operation on the target document.
  • the client sends the general language name and the general language description information to the server.
  • the server searches the general language database by using the general language name.
  • the server returns the update result of the general language library to the client.
  • FIG. 9 is a schematic structural diagram of a document processing apparatus provided by an embodiment of the present disclosure. This embodiment is applicable to the case of managing a general language in a document, and the apparatus may be implemented by means of software and/or hardware.
  • the document processing apparatus is configured in the client.
  • the document processing apparatus mainly includes: a content obtaining module 91 , a request sending module 92 and a general language updating module 93 .
  • the content acquisition module 91 is configured to acquire the edited content in response to receiving the editing operation of the target document by the first user;
  • a request sending module 92 configured to send a first data request to the server based on the edited content
  • the universal language update module 93 is configured to update the target content in the edited content to the universal language in response to receiving the universal language identifier sent by the server, where the universal language identifier is determined based on the edited content.
  • the document processing apparatus obtains the edited content in response to receiving the editing operation of the target document by the first user; sends a first data request to the server based on the edited content; in response to receiving the universal language identifier sent by the server , the target content in the edited content is updated to a common language, wherein the common language identifier is determined based on the edited content.
  • the embodiment of the present disclosure automatically updates the target content in the editing content to the common language if it is recognized that the input editing content includes a common language, thereby realizing effective management of the common language in the online document.
  • the universal language update module 93 is specifically configured to display a first interaction area, where the first interaction area is associated with the target content in the edited content; The first operation of the first interaction area is performed, and the target content in the edited content is updated to the common language.
  • the universal language update module 93 is specifically configured to display a second interaction area, where the second interaction area includes a universal language identifier associated with the target content in the edited content; in response to receiving the The first user's second operation on the second interaction area obtains a general language identifier corresponding to the target content; in response to receiving the third operation by the first user on the second interaction area, the The target content in the editing content is updated to the general language corresponding to the general language identifier.
  • the general language update module 93 is specifically configured to display a third interaction area when there are multiple target contents in the edited content, and the third interaction area includes a plurality of target contents related to the target contents. in response to receiving the fourth operation on the third interaction area by the first user, matching the corresponding universal language identifier for the target content; in response to receiving the first user's response to all The fifth operation in the third interaction area is performed, and the target content in the edited content is updated to the general language corresponding to the matched general language identifier.
  • the universal language update module 93 is further configured to establish an association relationship between the target content in the edited content and the universal language identifier; and store the association relationship in a mapping relationship table.
  • the universal language update module 93 is further configured to, in response to the current editing mode being the first mode, change the display mode of the target content in the editing content from the first mode to the second mode; or, in response to When the current editing mode is the third mode, the display mode of the target content in the editing content is changed from the third mode to the second mode associated with the third mode.
  • the request sending module 92 is further configured to send a second data request to the server in response to a triggering operation on the updated target content, wherein the second data request includes a universal language identifier and the identification of said target document;
  • the apparatus further includes: a description information display module, configured to display the general language description information corresponding to the general language identifier in response to receiving the second data sent by the server.
  • a description information display module configured to display the general language description information corresponding to the general language identifier in response to receiving the second data sent by the server.
  • the apparatus further includes: a universal language display module, configured to display the universal language in the target document in response to the insertion operation of the universal language.
  • the universal language display module is specifically configured to display a first operation entry in response to the insertion operation of the universal language, and the first operation entry is used to obtain the target universal language; in response to acquiring the target universal language language, displaying a second operation entry; in response to receiving an operation instruction for the second operation entry, displaying the target common language in the target document.
  • the universal language display module is specifically configured to acquire target location information in response to an insertion operation of the universal language; and display the universal language in the target document based on the target location information.
  • the apparatus further includes: an addition operation response module, configured to receive the name of the universal language and the description information of the universal language in response to the addition operation of the universal language;
  • a description information sending module configured to send the name of the universal language and the description information of the universal language to the server, so that the server creates a corresponding universal language record.
  • the target content is part or all of the edited content.
  • the document processing apparatus provided by the embodiment of the present disclosure can execute the document processing method provided by any embodiment of the present disclosure, and has corresponding functional modules and beneficial effects for executing the data processing method.
  • FIG. 10 is a schematic structural diagram of a document processing apparatus provided by an embodiment of the present disclosure. This embodiment is applicable to the case of managing a general language in a document, and the apparatus may be implemented by means of software and/or hardware.
  • the document processing apparatus is configured in the server side.
  • the document processing apparatus mainly includes: a request receiving module 101 , a general language identifier acquiring module 102 and a general language identifier sending module 103 .
  • a request receiving module 101 configured to receive a first data request sent by a client, where the first data request includes editing content
  • a universal language identifier acquisition module 102 configured to acquire, based on the edited content, a universal language identifier corresponding to the edited content
  • the universal language identifier sending module 103 is configured to send the universal language identifier corresponding to the edited content to the client.
  • the general language identifier acquisition module 102 is specifically configured to perform word segmentation processing on the edited content to obtain multiple word elements corresponding to the edited content; Determine the universal language identifier for each token.
  • the universal language identifier acquisition module 102 specifically aggregates data information corresponding to at least one of the edited content, and determines the total amount of data information after the aggregation processing; when the total amount of data information meets a threshold condition , and obtain the general language identifier corresponding to the edited content.
  • the request receiving module 101 is further used for a second data request sent by the client; wherein, the second data request includes a universal language identifier and an identifier of a target document;
  • the device further includes: a query module for querying the general language description information corresponding to the general language identifier in the general language library;
  • a data sending module configured to send second data to the client, where the second data includes general language description information corresponding to the general language identifier.
  • the second data request further includes: a user identity; the apparatus further includes:
  • an authority determination module configured to determine whether the user has the authority of the target document based on the user identification and the target document identification; in response to the user having the authority of the target document, executing in the universal language library The operation of querying the general language description information corresponding to the general language identifier.
  • the apparatus further includes: a description information receiving module, configured to receive the name of the common language and the description information of the common language sent by the client;
  • a name query module for querying the universal language library based on the name of the universal language
  • the language base updating module is configured to update the universal language base based on the name of the universal language and the description information of the universal language in response to the name of the universal language not being queried.
  • the document processing apparatus provided by the embodiment of the present disclosure can execute the document processing method provided by any embodiment of the present disclosure, and has corresponding functional modules and beneficial effects for executing the data processing method.
  • FIG. 11 it shows a schematic structural diagram of an electronic device (eg, a terminal device or a server in FIG. 11 ) 1100 suitable for implementing an embodiment of the present disclosure.
  • Terminal devices in the embodiments of the present disclosure may include, but are not limited to, such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablets), PMPs (portable multimedia players), vehicle-mounted terminals (eg, mobile terminals such as in-vehicle navigation terminals), etc., and stationary terminals such as digital TVs, desktop computers, and the like.
  • the electronic device shown in FIG. 11 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
  • the electronic device 1100 may include a processing device (eg, a central processing unit, a graphics processor, etc.) 1101 that may be loaded into random access according to a program stored in a read only memory (ROM) 1102 or from a storage device 11011 Various appropriate actions and processes are executed by the programs in the memory (RAM) 1103 . In the RAM 1103, various programs and data necessary for the operation of the electronic device 1100 are also stored.
  • the processing device 1101, the ROM 1102, and the RAM 1103 are connected to each other through a bus 1104.
  • An input/output (I/O) interface 1105 is also connected to the bus 1104 .
  • the following devices can be connected to the I/O interface 1105: input devices 1108 including, for example, a touch screen, touchpad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a liquid crystal display (LCD), speakers, vibration An output device 11011 of a computer, etc.; a storage device 1108 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 1109. Communication means 1109 may allow electronic device 1100 to communicate wirelessly or by wire with other devices to exchange data.
  • FIG. 11 shows an electronic device 1100 having various means, it should be understood that not all of the illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
  • embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
  • the computer program may be downloaded and installed from the network via the communication device 1109, or from the storage device 1108, or from the ROM 1102.
  • the processing apparatus 1101 the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
  • the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • Program code embodied on a computer readable medium may be transmitted using any suitable medium including, but not limited to, electrical wire, optical fiber cable, RF (radio frequency), etc., or any suitable combination of the foregoing.
  • the client and server can use any currently known or future developed network protocols such as HTTP (HyperText Transfer Protocol) to communicate, and can communicate with digital data in any form or medium.
  • Data communications eg, communication networks
  • Examples of communication networks include local area networks (“LAN”), wide area networks (“WAN”), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently known or future development network of.
  • LAN local area networks
  • WAN wide area networks
  • the Internet eg, the Internet
  • peer-to-peer networks eg, ad hoc peer-to-peer networks
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device:
  • the target content in the edited content is updated to the universal language, and the universal language identifier is determined based on the edited content.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device:
  • Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider through Internet connection).
  • LAN local area network
  • WAN wide area network
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
  • the units involved in the embodiments of the present disclosure may be implemented in a software manner, and may also be implemented in a hardware manner. Among them, the name of the unit does not constitute a limitation of the unit itself under certain circumstances.
  • exemplary types of hardware logic components include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on Chips (SOCs), Complex Programmable Logical Devices (CPLDs) and more.
  • FPGAs Field Programmable Gate Arrays
  • ASICs Application Specific Integrated Circuits
  • ASSPs Application Specific Standard Products
  • SOCs Systems on Chips
  • CPLDs Complex Programmable Logical Devices
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device.
  • the machine-readable medium can be a machine-readable signal medium or a machine-readable storage medium.
  • Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing.
  • machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read only memory
  • EPROM or flash memory erasable programmable read only memory
  • CD-ROM compact disk read only memory
  • magnetic storage or any suitable combination of the foregoing.
  • a document processing method, apparatus, device, and medium including:
  • the target content in the edited content is updated to the universal language, and the universal language identifier is determined based on the edited content.
  • a document processing method, apparatus, device, and medium are provided.
  • the target content in the edited content is updated in response to receiving a universal language identifier sent by the server.
  • for common languages including:
  • the first interactive area is associated with the target content in the edited content
  • the target content in the edited content is updated to a common language.
  • a document processing method, apparatus, device, and medium are provided.
  • the target content in the edited content is updated in response to receiving a universal language identifier sent by the server.
  • for common languages including:
  • the second interaction area including a common language identifier associated with the target content in the edited content
  • the target content in the edited content is updated to a general language corresponding to the general language identifier.
  • a document processing method, apparatus, device, and medium are provided.
  • the target content in the edited content is updated in response to receiving a universal language identifier sent by the server.
  • for common languages including:
  • a third interaction area is displayed, and the third interaction area includes general language information associated with the multiple target contents;
  • the target content in the edited content is updated to a common language corresponding to the matched common language identifier.
  • a document processing method, apparatus, device and medium further comprising:
  • the association relationship is stored in the mapping relationship table.
  • a document processing method, apparatus, device and medium further comprising:
  • the display mode of the target content in the editing content is changed from the third mode to the second mode associated with the third mode.
  • a document processing method, apparatus, device and medium further comprising:
  • the general language description information corresponding to the general language identifier is displayed.
  • a document processing method, apparatus, device and medium further comprising:
  • the common language is displayed in the target document.
  • a document processing method, apparatus, device, and medium are provided.
  • displaying the common language in the target document includes:
  • a first operation entry is displayed, and the first operation entry is used to obtain the target universal language;
  • the target universal language is displayed in the target document.
  • a document processing method, apparatus, device, and medium are provided.
  • displaying the common language in the target document includes:
  • the common language is displayed in the target document based on the target location information.
  • a document processing method, apparatus, device and medium further comprising:
  • the name of the common language and the description information of the common language are sent to the server, so that the server creates a corresponding common language record.
  • a document processing method, apparatus, device, and medium are provided, and the target content is a part or all of the edited content.
  • a document processing method, apparatus, device, and medium including: receiving a first data request sent by a client, where the first data request includes editing content;
  • a document processing method, apparatus, device, and medium are provided.
  • the obtaining, based on the edited content, a general language identifier corresponding to the edited content includes:
  • the general language identifier corresponding to each word element is determined.
  • a document processing method, apparatus, device, and medium are provided.
  • the obtaining, based on the edited content, a general language identifier corresponding to the edited content includes:
  • the general language identifier corresponding to the edited content is acquired.
  • a document processing method, apparatus, device and medium further comprising:
  • the client receiving a second data request sent by the client; wherein the second data request includes a universal language identifier and an identifier of the target document;
  • a document processing method, apparatus, device, and medium wherein the second data request further includes: a user identification; and the method further includes:
  • a document processing method, apparatus, device and medium further comprising:
  • the universal language database is updated based on the name of the universal language and the description information of the universal language.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Bioethics (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Software Systems (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Library & Information Science (AREA)
  • Document Processing Apparatus (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

提供一种文档处理方法、装置、设备和介质,文档处理方法包括:响应于接收到第一用户对目标文档的编辑操作,获取编辑内容(S11);基于编辑内容,向服务器发送第一数据请求(S12);响应于接收到服务器发送的通用语言标识,将编辑内容中的目标内容更新为通用语言,其中,通用语言标识基于编辑内容确定(S13)。实施例在编辑在线文档过程中,如果识别到输入的编辑内容中包括通用语言,自动将编辑内容中的目标内容更新为通用语言,实现了对在线文档中的通用语言进行有效管理。

Description

一种文档处理方法、装置、设备和介质
本公开要求于2021年03月02日提交中国国家知识产权局、申请号为202110232055.5、发明名称为“一种文档处理方法、装置、设备和介质”的中国专利申请的优先权,其全部内容通过引用结合在本公开中。
技术领域
本公开实施例涉及信息处理技术,尤其涉及一种文档处理方法、装置、设备和介质。
背景技术
用户在使用文档记录专业信息时,会将相关的资料写到文档中。由于每个团队涉及到的领域知识是不一样的,同个团队的不同用户负责的技术领域也不一样。导致团队的用户需要使用一个通用语言来描述一个概念,并且保持该通用语言不会引入其他的歧义以及方便汇总和清晰的解释,方便后续的知识传递和信息对齐。
但是现有技术中,没有有效的方法对在线文档中的通用语言进行管理。
发明内容
本公开实施例提供一种文档处理方法、装置、设备和介质,以对在线文档中的通用语言进行有效管理。
第一方面,本公开实施例提供了一种文档处理方法,包括:
响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;
基于所述编辑内容,向服务器发送第一数据请求;
响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,所述通用语言标识基于所述编辑内容确定。
第二方面,本公开实施例还提供了一种文档处理方法,所述方法应用于服务器,包括:
接收客户端发送的第一数据请求,所述第一数据请求包括编辑内容;
基于所述编辑内容,获取所述编辑内容对应的通用语言标识;
将所述编辑内容对应的通用语言标识发送至所述客户端。
第三方面,本公开实施例还提供了一种文档处理装置,包括:
内容获取模块,用于响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;
请求发送模块,用于基于所述编辑内容,向服务器发送第一数据请求;
通用语言更新模块,用于响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,所述通用语言标识基于所述编辑内容确定。
第四方面,本公开实施例还提供了一种文档处理装置,所述装置配置于服务器,包括:
请求接收模块,用于接收客户端发送的第一数据请求,所述第一数据请求包括编辑内容;
通用语言标识获取模块,用于基于所述编辑内容,获取所述编辑内容对应的通用语言标识;
通用语言标识发送模块,用于将所述编辑内容对应的通用语言标识发送至所述客户端。
第五方面,本公开实施例还提供了一种文档处理设备,包括:
一个或多个处理器;
存储器,用于存储一个或多个程序;
当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如本公开实施例中任一项所述的文档处理方法。
第六方面,本公开实施例还提供了一种介质,所述介质存储有计算机程序,所述计算机程序被处理器执行时实现如本公开实施例中任一项所述的文档处理方法。
本公开实施例提供一种文档处理方法、装置、设备和介质,上述文档处理方法包括:响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;基于编辑内容,向服务器发送第一数据请求;响应于接收到服务器发送的通用语言标识,将编辑内容中的目标内容更新为通用语言,其中,通用语言标识基于编辑内容确定。本公开实施例在编辑在线文档过程中, 如果识别到输入的编辑内容中包括通用语言,自动将编辑内容中的目标内容更新为通用语言,实现了对在线文档中的通用语言进行有效管理。
附图说明
结合附图并参考以下具体实施方式,本公开各实施例的上述和其他特征、优点及方面将变得更加明显。贯穿附图中,相同或相似的附图标记表示相同或相似的元素。应当理解附图是示意性的,原件和元素不一定按照比例绘制。
图1是本公开实施例提供的一种文档处理方法的流程图;
图2是本公开实施例提供的第一交互区域的示意图;
图3是本公开实施例提供的第二交互区域的示意图;
图4是本公开实施例提供的一种文档操作页面示意图;
图5是本公开实施例提供的一种文档处理方法的流程图;
图6是本公开实施例提供的一种文档处理的交互方法的流程图;
图7是本公开实施例提供的一种文档处理的交互方法的流程图;
图8是本公开实施例提供的一种文档处理的交互方法的流程图;
图9是本公开实施例提供的一种文档处理装置的结构示意图;
图10是本公开实施例提供的一种文档处理装置的结构示意图;
图11是本公开实施例提供的一种电子设备的结构图。
具体实施方式
下面将参照附图更详细地描述本公开的实施例。虽然附图中显示了本公开的某些实施例,然而应当理解的是,本公开实施例可以通过各种形式来实现,而且不应该被解释为限于这里阐述的实施例,相反提供这些实施例是为了更加透彻和完整地理解本公开。应当理解的是,本公开的附图及实施例仅用于示例性作用,并非用于限制本公开的保护范围。
应当理解,本公开实施例的方法实施方式中记载的各个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本公开实施例的范围在此方面不受限制。
本文使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。
需要注意,本公开实施例中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。
需要注意,本公开实施例中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有明确指出,否则应该理解为“一个或多个”。
本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。
图1是本公开实施例提供的一种文档处理方法的流程图,本实施例可适用于对文档中的通用语言进行管理的情况。该方法可以由文档处理装置来执行,所述装置可以通过软件和/或硬件的方式来实现。所述文档处理方法应用于客户端中。
需要说明的是,本实施例所提供的文档处理方法具体可以在客户端上使用,并可认为具体由集成在客户端上的文档处理装置执行。其中,客户端具体可以是包括了处理器、存储器、输入装置和输出装置的计算机设备,如笔记本电脑、台式计算机、平板电脑以及智能终端等。
如图1所示,本实施例提供的文档处理方法主要包括步骤S11、S12和S13。
S11、响应于接收到第一用户对目标文档的编辑操作,获取编辑内容。
其中,上述目标文档可以是客户端的展示区域中展示的可编辑文档,例如,可以是word文档、excel文档、pdf文档、txt文档等常用文档,也可以是其他可编辑文档,本实施例不进行限定。上述编辑内容是指工作人员在当前文档页面中输入的内容,可以是用户通过粘贴方式输入的内容,也可以是用户通过输入法输入的内容,本实施例中仅对编辑内容进行示例性说明,而非限定。
S12、基于所述编辑内容,向服务器发送第一数据请求。
其中,上述服务器可以是后台服务器或者后端服务器,在网络中为客户端提供计算或者应用服务。具备承担响应服务请求、承担服务、保障服务的能力,具有高速的CPU运算能力、长时间的可靠运行、强大的I/O外部数据吞吐能力以及更好的扩展性。
其中,第一数据请求中包括编辑内容,第一数据请求用于请求服务器识别编辑内容中是否存在通用语言。如果服务器识别到编辑内容对应的通用语言标识并将该通用语言标识发送至客户端。
S13、响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,所述通用语言标识基于所述编辑内容确定。
其中,通用语言是指一个工作团队用来描述相同概念或者知识点语言,该工作团队保持该通用语言不会引入其他的歧义以及方便汇总和清晰的解释,方便后续知识传递和信息对齐。
具体的,通用语言是在基于DDD(程序驱动设计)软件开发的过程中,团队范围内所有参与人员广泛使用,才能准确传递业务规则描述,方便团队成员对术语标识达成一致,方便理解和传播。通用语言标识是指每个通用语言都有唯一一个的标识ID,通过该通用语言标识可以找到对应的通用语言的名称以及对应的释义。所述目标内容为所述编辑内容的部分内容或者全部内容。
在本实施例中,客户端接收第一用户在目标文档页面中输入的编辑内容,基于上述编辑内容生成第一数据请求,并发送至服务器,服务器基于编辑内容识别编辑内容是否存在对应的通用语言标识。如果存在,则将上述通用语言标识返回至客户端。客户端根据接收到服务器发送的通用语言标识,将编辑内容中的目标内容自动更新为通用语言。如果编辑内容不存在对应的通用语言标识,则服务器不作任何处理。其中,编辑内容对应的通用语言标识是指编辑内容中的部分或者全部内容有对应的通用语言标识。
在本公开实施例中,将编辑内容中的目标内容自动更新为通用语言,可以理解为接收到服务器发送的通用语言标识后,直接将目标内容更新为 通用语言,不需要进行人工干预,方便快捷。
在一个实施方式中,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:展示第一交互区域,所述第一交互区域与所述编辑内容中的目标内容相关联;响应于接收到所述第一用户对所述第一交互区域的第一操作,将所述编辑内容中的目标内容更新为通用语言。
其中,交互区域可以理解为目标文档中展示的一个可供第一用户与客户端进行交互的显示区域。进一步的,交互区域可以是目标文档页面中的预设区域,或者,也可以是在目标文档中光标所在区域。预设区域可以是预先设置的固定位置,也可以是跟随编辑内容位置的变化而变化。例如:预定位置可以是文档页面的底部、顶部、左侧或者右侧。跟随编辑内容位置的变化而变化可以是编辑内容的右下方、右上方或者左下方。
在本实施例中,服务器将目标内容对应的通用语言标识返回至客户端,客户端在第一交互区域中显示与目标内容相关联的信息。上述相关联的信息用于提示第一用户是否将目标内容更新为通用语言。
图2是本公开实施例提供的第一交互区域的示意图,如图2所示,在第一交互区域11中显示“设置为通用语言?”,并显示两个选择按钮,“是”按钮和“否”按钮。
进一步的,响应于第一用户对第一交互区域的第一操作,将所述编辑内容中的目标内容更新为通用语言。响应于第一用户对第一交互区域的触发操作,不对编辑内容进行任何处理。
例如:上述第一操作可以是对第一交互区域11中“是”按钮的触发操作,将所述编辑内容中的目标内容更新为通用语言。进一步的,触发操作还可以是对第一交互区域11中“否”按钮的触发操作。上述对按钮的触发操作可以是对按钮单击,双击或者触摸操作,本实施例中并不进行限定。
具体的,接收到第一用户对“是”按钮的触发操作后,响应该操作,并将目标内容更新为通用语言。接收到第一用户对“否”按钮的触发操作后,响应该操作,不对目标内容进行任何处理,目标内容还是常规的编辑内容。
需要说明的是,图2中展示的第一交互区域以及操作方式仅仅是示例 性描述,而非限定,还可以根据实际页面展示场景进行其他设置。
在一个实施方式中,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:展示第二交互区域,所述第二交互区域包括与所述编辑内容中的目标内容相关联的通用语言标识;响应于接收到所述第一用户对所述第二交互区域的第二操作,获取与所述目标内容对应的通用语言标识;响应于接收到所述第一用户对所述第二交互区域的第三操作,将所述编辑内容中的目标内容更新为与所述通用语言标识对应的通用语言。
在本实施例中,服务器识别到编辑内容中的目标内容对应多个通用语言标识,服务器将目标内容对应的多个通用语言标识返回至客户端,客户端在第二交互区域中显示与目标内容相关联的多个通用语言标识。
图3是本公开实施例提供的第二交互区域的示意图,如图3所示,在编辑内容的右下角显示第一交互区域21,第一交互区域21中包括多个通用语言标识,例如:通用语言A,通用语言B,通用语言C,通用语言D,以及,不设置为通用语言等可操作选项,工作人员可以在上述第二交互区域中选出目标内容对应的通用语言标识。
示例性的,检测到第一用户对第二交互区域中的“不设置为通用语言”选项的点击操作,则将目标内容不进行任何处理,只是作为普通文本进行存储。
进一步的,在接收到服务器发送的通用语言标识的同时,也可以接收每个通用语言标识与目标内容的匹配度,通用语言标识在第二交互区域中的顺序由上述匹配度确定。例如:通用语言标识在第二交互区域中的由上到下的顺序由是该通用语言标识上对应目标内容的匹配度由大到小的顺序。匹配度最高的展示在第一位,然后随着匹配度依次降低。
在一个实施方式中,响应于接收到第一用户对第二交互区域的第二操作,获取与所述目标内容对应的通用语言标识,直接将目标内容设置为通用语言标识对应的通用语言。
例如:检测到第一用户对第二交互区域中的“通用语言A”选项的点击操作,则获取通用语言标识A,则直接将目标内容更新为通用语言标识A 对应的通用语言。
在一个实施方式中,响应于接收到第一用户对第二交互区域的第二操作,获取与所述目标内容对应的通用语言标识,并更新第二交互区域中的显示内容,此时,第二交互区域中显示与目标内容相关联的信息。响应于接收到第一用户对第二交互区域的第三操作,将目标内容更新为与所述通用语言标识对应的通用语言。响应于接收到第一用户对第二交互区域的第三操作与第一交互区域中第一操作相同,具体可以参照上述实施例中第一交互区域中第一操作的描述,本实施例中不再赘述。上述相关联的信息用于提示第一用户是否将目标内容设置为通用语言。
在一个实施方式中,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:当所述编辑内容中的目标内容为多个时,展示第三交互区域,所述第三交互区域包括与多个所述目标内容相关联的通用语言标识;响应于接收到所述第一用户对所述第三交互区域的第四操作,为所述目标内容匹配对应的通用语言标识;响应于接收到所述第一用户对所述第三交互区域的第五操作,将所述编辑内容中的目标内容更新为与匹配的通用语言标识对应的通用语言。
在本实施例中,服务器识别到编辑内容中的多个目标内容分别存在对应的通用语言标识,服务器将目标内容对应的多个通用语言标识返回至客户端,客户端在第三交互区域中显示与多个目标内容相关联的通用语言标识。客户端响应于接收到第一用户对所述第三交互区域的第五操作,将编辑内容中的目标内容更新为与匹配的通用语言标识对应的通用语言。
例如:编辑内容中的多个目标内容分别存在对应的通用语言标识可以理解为目标内容a对应通用语言标识A,目标内容b对应通用语言标识B,目标内容c对应通用语言标识C,目标内容d对应通用语言标识D。
在上述实施例的基础上,响应于接收到第一用户对目标文档的编辑操作,获取编辑内容之前,所述方法还包括:响应于第一用户对模式开关的操作,开启用户标识的通用语言识别模式。
具体的,可以在文档中设置模式开关,上述模式开关用于开启或者关闭通用语言识别模式,通用语言识别模式是指用户输入编辑内容之后会自 动识别编辑内容是否是通用语言。
如果通用语言识别模式是开启状态,则执行用户输入编辑内容之后会自动识别编辑内容是否是通用语言,即执行本公开实施例提供的文档处理方式,如上述步骤S11-S13。如果通用语言识别模式是关闭状态,则执行用户输入编辑内容之后,不进行通用语言的识别,直接作为普通的编辑文本进行存储。
进一步的,在所述模式开关是关闭状态时,响应于对模式开关的操作,开启通用语言识别模式,上述对模式开关的操作包括单击、双击或者触摸模式开关。在所述模式开关是开关状态时,响应于对模式开关的操作,关闭通用语言识别模式,上述对模式开关的操作包括单击、双击或者触摸模式开关。
本公开实施例提供的文档处理方法,包括:响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;基于编辑内容,向服务器发送第一数据请求;响应于接收到服务器发送的通用语言标识,将编辑内容中的目标内容更新为通用语言,通用语言标识基于编辑内容确定。本公开实施例通过在编辑在线文档过程中,如果输入的编辑内容中包括通用语言,会自动将编辑内容中的目标内容更新为通用语言,实现了对在线文档中的通用语言进行有效管理。
在上述实施例的基础上,本公开实施例提供另一种文档处理方法,所述文档处理方法包括:响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言。
在本实施例中,对通用语言的插入操作可以理解为在目标当前文档中的插入通用语言的操作。上述插入操作可以是通过点击右键进行操作,也可以通过文档顶部的功能框中的插入按钮进行操作。本实施例中不进行限定。
在一个实施方式中,所述响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言,包括:响应于对通用语言的插入操作,展示第一操作入口,所述第一操作入口用于获取目标通用语言;响应于获取到所述目标通用语言,展示第二操作入口;响应于接收到对所述第二操作入口 的操作指令,在所述目标文档中显示所述目标通用语言。
其中,上述第二操作入口中包括一个或多个目标通用语言的操作入口。对第二操作入口的操作指令,可以是包括单击、双击或者触摸第二操作入口中的任意一个选项。
进一步的,对通用语言的插入操作,可以通过文档顶部的插入按钮进行操作,展示所有可以在文档中插入内容的选项,包括第一操作入口,响应于对第一操作入口对应的通用语言选项的操作。也可以在目标文档中展示插入选择框,插入选择框中展示所有可以在文档中添加内容的选项,包括第一操作入口,响应于对第一操作入口对应的通用语言选项的操作。
进一步的,响应于对第一操作入口中通用语言选项的操作之后,获取目标通用语言,响应于获取到所述目标通用语言,展示第二操作入口。响应于对第二操作入口的操作,从多个目标通用语言中选出一个目标通用语言。接收第一用户在目标文档中输入的目标内容,将该目标内容更新为通用语言。
在一个示例性的实施方式中,图4是本公开实施例提供的一种文档操作页面示意图。如图4所示,在目标文档中任意位置单击鼠标右键或者在文档顶部的功能栏中进行选择操作,目标文档中展示如图4所示的功能列表框31,对上述功能列表中的“在下方添加”选项22进行操作,当前文档中展示如图4所示的添加选择框23,对添加选择框23中的第一操作入口“通用语言”选项进行操作,目标文档中展示的第二选择框24。如图4所示,第二选择框24中包括第二操作入口,例如:通用语言A,通用语言B,通用语言C,通用语言D,工作人员可以任意选择一种通用语言ID作为接下来在文档输入的编辑内容对应的通用语言。
需要说明的是,图4仅是示例性的说明,通用语言的插入方法,而非限定。
在一个实施方式中,所述响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言,包括:响应于对通用语言的插入操作,获取目标位置信息;基于所述目标位置信息,在所述目标文档中显示所述通用语言。
在本实施例中,目标位置信息是指插入通用语言的位置,该位置信息由第一用户对通用语言的插入操作来实现。第一用户可以在目标文档中的任何位置插入通用语言。
进一步的,基于上述实施例获取到目标通用语言后,将后续第一用户在目标位置输入的目标内容更新为目标通用语言。
在一个实施例中,基于上述实施例还包括:建立所述编辑内容中的目标内容与所述通用语言标识的关联关系;将所述关联关系存储至映射关系表中。
上述通用语言标识可以是上述实施例中从第一交互区域、第二交互区域或者第三交互区域中第一用户选出的通用语言标识。
建立目标内容与选出的通用语言标识之间的对应关系。需要说明的是,目标内容与通用语言标识可以是一对一的关系,也可以是多对一的关系。即不同的目标内容可以同一个通用语言标识。
映射关系表中存储目标内容与通用语言标识的对应关系,方便后续文档使用过程中,工作人员可以方便的获取到目标内容对应的通用语言ID。
进一步的,基于上述实施例还包括:响应于当前编辑模式为第一模式,将所述编辑内容中的目标内容的显示模式由第一模式变更为第二模式;或者,响应于当前编辑模式为第三模式,将所述编辑内容中的目标内容的显示模式由第三模式变更为与所述第三模式相关联的第二模式。
在本实施例中,第一模式可以理解为目标文档中常规内容的显示模式。第二模式是指存在通用语言的目标内容的专有显示方式。
例如:第一显示模式是宋体,四号字体。第二显示模式可以是添加下划线,添加文字背景,加粗,或者切斜等显示方式。
在本实施例中,第三模式可以理解为目标文档中常规内容的显示模式的基础上已经增加了渲染的显示方式。第二模式是指在存在通用语言的目标内容的专有显示方式。
需要说明的是,第三显示模式与第一显示模式的区别是第一模式仅仅是文档内容的基础显示,第三显示方式在第一模式的基础上增加了渲染的显示方式。与第三模式关联的第二显示模式是在第三模式的基础上添加渲 染显示方式。
例如:第一显示模式是宋体,四号字体。第三模式是宋体,四号字体,加粗。相应的,与第三模式关联的第二显示模式是宋体,四号字体,加粗,添加下划线。或者与第三模式关联的第二显示模式是宋体,四号字体,加粗,添加文字背景。
上述将目标内容进行区别显示方式可以使得工作人员在后续使用文档时,明确知道哪些内容是通用语言。
在上述实施例的基础上,上述文档处理方法还包括:响应于对更新后的所述目标内容的触发操作,向所述服务器发送第二数据请求,其中,所述第二数据请求包括通用语言标识和所述目标文档的标识;响应于接收到所述服务器发送的第二数据,展示与所述通用语言标识对应的通用语言描述信息。
在本实施例中,对更新后的所述目标内容的触发操作可以是对更新后的目标内容的选中操作,也可以是对更新后的目标内容的点击操作,还可以是鼠标的光标指示到更新后的目标内容。通用语言描述信息是指目标内容对应的通用语言的详细解释信息。
第二数据请求主要用于客户端向服务器请求目标内容对应的通用语言的详细解释信息。
在本实施例中,响应于对更新后的所述目标内容的触发操作之后,向所述服务器发送第二数据请求。服务器基于目标文档的标识确定对应的预设映射关系表,在预设的映射关系表中基于目标内容进行查询。如果在预设映射关系表中查询到目标内容对应的通用语言标识,则表示当前操作的目标内容是一种通用语言,并基于该通用语言标识获取目标内容对应的通用语言描述信息,并发送至客户端进行展示。
进一步的,客户端接收到服务器返回的目标内容对应的通用语言描述信息后,可以在释义展示框中展示通用语言的描述信息。
其中,目标文档中上释义展示框的方式可以有多种,例如,可以在目标文档页面上的预设区域显示释义展示框,或者,在文档区域中光标所在区域上显示释义展示框。释义展示框是悬浮在文档上的展示框。预设区域 可以是预先设置的固定位置,也可以是跟随文档内容的变化位置。例如:预定位置可以是文档页面的底部、顶部、左侧或者右侧。跟随文档内容的变化位置可以是编辑内容的右下方、右上方或者左下方。
例如:释义展示框设置在文档的底部。在工作人员浏览文档的过程中,工作人员需要查看目标内容“ABCD”对应的通用语言的具体内容,可以将鼠标的光标放置在目标内容“ABCD”上,或者用鼠标单击目标内容“ABCD”,客户端检测到工作人员的操作之后,向服务器发送第二数据请求。服务器返回该目标内容对应的通用语言描述信息,并在目标文档中的释义展示框中展示通用语言的描述信息。进一步的,当工作人员查看完毕之后,可以通过释义展示框的关闭按钮关闭释义展示框;也可以通过再次对文档内容“ABCD”进行操作,关闭释义展示框。
本实施例的技术方案,工作人员将鼠标放通用语言上时,会有浮窗展示解析该通用语言的具体含义。
在上述实施例的基础上,上述文档处理方法还包括:响应于对通用语言的添加操作,接收通用语言的名称和通用语言的描述信息;将所述通用语言的名称和通用语言的描述信息发送至所述服务器,以使所述服务器创建对应的通用语言记录。
其中,通用语言名称可以是工作人员给通用语言规定的名字。
在本实施例中,客户端可以接收用户在文档中输入的建立或更新通用语言库的指令,响应于对通用语言的添加操作,接收用户输入的通用语言名称和通用语言描述信息,请求服务器的通用语言库的接口,传入通用语言名称以及对应的通用语言描述信息。服务器接收到请求后,判断没有重复的通用语言,如果没有重复的通用语言,就生成一个通用语言记录,并更新通用语言库,然后返回处理结果给客户端。
图5是本公开实施例提供的一种文档处理方法的流程图,本实施例可适用于对文档中的通用语言进行管理的情况。该方法可以由文档处理装置来执行,所述装置可以通过软件和/或硬件的方式来实现。所述文档处理方法应用于服务器中。
如图5所示,本实施例提供的文档处理方法主要包括步骤S21、S22和 S23。
S21、接收客户端发送的第一数据请求,所述第一数据请求包括编辑内容。
其中,上述服务器可以是后台服务器或者后端服务器,在网络中为客户端提供计算或者应用服务。具备承担响应服务请求、承担服务、保障服务的能力,具有高速的CPU运算能力、长时间的可靠运行、强大的I/O外部数据吞吐能力以及更好的扩展性。
进一步的,服务端是为客户端服务的。客户端和服务器之间建立通信连接,可以进行数据的交互和传输。
具体的,客户端接收工作人员在目标文档页面中输入的编辑内容,并基于编辑内容生成第一数据请求,将上述第一数据请求发送至服务器,服务器接收客户端发送的第一数据请求。
S22、基于所述编辑内容,获取所述编辑内容对应的通用语言标识。
在一个实施方式中,所述基于所述编辑内容,获取所述编辑内容对应的通用语言标识,包括:将至少一个所述编辑内容对应的数据信息进行聚合处理,确定聚合处理之后的数据信息总量;当所述数据信息总量满足阈值条件时,获取所述编辑内容对应的通用语言标识。
其中,阈值条件是指基于编辑内容查询对应通用语言标识的条件。上述阈值条件可以是编辑内容的属性特征,例如:可以是编辑内容的文本字数,文本格式,编辑内容的字体大小等等。
在本实施例中,监听MySQL数据库中的binlog日志;获取所述目标文档对应的多个binlog日志。将所述多个binlog日志对应的修改聚合成一次修改,并提取具体修改内容。在所述具体修改内容的字数达到阈值条件,获取所述编辑内容对应的通用语言标识。
MySQL数据库一种开源的行列式数据库,可以存放用户的数据。Binlog日志是MySQL数据库服务层维护的一种二进制日志,其主要是用来记录对MySQL数据更新或潜在发生更新的SQL语句,并以"事务"的形式保存在磁盘中。简单说就是记录了每次操作数据库的动作,可以通过binlog将操作MySQL的动作像放电影一样重来一次。
具体的,每次用户在目标文档页面中的输入操作完成后,会将对应的变更内容写入MySQL数据库。接收到客户端发送的编辑内容后,通用语言模块异步监听MySQL数据库的Binlog日志,解析binlog日志。在本实施例中,采用延迟队列消费,每次会消费某文档多个binlog记录,然后将对应的修改聚合成一次修改,然后提取具体修改的内容。如果具体修改的内容增加的文本字数超过阈值条件,则获取所述编辑内容对应的通用语言标识。
优选的,本实施例中的阈值条件是10个字符。需要说明的是,阈值条件是10个字符是根据经验得到的,阈值条件还可以根据实际应用情况进行设定,本实施例中仅进行说明,而非限定。
在一个实施方式中,所述基于所述编辑内容,获取所述编辑内容对应的通用语言标识,包括:将所述编辑内容进行分词处理,得到所述编辑内容对应的多个词元;通过所述词元在通用语言库中进行匹配,确定每个词元对应的通用语言标识。
具体的,将编辑内容在本地进行分词处理,将编辑内容提取出对应的词元,通过词元去通用语言ElasticSearch搜索库中去精准匹配词元。
上述分词处理可以采用任何分词策略,例如结巴中文分词策略,本实施例中仅对分词策略进行说明,而非限定。
在另一个实施方式中,查询所述编辑内容对应的通用语言标识,包括:通过编辑内容在通用语言库中进行模糊匹配,确定查询结果。
具体的,通过未分词的编辑内容去ElasticSearch搜索库去模糊查找。
其中,ElasticSearch是一个基于Lucene库的搜索引擎。它提供了一个分布式、支持多租户的全文搜索引擎。
在另一个实施方式中,通过词元去通用语言ElasticSearch搜索库中去精准匹配词元,并且通过未分词的编辑内容去ElasticSearch搜索库去模糊查找,根据两次查找的结果确定查询结果。
本实施例中,通过词元去通用语言ElasticSearch搜索库中去精准匹配词元可以提高查询的速度,通过未分词的编辑内容去ElasticSearch搜索库去模糊查找,可以提高查询的召回率。
S23、将所述编辑内容对应的通用语言标识发送至所述客户端。
在一个实施方式中,如果未查询到编辑内容对应的通用语言标识,可以不返回任何内容。
在另一个实施方式中,如果查询到一个或多个通用语言标识,则可以按照查询到的通用语言标识的数量进行不同的处理。
具体的,如果查询到的通用语言标识的数量小于或等于预设阈值,则可以将查询到的所有通用语言标识返回至客户端。其中,返回至客户端的通用语言标识可以是按照匹配度由高到低排序后的通用语言标识。
具体的,如果查询到的通用语言标识的数量大于预设阈值,则将查询到的所有通用语言标识按照匹配度由高到低进行排序,将排在前L个的通用语言标识返回至客户端。其中,L与预设阈值相等。上述预设阈值可以根据实际应用情况进行设置。例如:可以是3个或者5个。
在另一个实施方式中,也可以不进行排序处理,将所有查询到的通用语言ID均返回至客户端。
在本实施例中,当前用户标识开启通用语言识别模式之后,才可以执行具体的自动识别通用语言的操作。如果当前用户标识未开启通用语言识别模式,则可以不执行任何操作。
在一个实施例中,上述文档处理方法还包括:接收所述客户端发送的第二数据请求;其中,所述第二数据请求包括通用语言标识和目标文档的标识;在通用语言库中查询所述通用语言标识对应的通用语言描述信息;将第二数据发送至客户端,所述第二数据包括所述通用语言标识对应的通用语言描述信息。
在本实施例中,服务器接收到第二数据请求后,需要基于第二数据请求中的通用语言标识确定目标内容对应的通用语言描述信息。具体的,服务器基于目标文档的标识确定对应的预设映射关系表,在预设的映射关系表中基于目标内容进行查询。如果在预设映射关系表中查询到目标内容对应的通用语言标识,则表示当前操作的目标内容是一种通用语言,并基于该通用语言标识获取目标内容对应的通用语言描述信息,并发送至客户端进行展示。
进一步的,所述第二数据请求还包括:用户标识;所述方法还包括:基于所述用户标识和所述目标文档标识,确定所述用户是否具有所述目标文档的权限;响应于所述用户具有所述目标文档的权限,执行在通用语言库中查询所述通用语言标识对应的通用语言描述信息的操作。
在目标文档标识与用户标识对应关系表中进行查询,确定用户标识是否有目标文档的权限。上述权项可以是指修改、输入、删除、编辑等权限。本实施例中的权限主要是指查询通用语言标识的权限。
如果在文档标识与用户标识对应关系表可以查询到当前用户标识和目标文档标识的对应关系,则确定当前用户标识有目标文档权限。然后执行在通用语言库中查询所述目标内容在所述通用语言标识下对应的通用语言描述信息的操作。
如果在文档标识与用户标识对应关系表未查询到当前用户标识和目标文档标识的对应关系,则确定当前用户标识没当前文档权限,直接提示用户,不具备查询通用语言标识的权限。这样,可以提高文档的安全性和保密性。
在一个实施例中,上述文档处理方法,还包括:接收客户端发送的通用语言的名称和通用语言的描述信息;基于所述通用语言的名称查询通用语言库;响应于未查询到所述通用语言的名称,则基于所述通用语言的名称和通用语言的描述信息更新通用语言库。
在本实施例中,客户端可以接收用户在文档中输入的建立或更新通用语言库的指令,响应于对通用语言的添加操作,接收用户输入的通用语言名称和通用语言描述信息,请求服务器的通用语言库的接口,传入通用语言名称以及对应的通用语言描述信息。服务器接收到请求后,判断没有重复的通用语言,如果没有重复的通用语言,就生成一个通用语言记录,并更新通用语言库,然后返回处理结果给客户端。
图6是本公开实施例提供的一种文档处理的交互方法的流程图,如图6所示,本实施例中提供的交互方法主要包括如下步骤:
S31、客户端接收目标文档中的编辑内容。
S32、客户端将所述编辑内容发送至服务器。
S33、服务器在所述编辑内容满足查询条件时,查询所述编辑内容对应的通用语言标识。
S34、服务器基于查询结果确定所述编辑内容对应的通用语言标识。
S35、服务器将所述通用语言标识发送至客户端。
S36、客户端在所述目标文档中展示第二交互区域,所述第二交互区域中包括一种或多种通用语言标识。
S37、客户端响应于对所述二交互区域的操作,将所述编辑内容中的目标内容更新为通用语言。
上述各个步骤的具体实施方式可以参照上述实施例中的描述,本实施例中不再赘述。
图7是本公开实施例提供的一种文档处理的交互方法的流程图,如图7所示,本实施例中提供的交互方法主要包括如下步骤:
S41、客户端响应于对所述目标文档中文档内容的操作。
S42、客户端所述服务器发送第二数据请求。
S43、服务器基于所述当前用户标识和所述目标文档标识,确定当前用户标识是否有当前文档权限。
S44、服务器在当前用户标识有当前文档权限时,在通用语言库中查询在所述通用语言标识下所述文档内容对应的通用语言描述信息。
S45、服务器将通用语言描述信息返回至客户端。
S46、客户端展示通用语言描述信息。
上述各个步骤的具体实施方式可以参照上述实施例中的描述,本实施例中不再赘述。
图8是本公开实施例提供的一种文档处理的交互方法的流程图,如图8所示,本实施例中提供的交互方法主要包括如下步骤:
S51、客户端响应于对所述目标文档的操作,接收通用语言名称和通用语言描述信息。
S52、客户端将通用语言名称和通用语言描述信息发送至所述服务器。
S53、服务器通过所述通用语言名称在所述通用语言库中进行查询。
S54、服务器若未查询到所述通用语言名称,则基于所述通用语言名称 和通用语言描述信息更新通用语言库。
S55、服务器将通用语言库更新结果返回至客户端。
上述各个步骤的具体实施方式可以参照上述实施例中的描述,本实施例中不再赘述。
图9是本公开实施例提供的一种文档处理装置的结构示意图,本实施例可适用于对文档中的通用语言进行管理的情况,所述装置可以通过软件和/或硬件的方式来实现。所述文档处理装置配置于客户端中。
如图9所示,本实施例提供的文档处理装置主要包括:内容获取模块91、请求发送模块92和通用语言更新模块93。
其中,内容获取模块91,用于响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;
请求发送模块92,用于基于所述编辑内容,向服务器发送第一数据请求;
通用语言更新模块93,用于响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,所述通用语言标识基于所述编辑内容确定。
本实施例提供的文档处理装置,通过响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;基于编辑内容,向服务器发送第一数据请求;响应于接收到服务器发送的通用语言标识,将编辑内容中的目标内容更新为通用语言,其中,通用语言标识基于编辑内容确定。本公开实施例在编辑在线文档过程中,如果识别到输入的编辑内容中包括通用语言,自动将编辑内容中的目标内容更新为通用语言,实现了对在线文档中的通用语言进行有效管理。
在一个实施方式中,通用语言更新模块93,具体用于展示第一交互区域,所述第一交互区域与所述编辑内容中的目标内容相关联;响应于接收到所述第一用户对所述第一交互区域的第一操作,将所述编辑内容中的目标内容更新为通用语言。
在一个实施方式中,通用语言更新模块93,具体用于展示第二交互区域,所述第二交互区域包括与所述编辑内容中的目标内容相关联的通用语 言标识;响应于接收到所述第一用户对所述第二交互区域的第二操作,获取与所述目标内容对应的通用语言标识;响应于接收到所述第一用户对所述第二交互区域的第三操作,将所述编辑内容中的目标内容更新为与所述通用语言标识对应的通用语言。
在一个实施方式中,通用语言更新模块93,具体用于当所述编辑内容中的目标内容为多个时,展示第三交互区域,所述第三交互区域包括与多个所述目标内容相关联的通用语言信息;响应于接收到所述第一用户对所述第三交互区域的第四操作,为所述目标内容匹配对应的通用语言标识;响应于接收到所述第一用户对所述第三交互区域的第五操作,将所述编辑内容中的目标内容更新为与匹配的通用语言标识对应的通用语言。
在一个实施方式中,通用语言更新模块93,还用于建立所述编辑内容中的目标内容与所述通用语言标识的关联关系;将所述关联关系存储至映射关系表中。
在一个实施方式中,通用语言更新模块93,还用于响应于当前编辑模式为第一模式,将所述编辑内容中的目标内容的显示模式由第一模式变更为第二模式;或者,响应于当前编辑模式为第三模式,将所述编辑内容中的目标内容的显示模式由第三模式变更为与所述第三模式相关联的第二模式。
在一个实施方式中,请求发送模块92,还用于响应于对更新后的所述目标内容的触发操作,向所述服务器发送第二数据请求,其中,所述第二数据请求包括通用语言标识和所述目标文档的标识;
相应的,所述装置还包括:描述信息展示模块,用于响应于接收到所述服务器发送的第二数据,展示与所述通用语言标识对应的通用语言描述信息。
在一个实施方式中,所述装置还包括:通用语言显示模块,用于响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言。
在一个实施方式中,通用语言显示模块,具体用于响应于对通用语言的插入操作,展示第一操作入口,所述第一操作入口用于获取目标通用语言;响应于获取到所述目标通用语言,展示第二操作入口;响应于接收到 对所述第二操作入口的操作指令,在所述目标文档中显示所述目标通用语言。
在一个实施方式中,通用语言显示模块,具体用于响应于对通用语言的插入操作,获取目标位置信息;基于所述目标位置信息,在所述目标文档中显示所述通用语言。
在一个实施方式中,所述装置还包括:添加操作响应模块,用于响应于对通用语言的添加操作,接收通用语言的名称和通用语言的描述信息;
描述信息发送模块,用于将所述通用语言的名称和通用语言的描述信息发送至所述服务器,以使所述服务器创建对应的通用语言记录。
在一个实施方式中,所述目标内容为所述编辑内容的部分内容或者全部内容。
本公开实施例所提供的文档处理装置可执行本公开任意实施例所提供的文档处理方法,具备执行数据处理方法相应的功能模块和有益效果。
图10是本公开实施例提供的一种文档处理装置的结构示意图,本实施例可适用于对文档中的通用语言进行管理的情况,所述装置可以通过软件和/或硬件的方式来实现。所述文档处理装置配置于服务器端中。
如图10所示,本实施例提供的文档处理装置主要包括:请求接收模块101、通用语言标识获取模块102和通用语言标识发送模块103。
请求接收模块101,用于接收客户端发送的第一数据请求,所述第一数据请求包括编辑内容;
通用语言标识获取模块102,用于基于所述编辑内容,获取所述编辑内容对应的通用语言标识;
通用语言标识发送模块103,用于将所述编辑内容对应的通用语言标识发送至所述客户端。
在一个实施方式中,通用语言标识获取模块102,具体用于将所述编辑内容进行分词处理,得到所述编辑内容对应的多个词元;通过所述词元在通用语言库中进行匹配,确定每个词元对应的通用语言标识。
在一个实施方式中,通用语言标识获取模块102,具体将至少一个所述编辑内容对应的数据信息进行聚合处理,确定聚合处理之后的数据信息 总量;当所述数据信息总量满足阈值条件时,获取所述编辑内容对应的通用语言标识。
在一个实施方式中,请求接收模块101,还用于所述客户端发送的第二数据请求;其中,所述第二数据请求包括通用语言标识和目标文档的标识;
相应的,所述装置还包括:查询模块,用于在通用语言库中查询所述通用语言标识对应的通用语言描述信息;
数据发送模块,用于将第二数据发送至客户端,所述第二数据包括所述通用语言标识对应的通用语言描述信息。
在一个实施方式中,所述第二数据请求还包括:用户标识;所述装置还包括:
权限确定模块,用于基于所述用户标识和所述目标文档标识,确定所述用户是否具有所述目标文档的权限;响应于所述用户具有所述目标文档的权限,执行在通用语言库中查询所述通用语言标识对应的通用语言描述信息的操作。
在一个实施方式中,所述装置还包括:描述信息接收模块,用于接收客户端发送的通用语言的名称和通用语言的描述信息;
名称查询模块,用于基于所述通用语言的名称查询通用语言库;
语言库更新模块,用于响应于未查询到所述通用语言的名称,则基于所述通用语言的名称和通用语言的描述信息更新通用语言库。
本公开实施例所提供的文档处理装置可执行本公开任意实施例所提供的文档处理方法,具备执行数据处理方法相应的功能模块和有益效果。
下面参考图11,其示出了适于用来实现本公开实施例的电子设备(例如图11中的终端设备或服务端)1100的结构示意图。本公开实施例中的终端设备可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字TV、台式计算机等等的固定终端。图11示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。
如图11所示,电子设备1100可以包括处理装置(例如中央处理器、图形处理器等)1101,其可以根据存储在只读存储器(ROM)1102中的程序或者从存储装置11011加载到随机访问存储器(RAM)1103中的程序而执行各种适当的动作和处理。在RAM 1103中,还存储有电子设备1100操作所需的各种程序和数据。处理装置1101、ROM 1102以及RAM 1103通过总线1104彼此相连。输入/输出(I/O)接口1105也连接至总线1104。
通常,以下装置可以连接至I/O接口1105:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置1108;包括例如液晶显示器(LCD)、扬声器、振动器等的输出装置11011;包括例如磁带、硬盘等的存储装置1108;以及通信装置1109。通信装置1109可以允许电子设备1100与其他设备进行无线或有线通信以交换数据。虽然图11示出了具有各种装置的电子设备1100,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置1109从网络上被下载和安装,或者从存储装置1108被安装,或者从ROM 1102被安装。在该计算机程序被处理装置1101执行时,执行本公开实施例的方法中限定的上述功能。
需要说明的是,本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质, 该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等等,或者上述的任意合适的组合。
在一些实施方式中,客户端、服务端可以利用诸如HTTP(HyperText Transfer Protocol,超文本传输协议)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(“LAN”),广域网(“WAN”),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:
响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;
基于所述编辑内容,向服务器发送第一数据请求;
响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,所述通用语言标识基于所述编辑内容确定。
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:
接收客户端发送的第一数据请求,所述第一数据请求包括编辑内容;
基于所述编辑内容,获取所述编辑内容对应的通用语言标识;
将所述编辑内容对应的通用语言标识发送至所述客户端。
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操 作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务端上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。
描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元的名称在某种情况下并不构成对该单元本身的限定。
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD)等等。
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读 储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,包括:
响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;
基于所述编辑内容,向服务器发送第一数据请求;
响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,所述通用语言标识基于所述编辑内容确定。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:
展示第一交互区域,所述第一交互区域与所述编辑内容中的目标内容相关联;
响应于接收到所述第一用户对所述第一交互区域的第一操作,将所述编辑内容中的目标内容更新为通用语言。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:
展示第二交互区域,所述第二交互区域包括与所述编辑内容中的目标内容相关联的通用语言标识;
响应于接收到所述第一用户对所述第二交互区域的第二操作,获取与所述目标内容对应的通用语言标识;
响应于接收到所述第一用户对所述第二交互区域的第三操作,将所述编辑内容中的目标内容更新为与所述通用语言标识对应的通用语言。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:
当所述编辑内容中的目标内容为多个时,展示第三交互区域,所述第三交互区域包括与多个所述目标内容相关联的通用语言信息;
响应于接收到所述第一用户对所述第三交互区域的第四操作,为所述目标内容匹配对应的通用语言标识;
响应于接收到所述第一用户对所述第三交互区域的第五操作,将所述编辑内容中的目标内容更新为与匹配的通用语言标识对应的通用语言。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,还包括:
建立所述编辑内容中的目标内容与所述通用语言标识的关联关系;
将所述关联关系存储至映射关系表中。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,还包括:
响应于当前编辑模式为第一模式,将所述编辑内容中的目标内容的显示模式由第一模式变更为第二模式;或者,
响应于当前编辑模式为第三模式,将所述编辑内容中的目标内容的显示模式由第三模式变更为与所述第三模式相关联的第二模式。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,还包括:
响应于对更新后的所述目标内容的触发操作,向所述服务器发送第二数据请求,其中,所述第二数据请求包括通用语言标识和所述目标文档的标识;
响应于接收到所述服务器发送的第二数据,展示与所述通用语言标识对应的通用语言描述信息。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,还包括:
响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言,包括:
响应于对通用语言的插入操作,展示第一操作入口,所述第一操作入口用于获取目标通用语言;
响应于获取到所述目标通用语言,展示第二操作入口;
响应于接收到对所述第二操作入口的操作指令,在所述目标文档中显示所述目标通用语言。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言,包括:
响应于对通用语言的插入操作,获取目标位置信息;
基于所述目标位置信息,在所述目标文档中显示所述通用语言。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,还包括:
响应于对通用语言的添加操作,接收通用语言的名称和通用语言的描述信息;
将所述通用语言的名称和通用语言的描述信息发送至所述服务器,以使所述服务器创建对应的通用语言记录。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述目标内容为所述编辑内容的部分内容或者全部内容。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,包括:接收客户端发送的第一数据请求,所述第一数据请求包括编辑内容;
基于所述编辑内容,获取所述编辑内容对应的通用语言标识;
将所述编辑内容对应的通用语言标识发送至所述客户端。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述基于所述编辑内容,获取所述编辑内容对应的通用语言标识,包括:
将所述编辑内容进行分词处理,得到所述编辑内容对应的多个词元;
通过所述词元在通用语言库中进行匹配,确定每个词元对应的通用语言标识。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述基于所述编辑内容,获取所述编辑内容对应的通用语言标识,包括:
将至少一个所述编辑内容对应的数据信息进行聚合处理,确定聚合处理之后的数据信息总量;
当所述数据信息总量满足阈值条件时,获取所述编辑内容对应的通用语言标识。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,还包括:
接收所述客户端发送的第二数据请求;其中,所述第二数据请求包括通用语言标识和目标文档的标识;
在通用语言库中查询所述通用语言标识对应的通用语言描述信息;
将第二数据发送至客户端,所述第二数据包括所述通用语言标识对应的通用语言描述信息。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,所述第二数据请求还包括:用户标识;所述方法还包括:
基于所述用户标识和所述目标文档标识,确定所述用户是否具有所述目标文档的权限;
响应于所述用户具有所述目标文档的权限,执行在通用语言库中查询所述通用语言标识对应的通用语言描述信息的操作。
根据本公开的一个或多个实施例,提供了一种文档处理方法、装置、设备和介质,还包括:
接收客户端发送的通用语言的名称和通用语言的描述信息;
基于所述通用语言的名称查询通用语言库;
响应于未查询到所述通用语言的名称,则基于所述通用语言的名称和通用语言的描述信息更新通用语言库。
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。
此外,虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。
尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描述的特定特征和动作仅仅是实现权利要求书的示例形式。

Claims (22)

  1. 一种文档处理方法,其特征在于,包括:
    响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;
    基于所述编辑内容,向服务器发送第一数据请求;
    响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,所述通用语言标识基于所述编辑内容确定。
  2. 根据权利要求1所述的方法,其特征在于,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:
    展示第一交互区域,所述第一交互区域与所述编辑内容中的目标内容相关联;
    响应于接收到所述第一用户对所述第一交互区域的第一操作,将所述编辑内容中的目标内容更新为通用语言。
  3. 根据权利要求1所述的方法,其特征在于,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:
    展示第二交互区域,所述第二交互区域包括与所述编辑内容中的目标内容相关联的通用语言标识;
    响应于接收到所述第一用户对所述第二交互区域的第二操作,获取与所述目标内容对应的通用语言标识;
    响应于接收到所述第一用户对所述第二交互区域的第三操作,将所述编辑内容中的目标内容更新为与所述通用语言标识对应的通用语言。
  4. 根据权利要求1所述的方法,其特征在于,所述响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,包括:
    当所述编辑内容中的目标内容为多个时,展示第三交互区域,所述第三交互区域包括与多个所述目标内容相关联的通用语言信息;
    响应于接收到所述第一用户对所述第三交互区域的第四操作,为所述目标内容匹配对应的通用语言标识;
    响应于接收到所述第一用户对所述第三交互区域的第五操作,将所述编辑内容中的目标内容更新为与匹配的通用语言标识对应的通用语言。
  5. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    建立所述编辑内容中的目标内容与所述通用语言标识的关联关系;
    将所述关联关系存储至映射关系表中。
  6. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    响应于当前编辑模式为第一模式,将所述编辑内容中的目标内容的显示模式由第一模式变更为第二模式;或者,
    响应于当前编辑模式为第三模式,将所述编辑内容中的目标内容的显示模式由第三模式变更为与所述第三模式相关联的第二模式。
  7. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    响应于对更新后的所述目标内容的触发操作,向所述服务器发送第二数据请求,其中,所述第二数据请求包括通用语言标识和所述目标文档的标识;
    响应于接收到所述服务器发送的第二数据,展示与所述通用语言标识对应的通用语言描述信息。
  8. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言。
  9. 根据权利要求8所述的方法,其特征在于,所述响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言,包括:
    响应于对通用语言的插入操作,展示第一操作入口,所述第一操作入口用于获取目标通用语言;
    响应于获取到所述目标通用语言,展示第二操作入口;
    响应于接收到对所述第二操作入口的操作指令,在所述目标文档中显示所述目标通用语言。
  10. 根据权利要求8所述的方法,其特征在于,所述响应于对通用语言的插入操作,在所述目标文档中显示所述通用语言,包括:
    响应于对通用语言的插入操作,获取目标位置信息;
    基于所述目标位置信息,在所述目标文档中显示所述通用语言。
  11. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    响应于对通用语言的添加操作,接收通用语言的名称和通用语言的描述信息;
    将所述通用语言的名称和通用语言的描述信息发送至所述服务器,以使所述服务器创建对应的通用语言记录。
  12. 根据权利要求1所述的方法,其特征在于,所述目标内容为所述编辑内容的部分内容或者全部内容。
  13. 一种文档处理方法,其特征在于,包括:
    接收客户端发送的第一数据请求,所述第一数据请求包括编辑内容;
    基于所述编辑内容,获取所述编辑内容对应的通用语言标识;
    将所述编辑内容对应的通用语言标识发送至所述客户端。
  14. 根据权利要求13所述的方法,其特征在于,所述基于所述编辑内容,获取所述编辑内容对应的通用语言标识,包括:
    将所述编辑内容进行分词处理,得到所述编辑内容对应的多个词元;
    通过所述词元在通用语言库中进行匹配,确定每个词元对应的通用语言标识。
  15. 根据权利要求13所述的方法,所述基于所述编辑内容,获取所述编辑内容对应的通用语言标识,包括:
    将至少一个所述编辑内容对应的数据信息进行聚合处理,确定聚合处理之后的数据信息总量;
    当所述数据信息总量满足阈值条件时,获取所述编辑内容对应的通用语言标识。
  16. 根据权利要求13所述的方法,其特征在于,所述方法还包括:
    接收所述客户端发送的第二数据请求;其中,所述第二数据请求包括通用语言标识和目标文档的标识;
    在通用语言库中查询所述通用语言标识对应的通用语言描述信息;
    将第二数据发送至客户端,所述第二数据包括所述通用语言标识对应的通用语言描述信息。
  17. 根据权利要求16所述的方法,其特征在于,所述第二数据请求还 包括:用户标识;所述方法还包括:
    基于所述用户标识和所述目标文档标识,确定所述用户是否具有所述目标文档的权限;
    响应于所述用户具有所述目标文档的权限,执行在通用语言库中查询所述通用语言标识对应的通用语言描述信息的操作。
  18. 根据权利要求13所述的方法,其特征在于,所述方法还包括:
    接收客户端发送的通用语言的名称和通用语言的描述信息;
    基于所述通用语言的名称查询通用语言库;
    响应于未查询到所述通用语言的名称,则基于所述通用语言的名称和通用语言的描述信息更新通用语言库。
  19. 一种文档处理装置,其特征在于,包括:
    内容获取模块,用于响应于接收到第一用户对目标文档的编辑操作,获取编辑内容;
    请求发送模块,用于基于所述编辑内容,向服务器发送第一数据请求;
    通用语言更新模块,用于响应于接收到所述服务器发送的通用语言标识,将所述编辑内容中的目标内容更新为通用语言,所述通用语言标识基于所述编辑内容确定。
  20. 一种文档处理装置,其特征在于,包括:
    请求接收模块,用于接收客户端发送的第一数据请求,所述第一数据请求包括编辑内容;
    通用语言标识获取模块,用于基于所述编辑内容,获取所述编辑内容对应的通用语言标识;
    通用语言标识发送模块,用于将所述编辑内容对应的通用语言标识发送至所述客户端。
  21. 一种文档处理设备,其特征在于,包括:
    一个或多个处理器;
    存储器,用于存储一个或多个程序;
    当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1-18任一项所述的文档处理方法。
  22. 一种介质,其特征在于,所述介质存储有计算机程序,所述计算机程序被处理器执行时实现如权利要求1-18任一项所述的文档处理方法。
PCT/CN2022/078732 2021-03-02 2022-03-02 一种文档处理方法、装置、设备和介质 WO2022184079A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2023553218A JP2024509149A (ja) 2021-03-02 2022-03-02 ドキュメント処理方法、装置、機器及び媒体
US18/458,704 US20230409815A1 (en) 2021-03-02 2023-08-30 Document processing method and apparatus, and device and medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110232055.5 2021-03-02
CN202110232055.5A CN114997118B (zh) 2021-03-02 2021-03-02 一种文档处理方法、装置、设备和介质

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/458,704 Continuation US20230409815A1 (en) 2021-03-02 2023-08-30 Document processing method and apparatus, and device and medium

Publications (1)

Publication Number Publication Date
WO2022184079A1 true WO2022184079A1 (zh) 2022-09-09

Family

ID=83018193

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/078732 WO2022184079A1 (zh) 2021-03-02 2022-03-02 一种文档处理方法、装置、设备和介质

Country Status (4)

Country Link
US (1) US20230409815A1 (zh)
JP (1) JP2024509149A (zh)
CN (1) CN114997118B (zh)
WO (1) WO2022184079A1 (zh)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100115424A1 (en) * 2008-10-31 2010-05-06 Microsoft Corporation Web-based language translation memory compilation and application
CN104424180A (zh) * 2013-09-09 2015-03-18 佳能株式会社 文本输入方法及设备
CN106415531A (zh) * 2014-04-28 2017-02-15 谷歌公司 用于输入法编辑器的特定于场境的语言模型
CN110162798A (zh) * 2018-08-09 2019-08-23 腾讯科技(深圳)有限公司 文档翻译方法、装置、计算机可读存储介质和计算机设备
CN111857906A (zh) * 2020-06-03 2020-10-30 北京旷视科技有限公司 终端的文本显示方法、装置、系统和存储介质
CN112001153A (zh) * 2020-08-12 2020-11-27 腾讯科技(深圳)有限公司 文本处理方法、装置、计算机设备和存储介质

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2987099B2 (ja) * 1996-03-27 1999-12-06 株式会社日立国際ビジネス 文書作成支援システム及び用語辞書
US6785869B1 (en) * 1999-06-17 2004-08-31 International Business Machines Corporation Method and apparatus for providing a central dictionary and glossary server
CN100555264C (zh) * 2003-10-21 2009-10-28 国际商业机器公司 电子文档的注释方法、装置和系统
US9582503B2 (en) * 2010-09-29 2017-02-28 Microsoft Technology Licensing, Llc Interactive addition of semantic concepts to a document
CN101980184A (zh) * 2010-10-28 2011-02-23 传神联合(北京)信息技术有限公司 对文档内容进行批注的方法和系统
BR112013010284A2 (pt) * 2010-11-01 2016-09-20 Koninkl Philips Electronics Nv sistema para sugerir termos de preenchimento automático durante a inserção do texto de um relatório, estação de trabalho, método de sugerir termos de preenchimento automático durante a inserção do texto de um relatório, método de análisar uma pluralidade de relatórios e produto de programa de computador
US20170169192A1 (en) * 2014-02-20 2017-06-15 Koninklijke Philips N.V. Inserting structured content in itemized reports
CN104462046A (zh) * 2014-12-24 2015-03-25 语联网(武汉)信息技术有限公司 对文档内容进行区别批注的方法和系统
US20160283473A1 (en) * 2015-03-26 2016-09-29 Gnoetics, Inc. Method and Computer Program Product for Implementing an Identity Control System
US20170109335A1 (en) * 2015-10-16 2017-04-20 Microsoft Technology Licensing, Llc Authoring visual representations for text-based documents
US11468245B1 (en) * 2018-11-13 2022-10-11 Leaflet Corporation Method for clause generation
CN110276056B (zh) * 2019-05-28 2023-08-22 创新先进技术有限公司 文档编辑方法、装置、设备及系统
US12079569B2 (en) * 2019-06-07 2024-09-03 Microsoft Technology Licensing, Llc Document editing models and localized content management
US11443122B2 (en) * 2020-03-03 2022-09-13 Dell Products L.P. Image analysis-based adaptation techniques for localization of content presentation
CN111399954A (zh) * 2020-03-10 2020-07-10 北京字节跳动网络技术有限公司 界面交互方法、装置、存储介质及电子设备
CN111580714A (zh) * 2020-04-27 2020-08-25 广州视源电子科技股份有限公司 智能交互平板的页面编辑方法、装置、设备和存储介质
US20230274096A1 (en) * 2022-02-25 2023-08-31 Microsoft Technology Licensing, Llc Multilingual support for natural language processing applications

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100115424A1 (en) * 2008-10-31 2010-05-06 Microsoft Corporation Web-based language translation memory compilation and application
CN104424180A (zh) * 2013-09-09 2015-03-18 佳能株式会社 文本输入方法及设备
CN106415531A (zh) * 2014-04-28 2017-02-15 谷歌公司 用于输入法编辑器的特定于场境的语言模型
CN110162798A (zh) * 2018-08-09 2019-08-23 腾讯科技(深圳)有限公司 文档翻译方法、装置、计算机可读存储介质和计算机设备
CN111857906A (zh) * 2020-06-03 2020-10-30 北京旷视科技有限公司 终端的文本显示方法、装置、系统和存储介质
CN112001153A (zh) * 2020-08-12 2020-11-27 腾讯科技(深圳)有限公司 文本处理方法、装置、计算机设备和存储介质

Also Published As

Publication number Publication date
US20230409815A1 (en) 2023-12-21
CN114997118A (zh) 2022-09-02
JP2024509149A (ja) 2024-02-29
CN114997118B (zh) 2024-09-17

Similar Documents

Publication Publication Date Title
US9288285B2 (en) Recommending content in a client-server environment
WO2022127838A1 (zh) 文档处理方法、装置和电子设备
US9235636B2 (en) Presenting data in response to an incomplete query
US20140330821A1 (en) Recommending context based actions for data visualizations
WO2015010566A1 (zh) 综合信息精准搜索方法
US10614057B2 (en) Shared processing of rulesets for isolated collections of resources and relationships
US10152538B2 (en) Suggested search based on a content item
US12072878B2 (en) Search architecture for hierarchical data using metadata defined relationships
US20210019360A1 (en) Crowdsourcing-based structure data/knowledge extraction
WO2024131595A1 (zh) 数据处理方法、装置、电子设备及存储介质
US20150081718A1 (en) Identification of entity interactions in business relevant data
US11847145B2 (en) Aliased data retrieval for key-value stores
WO2024179308A1 (zh) 一种信息处理方法、装置、电子设备、计算机可读介质
CN115587575A (zh) 数据表创建方法、目标数据查询方法、装置及设备
WO2022184034A1 (zh) 一种文档处理方法、装置、设备和介质
US11567908B1 (en) Virtual storage interface
WO2024087821A1 (zh) 信息处理方法、装置和电子设备
US10262061B2 (en) Hierarchical data classification using frequency analysis
US10944756B2 (en) Access control
WO2022184079A1 (zh) 一种文档处理方法、装置、设备和介质
WO2023001276A1 (zh) 信息展示方法、装置、终端和存储介质
WO2022184013A1 (zh) 一种文档的编辑方法、装置、设备及存储介质
WO2022151835A1 (zh) 一种样例报文处理方法及装置
US20230066143A1 (en) Generating similarity scores between different document schemas
US9679010B2 (en) Methods, systems, and apparatus for search of electronic information attachments

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22762543

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2023553218

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 16.01.2024)

122 Ep: pct application non-entry in european phase

Ref document number: 22762543

Country of ref document: EP

Kind code of ref document: A1