WO2019153685A1 - Text processing method, apparatus, computer device and storage medium - Google Patents

Text processing method, apparatus, computer device and storage medium

Info

Publication number
WO2019153685A1
Authority
WO
WIPO (PCT)
Prior art keywords
term
text
target
information
processed
Prior art date
Application number
PCT/CN2018/099965
Other languages
French (fr)
Chinese (zh)
Inventor
艾明
Original Assignee
深圳壹账通智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司
Publication of WO2019153685A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/134 Hyperlinking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents
    • G06F40/117 Tagging; Marking up; Designating a block; Setting of attributes

Definitions

  • the present application relates to a text processing method, apparatus, computer device, and storage medium.
  • the traditional information sharing website platform is often filled with large texts, making it difficult for viewers to obtain key information.
  • unfamiliar terms have to be looked up through a search engine. The viewer needs to spend a lot of time reading, and the text is inconvenient to read, which reduces the viewer's interest in browsing and thereby lowers the efficiency with which the text provides information.
  • a text processing method, apparatus, computer device, and storage medium capable of improving information providing efficiency of text are provided.
  • a text processing method includes: acquiring a text to be processed; inputting the to-be-processed text into a preset text matching model to obtain a matching target term and a term position of the target term in the to-be-processed text; determining, according to the to-be-processed text, term information corresponding to the target term; converting the target term in the to-be-processed text into a hyperlink according to the term information and the term position; and, when a lookup instruction for the hyperlink sent by a terminal is received, sending the term information to the terminal.
  • a text processing apparatus comprising: a to-be-processed text acquisition module, configured to acquire a text to be processed; a target term matching module, configured to input the to-be-processed text into a preset text matching model to obtain a matching target term and a term position of the target term in the to-be-processed text; a term information determining module, configured to determine term information corresponding to the target term according to the to-be-processed text; a hyperlink generating module, configured to convert the target term in the to-be-processed text into a hyperlink according to the term information and the term position; and a term information sending module, configured to send the term information to a terminal when a lookup instruction for the hyperlink sent by the terminal is received.
  • a computer device comprising a memory and one or more processors, the memory having stored therein computer readable instructions which, when executed by the one or more processors, cause the one or more processors to perform the following steps: acquiring a text to be processed; inputting the to-be-processed text into a preset text matching model to obtain a matching target term and a term position of the target term in the to-be-processed text; determining, according to the to-be-processed text, term information corresponding to the target term; converting the target term in the to-be-processed text into a hyperlink according to the term information and the term position; and, when a lookup instruction for the hyperlink sent by a terminal is received, sending the term information to the terminal.
  • one or more non-transitory computer readable storage media storing computer readable instructions which, when executed by one or more processors, cause the one or more processors to perform the steps of: acquiring a text to be processed; inputting the to-be-processed text into a preset text matching model to obtain a matching target term and a term position of the target term in the to-be-processed text; determining, according to the to-be-processed text, term information corresponding to the target term; converting the target term in the to-be-processed text into a hyperlink according to the term information and the term position; and, when a lookup instruction for the hyperlink sent by a terminal is received, sending the term information to the terminal.
  • FIG. 1 is an application scenario diagram of a text processing method in accordance with one or more embodiments.
  • FIG. 2 is a flow diagram of a text processing method in accordance with one or more embodiments.
  • FIG. 3 is a schematic diagram of a term tree constructed from a plurality of target terms in accordance with one or more embodiments.
  • FIG. 4A is a schematic diagram of a terminal displaying entry information through a hyperlink in accordance with one or more embodiments.
  • FIG. 4B is a schematic diagram of a terminal displaying term information through a hyperlink in accordance with another embodiment.
  • FIG. 4C is a schematic diagram of a terminal displaying term information through a hyperlink in accordance with still another embodiment.
  • FIG. 5 is a block diagram showing the structure of a text processing apparatus in accordance with one or more embodiments.
  • FIG. 6 is a diagram showing the internal structure of a computer device in accordance with one or more embodiments.
  • the terminal 102 communicates with the server 104 over a network.
  • the terminal 102 can send the to-be-processed text to the server 104 for storage; the server 104 can send the term information to the terminal 102 or the like upon receiving the access instruction for the hyperlink sent by the terminal 102.
  • the terminal 102 can be, but is not limited to, various personal computers, notebook computers, smart phones, tablets, and portable wearable devices, and the server 104 can be implemented with a stand-alone server or a server cluster composed of a plurality of servers.
  • a text processing method is provided. Taking its application to the server in FIG. 1 as an example, the method includes the following steps:
  • Step 202: Acquire a text to be processed.
  • the text to be processed refers to text on which text processing needs to be performed.
  • the text to be processed may be composed of one or more of Chinese characters, English letters, punctuation marks, and Arabic numerals, but is not limited thereto.
  • the text to be processed may be text pre-stored in the database, and may also be text sent by the terminal, and may also be text collected on a preset webpage through a text collection script.
  • Step 204: Input the text to be processed into a preset text matching model to obtain a matching target term and a term position of the target term in the to-be-processed text.
  • a text matching model is a model used to detect one or more target terms in a text.
  • the target term refers to the terms in the text to be processed that need to be matched.
  • the term position refers to the position of the target term in the text to be processed. For example, for the to-be-processed text "Sky in Shenzhen today", the target term output by the preset text matching model can be "Shenzhen", with the term position "3~4".
  • the target term may be a POI (Point of Interest). POI is a term in geographic information systems and refers to all geographic objects that can be abstracted as points, especially geographic entities closely related to people's lives (such as schools, banks, restaurants, gas stations, hospitals, and supermarkets). POI terms in various fields, including abbreviations, full names, and nicknames, can be collected, classified, and stored in a database. For example, POI terms for the tourism industry can be collected, and Shanghai's abbreviation "Hu" and nickname "Shencheng" can be stored. After the term tree is constructed from all POI terms, the POI terms contained in the to-be-processed text can be detected against the term tree by the execution functions, and the positions of the POI terms can be determined.
  • Step 206: Determine term information corresponding to the target term according to the to-be-processed text.
  • the term information corresponding to the target term may be acquired.
  • the term information includes, but is not limited to, a combination of one or more of text information, picture information, audio information, and the like for interpreting the term, and may also be product information related to the target term.
  • the term information corresponding to the target term may be selected according to the type of the text to be processed.
  • the type of the text to be processed may be determined according to a tag of the text to be processed, or according to the proportion of target terms of the same topic category in the text to be processed. For example, when the text to be processed is a travel guide, the target term may be the name of an attraction, the term information may be an explanation of the attraction, and related product information such as an attraction ticket purchase link and travel insurance may also be provided.
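  • A minimal sketch of the proportion-based approach described above, assuming a lookup table from target term to topic category is available. The table contents and the function name `infer_text_type` are illustrative assumptions, not part of the application.

```python
from collections import Counter

# Hypothetical term-to-category table; the entries are illustrative only.
TERM_CATEGORY = {
    "Oriental Pearl": "travel",
    "Century Park": "travel",
    "taxi": "traffic",
}

def infer_text_type(target_terms, term_category=TERM_CATEGORY):
    """Return (category, share) for the topic category that covers the
    largest proportion of the matched target terms, or None if no term
    has a known category."""
    counts = Counter(term_category[t] for t in target_terms if t in term_category)
    if not counts:
        return None
    category, hits = counts.most_common(1)[0]
    return category, hits / sum(counts.values())

result = infer_text_type(["Oriental Pearl", "Century Park", "taxi"])
```

  • Here two of the three matched terms belong to the hypothetical "travel" category, so the text would be treated as travel-related and travel-oriented term information could be selected.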
  • Step 206 may include: determining whether the target term corresponds to term information of a plurality of topic categories; if so, determining the topic category of the target term according to the term position; and using the term information belonging to the determined topic category as the term information corresponding to the target term.
  • each target term may also correspond to multiple pieces of term information under the same topic category. After the topic category of the to-be-processed text is determined, the matching probability between each piece of term information and the to-be-processed text may also be calculated.
  • the keywords in the term information are matched against the keywords in the text to be processed, and the term information with the largest number of matching keywords is used as the term information corresponding to the target term.
  • for example, if the to-be-processed text contains the target term "Talent Park" and that term can correspond to term information of parks of that name in multiple cities, the term information of the multiple "Talent Parks" can be obtained and compared in this way.
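  • The keyword-count selection described above can be sketched as follows. The candidate schema (`id`, `keywords`) and the sample data are assumptions made for illustration; the application does not specify how term information is stored.

```python
def pick_term_info(candidates, text_keywords):
    """Return the candidate whose keywords overlap most with the keywords
    of the to-be-processed text. On a tie, the first candidate wins."""
    text_keywords = set(text_keywords)
    return max(candidates, key=lambda c: len(text_keywords & set(c["keywords"])))

# Hypothetical term information for two parks sharing the same name.
candidates = [
    {"id": "talent-park-city-a", "keywords": {"city-a", "bay", "bridge"}},
    {"id": "talent-park-city-b", "keywords": {"city-b", "lake", "museum"}},
]
best = pick_term_info(candidates, ["weekend", "lake", "museum", "walk"])
```

  • A real system would likely normalise keywords and handle the case where no candidate matches at all; both are omitted here for brevity.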
  • Step 208: Convert the target term in the to-be-processed text into a hyperlink according to the term information and the term position.
  • a hyperlink is a connection between a target term and a target content.
  • the target content may be another webpage, or may be a different location on the same webpage, or may be a term information such as a picture, a text, an email address, a file, etc., but is not limited thereto.
  • a hyperlink is an object. After a hyperlink is generated based on the term information, if a click operation or a mouse hover operation or the like is performed on the link, it is equivalent to instructing the browser to display the term information having a connection relationship with the hyperlink. After the target term is converted into a hyperlink according to the term information, the hyperlink may have a connection relationship with the term information.
  • the terminal can asynchronously retrieve the corresponding term information from the server database through a JS script.
  • the term position includes a starting position and an ending position, and converting the target term in the to-be-processed text into a hyperlink according to the term information and the term position includes: inserting a start tag, according to the term information, at the starting position corresponding to the target term; inserting an end tag at the ending position corresponding to the target term; and converting the target term into a hyperlink according to the start tag and the end tag, where the hyperlink is used to connect to the term information.
  • the starting position is the position immediately before the first character of the target term in the text to be processed, and the ending position is the position immediately after the last character of the target term in the text to be processed.
  • the target term can be converted to a hyperlink by a hypertext markup language, including but not limited to anchor tags, bold tags, font tags, and the like.
  • the target term "Oriental Mandarin Hotel" in the to-be-processed text will be displayed in the form of a hyperlink.
  • "XXX" may be the link of the webpage to jump to when the terminal detects a click operation or a hovering operation on "Oriental Mandarin Hotel".
  • the webpage link can be generated from the term information corresponding to the target term "Oriental Mandarin Hotel".
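  • As a hedged illustration of the start-tag and end-tag insertion described above, the sketch below wraps the target term in an HTML anchor tag. The function name `to_hyperlink` and the sample sentence are assumptions; "XXX" stands for the link placeholder used in the text.

```python
from html import escape

def to_hyperlink(text, start, end, href):
    """Wrap text[start:end] in an anchor tag.

    start is the offset before the first character of the target term and
    end is the offset after its last character, as in the term position."""
    start_tag = '<a href="{}">'.format(escape(href, quote=True))
    end_tag = "</a>"
    return (escape(text[:start]) + start_tag
            + escape(text[start:end]) + end_tag + escape(text[end:]))

text = "Stay at Oriental Mandarin Hotel tonight"
term = "Oriental Mandarin Hotel"
start = text.index(term)
linked = to_hyperlink(text, start, start + len(term), "XXX")
```

  • When several target terms occur in one text, inserting the tags from the last term to the first keeps the earlier offsets valid.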
  • Step 210: When a lookup instruction for the hyperlink sent by the terminal is received, send the term information to the terminal.
  • the lookup instruction refers to an instruction generated by the terminal according to a detected lookup operation acting on the hyperlink.
  • lookup operations include, but are not limited to, click operations and hover operations.
  • the server may obtain the term information having a connection relationship with the hyperlink from the database, and send the term information to the terminal, so that the terminal can display the term information.
  • a term identifier can be preset for each piece of term information, and the target term can be converted into a hyperlink according to the term identifier.
  • the corresponding term information may then be looked up in the database by the term identifier and sent to the terminal.
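  • A minimal sketch of the identifier-based lookup, assuming the lookup instruction carries the term identifier embedded in the hyperlink. The in-memory dictionary stands in for the server database, and the field names and identifier value are hypothetical.

```python
# Hypothetical stand-in for the server database, keyed by term identifier.
TERM_DB = {
    "t001": {"term": "Oriental Mandarin Hotel", "text": "placeholder explanation"},
}

def handle_lookup(instruction, db=TERM_DB):
    """Return the term information for the identifier carried by the
    lookup instruction, or None if the identifier is unknown."""
    return db.get(instruction.get("term_id"))

info = handle_lookup({"term_id": "t001"})
```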
  • in the above text processing method, the target term existing in the to-be-processed text and the term position of the target term are obtained by the preset text matching model.
  • after the term information corresponding to the target term is determined, the target term can be converted into a hyperlink according to the term information, so that the target term has a connection relationship with the term information.
  • when a lookup instruction for the hyperlink is received, the term information is sent to the terminal, so that the viewer can obtain the term information of the target term in time while reading the to-be-processed text.
  • determining the term information corresponding to the target term according to the to-be-processed text improves the accuracy of the term information provided and saves the viewer the time of querying the target term, thereby improving the information providing efficiency of the text.
  • although the steps in the flowchart of FIG. 2 are displayed sequentially as indicated by the arrows, these steps are not necessarily performed in the order indicated by the arrows. Unless explicitly stated herein, the execution of these steps is not strictly limited in order, and the steps may be performed in other orders. Moreover, at least some of the steps in FIG. 2 may include a plurality of sub-steps or stages, which are not necessarily performed at the same time but may be executed at different times; the order of execution of these sub-steps or stages is also not necessarily sequential, and they may be performed in turn or alternately with other steps or with at least a portion of the sub-steps or stages of other steps.
  • the text matching model can be constructed by a preset term tree and an execution function.
  • the preset text matching model can be an AC (Aho-Corasick) automaton: multiple target terms are built into a finite state automaton by the AC algorithm, so that one or more target terms can be matched in a single scan of the text.
  • the term tree, also known as a trie (dictionary tree), can be constructed from multiple target terms, for example {Oriental Mandarin Hotel; Oriental Pearl; Oriental Oasis}.
  • the execution functions may include a goto function, a failure function, and an output function.
  • the goto function gives the next state to transfer to when the next input character of the text to be processed is matched successfully; the failure function gives the next state to transfer to when the next input character of the text to be processed fails to match; the output function gives the output produced when a node matches a target term.
  • each number is a state node, and each character of the input to-be-processed text causes a transfer from the current state node to the next state node. For example, if the text to be processed is “coming to the Oriental Pearl Hotel”, the text can be matched to node 0, node 1, node 2, node 7, and node 8 in turn by the goto function.
  • node 8 can be pointed to node 4 by a failure function, and node 5 and node 6 can continue to be matched. Since the node 6 is a leaf node, indicating that the term matching is successful, the matching target term "Oriental Pearl Hotel" is output through the output function.
  • a leaf node is a node in the entry tree that has no children.
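  • The goto, failure, and output functions described above can be sketched as a compact Aho-Corasick automaton. This is a generic textbook construction under the assumption of character-level matching, not the applicant's implementation; state numbers will not correspond to the node numbers of FIG. 3, and the names `build_automaton` and `find_terms` are illustrative.

```python
from collections import deque

def build_automaton(terms):
    """Build goto, failure and output tables from the target terms."""
    goto = [{}]      # goto[state][char] -> next state (the term tree)
    fail = [0]       # fail[state] -> state to fall back to on a mismatch
    output = [[]]    # output[state] -> terms that end at this state
    for term in terms:
        state = 0
        for ch in term:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                output.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        output[state].append(term)
    queue = deque(goto[0].values())  # depth-1 states fall back to the root
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            output[nxt] = output[nxt] + output[fail[nxt]]
    return goto, fail, output

def find_terms(text, automaton):
    """Scan the text once and return (term, start, end) for every match."""
    goto, fail, output = automaton
    state, matches = 0, []
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]          # failure function
        state = goto[state].get(ch, 0)   # goto function
        for term in output[state]:       # output function
            matches.append((term, i + 1 - len(term), i + 1))
    return matches

automaton = build_automaton(["Oriental Mandarin Hotel", "Oriental Pearl", "Oriental Oasis"])
matches = find_terms("Visit the Oriental Pearl today", automaton)
```

  • Each match carries a start offset (before the first character) and an end offset (after the last character), which corresponds to the term position used when the hyperlink tags are inserted.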
  • the longest target term may also be taken as the matching target term according to the positions of the matched state nodes.
  • for example, when one match has the term position "0~6" and a shorter match (ending in "Hua") has the term position "0~4", then since "0~6" is longer than "0~4", "Oriental Mandarin Hotel" can be selected as the matching target term.
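  • A small sketch of the longest-match rule, assuming matches are given as (term, start, end) tuples with offsets counted in characters. The helper name `keep_longest` and the English sample offsets are illustrative; the "0~6" and "0~4" positions in the text refer to the original-language terms.

```python
def keep_longest(matches):
    """Among overlapping matches, keep only the longest one."""
    chosen = []
    # Sort by start offset, longest first among matches with the same start.
    for term, start, end in sorted(matches, key=lambda m: (m[1], m[1] - m[2])):
        if chosen and start < chosen[-1][2]:
            # Overlaps the previously kept match: keep whichever is longer.
            if end - start > chosen[-1][2] - chosen[-1][1]:
                chosen[-1] = (term, start, end)
            continue
        chosen.append((term, start, end))
    return chosen

kept = keep_longest([
    ("Oriental Mandarin", 0, 17),
    ("Oriental Mandarin Hotel", 0, 23),
])
```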
  • a statement keyword is a keyword in the preceding statement or the subsequent statement that reflects the topic category; statement keywords can be obtained by performing Chinese word segmentation on the preceding statement and the subsequent statement.
  • keyword databases of different topic categories may be preset for extracting statement keywords: the preceding and subsequent statements are matched against the keywords in the keyword databases, and the matched keywords are used as the statement keywords.
  • the number of matched statement keywords in each topic category is counted, and the topic category with the largest number of matches is used as the topic category to which the statement keywords belong.
  • for example, if the text to be processed is “Departs on the hour, journey 20 minutes, the fare is 5 Ma Yuan, taxi needs 30 Ma Yuan”, the preceding statement “Journey 20 minutes” and the subsequent statement “Taxi needs 30 Ma Yuan” of the target term “Ma Yuan” can be extracted, yielding the statement keywords "journey" and "taxi". Since "journey" and "taxi" are keywords of the traffic topic category, it can be determined that the topic category of the target term "Ma Yuan" is also the traffic topic category.
  • the term information corresponding to the traffic subject category in the term information corresponding to the “Ma Yuan” can be obtained.
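  • The keyword-counting step can be sketched as below. Whitespace splitting stands in for Chinese word segmentation, and the keyword database contents are invented for illustration; both are assumptions rather than details from the application.

```python
# Hypothetical keyword databases, one per topic category.
KEYWORD_DB = {
    "traffic": {"journey", "taxi", "fare", "bus"},
    "finance": {"exchange", "rate", "bank"},
}

def topic_of(preceding, following, keyword_db=KEYWORD_DB):
    """Return the topic category with the most statement keywords in the
    preceding and subsequent statements, or None if nothing matches."""
    words = (preceding + " " + following).lower().split()
    best_category, best_hits = None, 0
    for category, keywords in keyword_db.items():
        hits = sum(1 for w in words if w in keywords)
        if hits > best_hits:
            best_category, best_hits = category, hits
    return best_category

category = topic_of("Journey 20 minutes", "Taxi needs 30 Ma Yuan")
```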
  • sending the term information to the terminal includes: transmitting the term information to the terminal; the term information is used to generate a corresponding term popup box at the terminal.
  • the term popup box refers to a popup box used by the terminal to display the term information corresponding to the target term.
  • the term popup box automatically determines where to appear based on the position of the target term in the browser window. For example, the term popup box can be displayed directly above the target term, or it can be displayed at the side, top, or bottom of the window.
  • FIG. 4A is a schematic diagram of a terminal displaying term information through a hyperlink, including a target term 402 "Ma Yuan", a term popup 404A, and a cursor 406.
  • a lookup instruction may be generated and sent to the server, and the server may look up the term information of the target term 402 "Ma Yuan" according to the lookup instruction.
  • when "Ma Yuan" corresponds to term information of a plurality of topic categories, the preceding statement "Journey 20 minutes" and the subsequent statement "Taxi needs 30 Ma Yuan" of the target term "Ma Yuan" can be extracted.
  • the terminal may populate and generate a term popup 404A according to the term information, and the term popup box 404A is displayed above the target term. As shown in FIG. 4B, the terminal may also populate and generate an entry popup 404B according to the entry information, and the entry popup 404B is displayed on the sidebar of the browser window to avoid occluding the text around the target entry.
  • each piece of term information corresponding to the same target term may also be tagged, and part of the term information may be selected and sent to the terminal based on the statement keywords in the preceding and subsequent statements of the target term.
  • for example, for the target term "Oriental Pearl": when the content of the to-be-processed text is "Eating in the Oriental Pearl", the term information corresponding to "Eating" can be selected, such as term information with the "Food" tag, which can be a brief introduction to restaurants near the Oriental Pearl; when the content of the to-be-processed text is "Playing the Oriental Pearl", the term information corresponding to "Play" can be selected, such as term information with the "Entertainment" tag relating to the vicinity of the Oriental Pearl.
  • the term popup box includes a resource control; after the term information is sent to the terminal, the method further includes: counting the number of resource link requests sent by the terminal within a preset duration; and generating a resource link response according to the number of requests, where the resource link response is used to instruct the terminal to change the shape of the resource control according to the number of requests.
  • a resource control is a control that encapsulates all or part of the term information.
  • the term information includes, but is not limited to, a combination of one or more of picture information, text information, webpage links, audio information, and the like. Correspondingly, the resource control may be a control for jumping to another webpage link, a control for displaying picture information, or a control for playing audio information, but is not limited thereto.
  • the server may count the number of resource link requests sent by the terminal within the preset duration, and generate a resource link response according to the number of requests.
  • the resource link response may include a shape parameter of the resource control, and the resource link response may be used to instruct the terminal to change the shape of the resource control according to the shape parameter.
  • the shape parameter can be generated according to the number of requests. When the number of requests is larger, a larger shape parameter can be generated, so the size of the resource control can be increased.
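  • One way the request count could be turned into a shape parameter is sketched below. The linear scaling rule, its constants, and the function names are assumptions chosen for illustration; the text only states that a larger number of requests can yield a larger shape parameter.

```python
def count_recent(request_times, now, window_seconds):
    """Count the requests whose timestamps fall within the preset duration."""
    return sum(1 for t in request_times if 0 <= now - t <= window_seconds)

def shape_parameter(request_count, base=1.0, step=0.1, max_scale=2.0):
    """Map a request count to a scale factor for the resource control.
    The factor grows with the count and is capped at max_scale."""
    return min(base + step * request_count, max_scale)

recent = count_recent([0, 50, 90, 100], now=100, window_seconds=60)
scale = shape_parameter(recent)
```

  • The cap keeps a very popular resource control from growing without bound inside the popup.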
  • the resource control included in the term popup box can also be instrumented with event tracking, so that the number of click operations on the resource control can be counted directly.
  • the entry popup may include a plurality of regions in which at least one of text, a picture, a resource control, and the like may be displayed.
  • the entry popup 404C includes three regions, a region 404C-a, a region 404C-b, and a region 404C-c.
  • the area 404C-a can be used to display the term interpretation text of the target term "Ma Yuan”
  • the area 404C-b can be used to display the product information related to the target item "Ma Yuan”
  • the area 404C-c can be used to display the pronunciation of the target term "Ma Yuan".
  • within the preset duration, the number of instructions b sent by the terminal for the product information in the area 404C-b and the number of instructions c sent by the terminal for the pronunciation of "Ma Yuan" in the area 404C-c are counted separately, and the proportions of the whole term popup occupied by the area 404C-b and the area 404C-c are changed according to the number of instructions b and the number of instructions c. For example, when the number of instructions b is much larger than the number of instructions c, the viewer's interest in the product information in the area 404C-b is far greater than the interest in the pronunciation of "Ma Yuan" in the area 404C-c, so the proportion of the whole term popup occupied by the area 404C-b can be increased and the proportion occupied by the area 404C-c can be reduced.
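  • A hedged sketch of one possible allocation rule: the space shared by areas 404C-b and 404C-c is split in proportion to the instruction counts b and c, with a minimum share so that neither area disappears. The proportional rule, the `min_share` value, and the function name are assumptions; the application does not prescribe a formula.

```python
def area_shares(b, c, min_share=0.2):
    """Split the space shared by the two areas in proportion to the
    instruction counts b and c, keeping each share at least min_share."""
    total = b + c
    if total == 0:
        return 0.5, 0.5  # no data yet: split evenly
    share_b = max(min_share, min(1 - min_share, b / total))
    return share_b, 1 - share_b

share_b, share_c = area_shares(90, 10)
```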
  • the shape of the resource control is changed according to the number of requests for the resource link corresponding to the resource control in the term popup box.
  • a text processing apparatus 500 including: a to-be-processed text acquisition module 502, configured to acquire a text to be processed; a target term matching module 504, configured to input the to-be-processed text into a preset text matching model to obtain a matching target term and a term position of the target term in the to-be-processed text; a term information determining module 506, configured to determine term information corresponding to the target term according to the to-be-processed text; a hyperlink generating module 508, configured to convert the target term in the to-be-processed text into a hyperlink according to the term information and the term position; and a term information sending module 510, configured to send the term information to the terminal when a lookup instruction for the hyperlink sent by the terminal is received.
  • the term information determining module 506 is further configured to: determine whether the target term corresponds to term information of a plurality of topic categories; if so, determine the topic category of the target term according to the term position; and use the term information belonging to the determined topic category as the term information corresponding to the target term.
  • the term information determining module 506 is further configured to: obtain a preceding statement and a subsequent statement of the target term in the to-be-processed text according to the term position; extract the statement keywords in the preceding statement and the subsequent statement; and determine the topic category to which the statement keywords belong, using the determined topic category as the topic category of the target term.
  • the term position includes a starting position and an ending position; the hyperlink generating module 508 is further configured to: insert a start tag, according to the term information, at the starting position corresponding to the target term; insert an end tag at the ending position corresponding to the target term; and convert the target term into a hyperlink according to the start tag and the end tag, where the hyperlink is used to connect to the term information.
  • the term information sending module 510 is further configured to send the term information to the terminal; the term information is used to generate a corresponding term popup box at the terminal.
  • the term popup box includes a resource control; the term information sending module 510 is further configured to: count the number of resource link requests sent by the terminal within the preset duration; and generate a resource link response according to the number of requests, where the resource link response is used to instruct the terminal to change the shape of the resource control according to the number of requests.
  • the various modules in the text processing apparatus described above may be implemented in whole or in part by software, hardware, and combinations thereof. Each of the above modules may be embedded in or independent of the processor in the computer device, or may be stored in a memory in the computer device in a software form, so that the processor invokes the operations corresponding to the above modules.
  • the text processing apparatus described above may be embodied in the form of a computer readable instruction that can be executed on a computer device as shown in FIG.
  • a computer device which may be a server, and its internal structure diagram may be as shown in FIG. 6.
  • the computer device includes a processor, memory, network interface, and database connected by a system bus.
  • the processor of the computer device is used to provide computing and control capabilities.
  • the memory of the computer device includes a non-transitory computer readable storage medium and an internal memory.
  • the non-transitory computer readable storage medium stores an operating system, computer readable instructions, and a database.
  • the internal memory provides an environment for the operation of an operating system and computer readable instructions in a non-transitory computer readable storage medium.
  • the database of the computer device is used to store data such as text to be processed, a term tree, and the like.
  • the network interface of the computer device is used to communicate with an external terminal via a network connection.
  • the computer device can receive the pending text sent by the terminal, and can also send the term information to the terminal.
  • the computer readable instructions are executed by a processor to implement a text processing method.
  • FIG. 6 is only a block diagram of the part of the structure related to the solution of the present application and does not constitute a limitation on the computer device to which the solution is applied. A specific computer device may include more or fewer components than those shown in the figure, combine some components, or have a different arrangement of components.
  • a computer device comprising a memory and one or more processors, the memory having stored therein computer readable instructions that, when executed by the one or more processors, implement the steps of the text processing method provided in any of the embodiments of the present application.
  • one or more non-transitory computer readable storage media storing computer readable instructions that, when executed by one or more processors, cause the one or more processors to implement the steps of the text processing method provided in any one of the embodiments of the present application.
  • Non-volatile memory can include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory can include random access memory (RAM) or external cache memory.
  • RAM is available in a variety of forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM).

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A text processing method, comprising: acquiring text to be processed; inputting the text to be processed into a predetermined text matching model, so as to obtain a matched target entry and a position of the target entry in the text to be processed; determining, according to the text to be processed, entry information corresponding to the target entry; converting the target entry, in the text to be processed, into a hyperlink according to the entry information and said position of the entry; and upon receipt of a hyperlink reading instruction sent from a terminal, sending the entry information to the terminal.

Description

Text processing method, apparatus, computer device, and storage medium
This application claims priority to Chinese Patent Application No. 2018101245521, entitled "Text Processing Method, Apparatus, Computer Device, and Storage Medium" and filed with the Chinese Patent Office on February 7, 2018, the entire contents of which are incorporated herein by reference.
Technical Field
The present application relates to a text processing method, apparatus, computer device, and storage medium.
Background
As material living standards rise, people pursue cultural products and cultural services more actively and are increasingly willing to share their experiences and impressions with the public. This has given rise to many food-review and travel-guide websites, where users can post text, pictures, and other information for others to read.
However, traditional information-sharing platforms are often filled with long passages of text, which makes it hard for readers to pick out key information. When a reader meets an unfamiliar industry term, the reader must look it up separately through a search engine. Reading therefore takes a great deal of time and is inconvenient, which reduces the reader's interest and makes the text inefficient at providing information.
Summary
According to various embodiments disclosed in this application, a text processing method, apparatus, computer device, and storage medium capable of improving the information providing efficiency of text are provided.
A text processing method includes: acquiring text to be processed; inputting the text to be processed into a preset text matching model to obtain a matched target term and the term position of the target term in the text to be processed; determining, according to the text to be processed, term information corresponding to the target term; converting the target term in the text to be processed into a hyperlink according to the term information and the term position; and, when a lookup instruction for the hyperlink sent by a terminal is received, sending the term information to the terminal.
A text processing apparatus includes: a to-be-processed text acquisition module configured to acquire text to be processed; a target term matching module configured to input the text to be processed into a preset text matching model to obtain a matched target term and the term position of the target term in the text to be processed; a term information determining module configured to determine, according to the text to be processed, term information corresponding to the target term; a hyperlink generation module configured to convert the target term in the text to be processed into a hyperlink according to the term information and the term position; and a term information sending module configured to send the term information to a terminal when a lookup instruction for the hyperlink sent by the terminal is received.
A computer device includes a memory and one or more processors. The memory stores computer readable instructions that, when executed by the one or more processors, cause the one or more processors to perform the following steps: acquiring text to be processed; inputting the text to be processed into a preset text matching model to obtain a matched target term and the term position of the target term in the text to be processed; determining, according to the text to be processed, term information corresponding to the target term; converting the target term in the text to be processed into a hyperlink according to the term information and the term position; and, when a lookup instruction for the hyperlink sent by a terminal is received, sending the term information to the terminal.
One or more non-volatile computer readable storage media store computer readable instructions that, when executed by one or more processors, cause the one or more processors to perform the following steps: acquiring text to be processed; inputting the text to be processed into a preset text matching model to obtain a matched target term and the term position of the target term in the text to be processed; determining, according to the text to be processed, term information corresponding to the target term; converting the target term in the text to be processed into a hyperlink according to the term information and the term position; and, when a lookup instruction for the hyperlink sent by a terminal is received, sending the term information to the terminal.
Details of one or more embodiments of this application are set forth in the drawings and description below. Other features and advantages of this application will become apparent from the description, the drawings, and the claims.
Brief Description of the Drawings
To describe the technical solutions in the embodiments of this application or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Evidently, the drawings described below show only some embodiments of this application, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1 is a diagram of an application scenario of the text processing method according to one or more embodiments.
FIG. 2 is a schematic flowchart of the text processing method according to one or more embodiments.
FIG. 3 is a schematic diagram of a term tree formed from multiple target terms according to one or more embodiments.
FIG. 4A is a schematic diagram of a terminal displaying term information through a hyperlink according to one or more embodiments.
FIG. 4B is a schematic diagram of a terminal displaying term information through a hyperlink according to another one or more embodiments.
FIG. 4C is a schematic diagram of a terminal displaying term information through a hyperlink according to still another one or more embodiments.
FIG. 5 is a structural block diagram of a text processing apparatus according to one or more embodiments.
FIG. 6 is a diagram of the internal structure of a computer device according to one or more embodiments.
Detailed Description
To make the technical solutions and advantages of this application clearer, the application is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain this application and do not limit it.
The text processing method provided in this application can be applied in the application environment shown in FIG. 1. The terminal 102 communicates with the server 104 over a network. For example, the terminal 102 may send text to be processed to the server 104 for storage, and the server 104 may send term information to the terminal 102 upon receiving a lookup instruction for a hyperlink sent by the terminal 102. The terminal 102 may be, but is not limited to, a personal computer, notebook computer, smartphone, tablet, or portable wearable device. The server 104 may be implemented as a standalone server or as a cluster of multiple servers.
In one embodiment, as shown in FIG. 2, a text processing method is provided. The method is described using its application to the server in FIG. 1 as an example, and includes the following steps:
Step 202: acquire text to be processed.
Text to be processed is text on which text processing needs to be performed. It may consist of one or more of Chinese characters, English letters, punctuation marks, Arabic numerals, and the like, but is not limited to these. The text to be processed may be text pre-stored in a database, text sent by a terminal, or text collected from preset web pages by a text collection script.
Step 204: input the text to be processed into a preset text matching model to obtain a matched target term and the term position of the target term in the text to be processed.
A text matching model is a model for detecting one or more target terms in text. A target term is a term in the text to be processed that needs to be matched. The term position is the position of the target term in the text to be processed. For example, for the text to be processed "今天深圳的天气晴朗" ("The weather in Shenzhen is clear today"), the preset text matching model may output the target term "深圳" ("Shenzhen") with the position "3~4", that is, the third and fourth characters of the text.
In one embodiment, the target term may be a POI term. POI (Point of Interest) is a term from geographic information systems that refers to any geographic object that can be abstracted as a point, especially geographic entities closely related to daily life, such as schools, banks, restaurants, gas stations, hospitals, and supermarkets. POI terms from various fields may be collected, including abbreviations, full names, and alternative names, and may be stored by category in separate databases. For example, POI terms for the tourism industry may be collected, and both the abbreviation of Shanghai, "沪" (Hu), and its alternative name "申城" (Shencheng) may be stored. After a term tree is built from all POI terms, execution functions can use the term tree to detect the POI terms contained in the text to be processed and determine their term positions.
Step 206: determine, according to the text to be processed, term information corresponding to the target term.
After a matched target term is detected in the text to be processed, the term information corresponding to that target term can be acquired. Term information includes, but is not limited to, one or a combination of text, pictures, and audio that explain the term, and may also be product information related to the target term. The term information corresponding to the target term may be selected according to the type of the text to be processed. The type of the text to be processed may be determined from the tags of the text or from the proportion of target terms in the text that belong to the same topic category. For example, when the text to be processed is a travel guide, the target term may be the name of a scenic spot, and the term information may be an explanation of the scenic spot or related product information such as a ticket purchase link or travel insurance.
In one embodiment, each target term may correspond to term information in multiple topic categories. For example, "马元" (Ma Yuan) may denote the person Ma Yuan in the person topic category or the Malaysian currency unit in the traffic topic category. The topic category of the target term therefore needs to be determined before the term information is acquired. Step 206 may include: determining whether the target term corresponds to term information in multiple topic categories; if so, determining the topic category of the target term according to the term position; and using the term information that belongs to the determined topic category as the term information corresponding to the target term.
In one embodiment, each target term may also correspond to several pieces of term information within the same topic category. In that case, after the topic category of the text to be processed is determined, the matching probability between each piece of term information and the text to be processed may be calculated. For example, the keywords in each piece of term information are matched against the keywords in the text to be processed, and the term information with the largest number of matching keywords is used as the term information corresponding to the target term. Suppose the text to be processed contains the target term "人才公园" ("Talent Park"), and that term corresponds to term information for a "Talent Park" in each of several cities. Each of these pieces of term information can be matched against the text to be processed. When both the text to be processed and the term information for the "Talent Park" in Shenzhen are found to contain "深圳" ("Shenzhen") multiple times, the term information for the Shenzhen "Talent Park" can be used as the term information corresponding to the target term.
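The keyword-overlap selection described above can be sketched as follows. This is a minimal illustration, not the application's implementation: the function name `pick_term_info`, the record layout, and the sample keywords are all assumptions made for the example, and keyword extraction is taken as already done.

```python
def pick_term_info(text_keywords, candidates):
    """Return the candidate term information sharing the most keywords with the text.

    `candidates` is a list of dicts with a "keywords" list (hypothetical layout).
    """
    text_set = set(text_keywords)
    return max(candidates, key=lambda c: len(text_set & set(c["keywords"])))


# Hypothetical term information for two parks that share the same name.
candidates = [
    {"city": "Shenzhen", "keywords": ["深圳", "南山", "公园"]},
    {"city": "Hangzhou", "keywords": ["杭州", "滨江", "公园"]},
]

# Keywords assumed to have been extracted from the text to be processed.
best = pick_term_info(["深圳", "公园", "跑步"], candidates)
```

Counting shared keywords is only one way to approximate the "matching probability"; a weighted score would fit the same interface.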
Step 208: convert the target term in the text to be processed into a hyperlink according to the term information and the term position.
A hyperlink is a connection between the target term and target content. The target content may be another web page, a different position on the same web page, or term information such as a picture, text, an email address, or a file, but is not limited to these. A hyperlink is an object: once it has been generated from the term information, performing a click or mouse-hover operation on the link instructs the browser to display the term information connected to that hyperlink. After the target term is converted into a hyperlink according to the term information, the hyperlink is connected to the term information. For example, after a web page has been generated from the text to be processed and the target term in the text has been converted into a hyperlink, the terminal can, on detecting an operation on the hyperlink, asynchronously fetch the corresponding term information from the server database through a JS script.
In one embodiment, the term position includes a start position and an end position. Converting the target term in the text to be processed into a hyperlink according to the term information and the term position includes: inserting a start tag at the start position of the target term according to the term information; inserting an end tag at the end position of the target term; and converting the target term into a hyperlink according to the start tag and the end tag, the hyperlink being used to connect to the term information. The start position is the position immediately before the first character of the target term in the text to be processed, and the end position is the position immediately after its last character. The target term may be converted into a hyperlink using Hypertext Markup Language; the start and end tags include, but are not limited to, anchor tags, bold tags, and font tags.
For example, a start tag and an end tag can be inserted before and after the target term "东方文华酒店" (Mandarin Oriental Hotel), giving <a href="XXX">东方文华酒店</a>. The web page then displays the target term "东方文华酒店" in the text to be processed as a hyperlink. "XXX" may be the web page link that is opened when the terminal detects a click or hover operation on "东方文华酒店". This link can be generated from the term information corresponding to the target term "东方文华酒店".
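The tag insertion above can be sketched in a few lines. This is a minimal sketch under stated assumptions: the function name `link_terms`, the sample sentence, and the link target `/terms/1001` are hypothetical, matches are assumed not to overlap, and positions follow the convention above (start is the boundary before the first character, end is the boundary after the last).

```python
import html


def link_terms(text, matches):
    """Wrap each matched span of `text` in an anchor tag.

    `matches` is a list of (start, end, href) tuples with non-overlapping spans.
    """
    out = []
    cursor = 0
    for start, end, href in sorted(matches):
        out.append(html.escape(text[cursor:start]))        # text before the term
        out.append('<a href="{}">{}</a>'.format(
            html.escape(href, quote=True),                  # start tag with link
            html.escape(text[start:end])))                  # term, then end tag
        cursor = end
    out.append(html.escape(text[cursor:]))                  # remaining text
    return "".join(out)


# "入住东方文华酒店": the term occupies boundaries 2 to 8.
linked = link_terms("入住东方文华酒店", [(2, 8, "/terms/1001")])
```

Building the output in one left-to-right pass avoids the offset shifts that would occur if tags were inserted into the string one match at a time.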
Step 210: when a lookup instruction for the hyperlink sent by the terminal is received, send the term information to the terminal.
A lookup instruction is an instruction that the terminal generates when it detects a lookup operation on the hyperlink. Lookup operations include, but are not limited to, click operations and hover operations. After receiving the lookup instruction, the server can retrieve from the database the term information connected to the hyperlink and send it to the terminal, so that the terminal can display the term information.
In one embodiment, because the same target term may correspond to several pieces of term information, a term identifier may be preset for each piece of term information, and the target term may be converted into a hyperlink according to the term identifier. When a lookup instruction for the hyperlink sent by the terminal is received, the corresponding term information can be found in the database by its term identifier and sent to the terminal.
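As a rough illustration of identifier-based lookup, the sketch below uses an in-memory dictionary in place of the server database. The identifiers, the `/term-info?id=` URL pattern, and the function names are assumptions made for the example and do not come from the application.

```python
# Hypothetical stand-in for the server database, keyed by term identifier.
TERM_INFO_BY_ID = {
    "mayuan-person": {"term": "马元", "category": "person"},
    "mayuan-traffic": {"term": "马元", "category": "traffic"},
}


def make_link(term, term_id):
    """Build a hyperlink whose target carries the term identifier."""
    return '<a href="/term-info?id={}">{}</a>'.format(term_id, term)


def handle_lookup(term_id):
    """Return the term information for a lookup instruction, or None if unknown."""
    return TERM_INFO_BY_ID.get(term_id)
```

Because the identifier, not the term text, is embedded in the link, two occurrences of the same term can resolve to different term information.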
In the text processing method above, after the text to be processed is acquired, the target terms present in the text and their term positions are obtained through the preset text matching model. After the term information corresponding to a target term is determined, the target term can be converted into a hyperlink according to the term information, so that the target term is connected to the term information. When a lookup instruction for the hyperlink sent by the terminal is received, the term information is sent to the terminal, so that a reader of the text can obtain the term information for a target term immediately. Determining the term information according to the text to be processed also makes the provided term information more accurate and saves the reader the time of looking up the target term, which improves the information providing efficiency of the text.
It should be understood that although the steps in the flowchart of FIG. 2 are shown in sequence as indicated by the arrows, they are not necessarily performed in that order. Unless explicitly stated herein, there is no strict restriction on the order of these steps, and they may be performed in other orders. Moreover, at least some of the steps in FIG. 2 may include multiple sub-steps or stages. These sub-steps or stages are not necessarily completed at the same moment and may be performed at different times; nor are they necessarily performed one after another, and they may be performed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.
In one embodiment, the text matching model may consist of a preset term tree and execution functions. For example, the preset text matching model may be an AC (Aho-Corasick) automaton, a multi-pattern matching automaton: the AC algorithm builds multiple target terms into a finite state automaton, so that one or more target terms can be matched in a single scan of the text. As shown in FIG. 3, a term tree, also called a trie (dictionary tree), can be built from multiple target terms such as {东方文华酒店 (Mandarin Oriental Hotel); 东方明珠 (Oriental Pearl); 东方绿洲 (Oriental Oasis)}. The execution functions may include a goto function, a failure function, and an output function. The goto function gives the next state to move to when the next character of the text to be processed is input and matches; the failure function gives the next state to move to when the next character is input and fails to match; the output function gives the output produced when a target term has been matched at a node. In the term tree shown in FIG. 3, each number is a state node, and each input character of the text to be processed moves the automaton from the current state node to the next. For example, for the text to be processed "来到东方明珠酒店" ("arriving at the Oriental Pearl Hotel"), the goto function matches the text through node 0, node 1, node 2, node 7, and node 8 in turn. Because the character after "珠" is "酒", for which node 8 has no matching child node, the failure function directs node 8 to node 4, and matching continues with node 5 and node 6. Because node 6 is a leaf node, the term has been matched successfully, and the output function outputs the matched target term "东方明珠酒店". A leaf node is a node in the term tree that has no child nodes.
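For readers unfamiliar with the goto, failure, and output functions, a compact textbook Aho-Corasick automaton is sketched below. It is an illustration only: the class name `ACAutomaton` is hypothetical, failure links follow the standard definition (the longest proper suffix of the current path that is also a prefix of some term), and the state numbering and failure targets are not intended to reproduce FIG. 3. Reported positions use boundaries, with the end position after the last character.

```python
from collections import deque


class ACAutomaton:
    def __init__(self, terms):
        self.goto = [{}]    # goto[state][char] -> next state (the term tree)
        self.fail = [0]     # failure link for each state
        self.output = [[]]  # terms recognized on reaching each state
        for term in terms:
            self._add(term)
        self._build_failure_links()

    def _add(self, term):
        state = 0
        for ch in term:
            if ch not in self.goto[state]:
                self.goto.append({})
                self.fail.append(0)
                self.output.append([])
                self.goto[state][ch] = len(self.goto) - 1
            state = self.goto[state][ch]
        self.output[state].append(term)

    def _build_failure_links(self):
        # Breadth-first, so shallower states are finished before deeper ones.
        queue = deque(self.goto[0].values())
        while queue:
            state = queue.popleft()
            for ch, nxt in self.goto[state].items():
                queue.append(nxt)
                f = self.fail[state]
                while f and ch not in self.goto[f]:
                    f = self.fail[f]
                self.fail[nxt] = self.goto[f].get(ch, 0)
                # Inherit terms that end at the failure target.
                self.output[nxt] = self.output[nxt] + self.output[self.fail[nxt]]

    def search(self, text):
        """Return (term, start, end) for every match found in one scan of text."""
        state, matches = 0, []
        for i, ch in enumerate(text):
            while state and ch not in self.goto[state]:
                state = self.fail[state]          # failure function
            state = self.goto[state].get(ch, 0)   # goto function
            for term in self.output[state]:       # output function
                matches.append((term, i - len(term) + 1, i + 1))
        return matches


ac = ACAutomaton(["东方文华酒店", "东方明珠", "东方绿洲"])
found = ac.search("来到东方明珠酒店")
```

In this sketch only terms stored in the automaton are reported, so scanning "来到东方明珠酒店" yields the stored term "东方明珠" with its boundaries.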
In one embodiment, when several possible matches are detected, the longest target term may be taken as the matched target term according to the positions of the matched state nodes. For example, when the text to be processed is input into the AC automaton corresponding to the term tree shown in FIG. 3, two results may be matched: "东方文华酒店" (Mandarin Oriental Hotel) with the term position "0~6", and "东方文华" (Mandarin Oriental) with the term position "0~4". Because "0~6" is longer than "0~4", "东方文华酒店" can be selected as the matched target term.
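One way to apply the longest-match rule is a greedy pass that accepts matches from longest to shortest and rejects any match overlapping one already accepted. The sketch below assumes matches are (term, start, end) tuples with boundary positions as in the example above; the function name `keep_longest` and the greedy policy are illustrative choices, not taken from the application.

```python
def keep_longest(matches):
    """Among overlapping (term, start, end) matches, keep only the longest."""
    chosen = []
    # Longest spans first; ties are broken by the earlier start position.
    for m in sorted(matches, key=lambda m: (-(m[2] - m[1]), m[1])):
        # Accept m only if it does not overlap any accepted match.
        if all(m[2] <= c[1] or m[1] >= c[2] for c in chosen):
            chosen.append(m)
    return sorted(chosen, key=lambda m: m[1])


result = keep_longest([("东方文华", 0, 4), ("东方文华酒店", 0, 6)])
```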
In one embodiment, determining the topic category of the target term according to the term position includes: obtaining, according to the term position, the preceding sentence and the subsequent sentence of the target term in the text to be processed; extracting sentence keywords from the preceding sentence and the subsequent sentence; and determining the topic category to which the sentence keywords belong and using it as the topic category of the target term. The preceding sentence is the sentence before the term position of the target term in the text to be processed, and the subsequent sentence is the sentence after that position. They may be complete sentences ending in a period, or incomplete sentences ending in a comma, a semicolon, or the like. Sentence keywords are keywords in the preceding and subsequent sentences that reflect a topic category; they can be obtained by performing Chinese word segmentation on those sentences. Keyword databases for different topic categories may be preset for keyword extraction: the preceding and subsequent sentences are matched against the keywords in the keyword databases, and the matched keywords are used as sentence keywords. The number of matched sentence keywords is counted for each topic category, and the topic category with the largest count can be used as the topic category to which the sentence keywords belong.
For example, suppose the text to be processed is "整点出发,行程20分钟,票价5马元,出租车需要30马元" ("Departs on the hour, the trip takes 20 minutes, the fare is 5 马元, a taxi costs 30 马元"). The sentence keywords "行程" ("trip") and "出租车" ("taxi") can be extracted from the preceding sentence "行程20分钟" and the subsequent sentence "出租车需要30马元" of the target term "马元". Because "行程" and "出租车" are keywords of the traffic topic category, the topic category of the target term "马元" can be determined to be the traffic topic category as well. Accordingly, among the term information corresponding to "马元", the term information for the traffic topic category can be acquired.
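The context-based disambiguation above can be sketched as follows. Several simplifications are assumed: clauses are split on commas, semicolons, and periods with a regular expression, keywords are matched as substrings rather than through Chinese word segmentation, and the keyword database `CATEGORY_KEYWORDS` is a small hypothetical sample.

```python
import re

# Hypothetical keyword database: topic category -> keywords.
CATEGORY_KEYWORDS = {
    "traffic": {"行程", "出租车", "票价"},
    "person": {"画家", "将军", "出生"},
}


def neighbor_clauses(text, start):
    """Return the clauses before and after the clause containing index `start`."""
    clauses = [(m.start(), m.end(), m.group())
               for m in re.finditer(r"[^,，;；。]+", text)]
    idx = next(i for i, (s, e, _) in enumerate(clauses) if s <= start < e)
    before = clauses[idx - 1][2] if idx > 0 else ""
    after = clauses[idx + 1][2] if idx + 1 < len(clauses) else ""
    return before, after


def topic_category(text, start):
    """Pick the category whose keywords occur most often in the neighbor clauses."""
    before, after = neighbor_clauses(text, start)
    context = before + " " + after
    counts = {cat: sum(kw in context for kw in kws)
              for cat, kws in CATEGORY_KEYWORDS.items()}
    best = max(counts, key=counts.get)
    return best if counts[best] > 0 else None


text = "整点出发,行程20分钟,票价5马元,出租车需要30马元"
category = topic_category(text, text.index("马元"))
```

For the first occurrence of "马元", the neighbor clauses are "行程20分钟" and "出租车需要30马元", which contain two traffic keywords and no person keywords.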
在其中一个实施例中,将词条信息发送至终端,包括:将词条信息发送至终端;词条信息用于在终端生成相应的词条弹出框。词条弹出框是指终端用于显示目标词条对应的词条信息的弹出框。词条弹出框可根据目标词条在浏览器窗口的位置自动判断弹出框显示位置。比如说,可直接在目标词条上方显示词条弹出框,也可以在窗口上方下方等侧边显示词弹出框。In one of the embodiments, sending the term information to the terminal includes: transmitting the term information to the terminal; the term information is used to generate a corresponding term popup box at the terminal. The term popup box refers to a popup box used by the terminal to display the term information corresponding to the target term. The entry pop-up box automatically determines the position of the pop-up display based on the position of the target entry in the browser window. For example, you can display the item pop-up box directly above the target entry, or you can display the word pop-up box on the side of the top and bottom of the window.
FIG. 4A is a schematic diagram of a terminal displaying term information through a hyperlink, including the target term 402 "Ma Yuan" (马元), the term popup box 404A, and the cursor 406. When a click or hover operation of the cursor 406 on the target term 402 is detected, a lookup instruction may be generated and sent to the server, and the server may look up the term information corresponding to the target term 402 "Ma Yuan" according to the lookup instruction. When "Ma Yuan" is found to correspond to term information of multiple topic categories, the sentence keywords "trip" and "taxi" can be extracted from the preceding sentence "the trip takes 20 minutes" and the subsequent sentence "a taxi costs 30 Ma Yuan" of the target term. Because "trip" and "taxi" are keywords of the traffic topic category, the topic category of the target term "Ma Yuan" can also be determined to be the traffic topic category. Accordingly, the term information that belongs to the traffic topic category can be selected from the term information corresponding to "Ma Yuan". After the server sends the determined term information to the terminal, the terminal may populate and generate the term popup box 404A from the term information, with the term popup box 404A displayed above the target term. As shown in FIG. 4B, the terminal may instead populate and generate a term popup box 404B from the term information; the term popup box 404B is displayed in the top sidebar of the browser window, which avoids occluding the text around the target term.
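The topic disambiguation in the "Ma Yuan" example can be sketched as a keyword vote over the sentences around the term. The keyword table `KEYWORD_TOPICS`, the function names, the sentence splitting on `.`, `!` and `?`, and the majority vote are all assumptions made for illustration; the application does not say how keywords are extracted or mapped to topic categories.

```python
import re
from collections import Counter
from typing import Dict, Optional

# Hypothetical keyword-to-topic table; a real system would load this from its database.
KEYWORD_TOPICS: Dict[str, str] = {
    "trip": "traffic",
    "taxi": "traffic",
    "deposit": "finance",
    "exchange": "finance",
}


def context_sentences(text: str, start: int) -> str:
    """Return the sentence containing position `start` plus its two neighbours."""
    spans = [(m.start(), m.end()) for m in re.finditer(r"[^.!?]+[.!?]?", text)]
    idx = next((i for i, (s, e) in enumerate(spans) if s <= start < e), None)
    if idx is None:
        return text
    lo, hi = max(0, idx - 1), min(len(spans), idx + 2)
    return text[spans[lo][0]:spans[hi - 1][1]]


def topic_for_term(text: str, start: int) -> Optional[str]:
    """Pick the topic category whose keywords occur most often near the term."""
    words = re.findall(r"[a-z]+", context_sentences(text, start).lower())
    votes = Counter(KEYWORD_TOPICS[w] for w in words if w in KEYWORD_TOPICS)
    return votes.most_common(1)[0][0] if votes else None
```

With the text "The trip takes 20 minutes. A taxi costs 30 Ma Yuan." both "trip" and "taxi" vote for the traffic category, so the server would return the traffic-related term information. When no keyword is found the sketch returns `None`, and the caller would need its own default.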
In one embodiment, each piece of term information corresponding to the same target term may also be tagged, and part of the term information may be selected and sent to the terminal based on the sentence keywords in the preceding and subsequent sentences of the target term. For example, for the target term "Oriental Pearl" (东方明珠), when the text to be processed reads "eating at the Oriental Pearl", the term information corresponding to "eating" can be selected, such as term information tagged "dining", which may be a brief introduction to restaurants near the Oriental Pearl. When the text to be processed reads "visiting the Oriental Pearl", the term information corresponding to "visiting" can be selected, such as term information tagged "entertainment", which may be a brief introduction to entertainment activities near the Oriental Pearl together with a ticket purchase link.
In one embodiment, the term popup box contains a resource control. After the term information is sent to the terminal, the method further includes: counting the number of resource link requests sent by the terminal within a preset duration; and generating a resource link response according to the number of instructions, where the resource link response is used to instruct the terminal to change the shape of the resource control according to the number of instructions.
A resource control is a control that encapsulates all or part of the term information. The term information includes, but is not limited to, one or a combination of picture information, text information, web page links, and audio information. Accordingly, the resource control may be a control for navigating to another web page link, a control for displaying picture information, or a control for playing audio information, but is not limited to these. After the terminal displays the term popup box containing the resource control, it can also receive click operations on the resource control. The terminal may generate a resource link request from a click operation on the resource control and send the resource link request to the server. The server may count the number of resource link requests sent by the terminal within the preset duration and generate a resource link response according to that number (the number of instructions). The resource link response may contain a shape parameter for the resource control and may be used to instruct the terminal to change the shape of the resource control according to the shape parameter. For example, the shape parameter can be generated from the number of requests: the larger the number of requests, the larger the generated shape parameter, so that the size of the resource control increases.
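A minimal sketch of this counting step follows. It assumes a sliding time window for the preset duration and a linear, capped mapping from request count to a scale factor; the class `ResourceClickCounter`, the function names, the `scale` field, and the constants are hypothetical, since the application only states that a larger request count yields a larger shape parameter.

```python
from collections import deque
from typing import Deque, Dict, Union


class ResourceClickCounter:
    """Counts resource link requests received within a preset duration (sliding window)."""

    def __init__(self, window_seconds: float) -> None:
        self.window = window_seconds
        self.times: Deque[float] = deque()

    def record(self, timestamp: float) -> None:
        # Timestamps are assumed to arrive in non-decreasing order.
        self.times.append(timestamp)

    def count(self, now: float) -> int:
        # Drop requests that fall outside the preset duration.
        while self.times and self.times[0] <= now - self.window:
            self.times.popleft()
        return len(self.times)


def shape_parameter(request_count: int, base: float = 1.0,
                    step: float = 0.1, cap: float = 2.0) -> float:
    """Assumed mapping: grow linearly with the request count, up to a cap."""
    return min(cap, base + step * request_count)


def build_resource_link_response(control_id: str,
                                 request_count: int) -> Dict[str, Union[str, float]]:
    """Response payload telling the terminal how to resize the resource control."""
    return {"control_id": control_id, "scale": shape_parameter(request_count)}
```

The cap is a design choice of the sketch: without it, a heavily clicked control could grow beyond the popup box.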
In one embodiment, event tracking may also be added to the resource control contained in the term popup box, so that the number of click operations on the resource control can be counted directly.
In one embodiment, the term popup box may include multiple regions, each of which can display at least one of text, a picture, a resource control, and the like. For example, as shown in FIG. 4C, the term popup box 404C contains three regions: region 404C-a, region 404C-b, and region 404C-c. Region 404C-a can be used to display the explanatory text of the target term "Ma Yuan" (马元), region 404C-b can be used to display product information related to the target term "Ma Yuan", and region 404C-c can be used to display the pronunciation of the target term "Ma Yuan". After counting, within the preset duration, the number of instructions b sent by the terminal for the product information in region 404C-b and the number of instructions c sent for the pronunciation of "Ma Yuan" in region 404C-c, the shares of the whole term popup box occupied by region 404C-b and region 404C-c can be changed according to b and c. For example, when b is far greater than c, viewers are far more interested in the product information in region 404C-b than in the pronunciation of "Ma Yuan" in region 404C-c, so the share of the popup box occupied by region 404C-b can be increased and the share occupied by region 404C-c can be reduced.
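One way to turn the two counts into area shares is a proportional split, sketched below. The function `region_shares`, the share reserved for the explanatory text region, and the add-one smoothing (so that a region with zero clicks does not vanish) are assumptions of this sketch rather than details from the application.

```python
from typing import Dict


def region_shares(counts: Dict[str, int], reserved: float = 0.4) -> Dict[str, float]:
    """Split the non-reserved part of the popup among regions in proportion to their counts.

    `reserved` is the assumed fixed share kept for regions without counts,
    such as the explanatory text in region 404C-a.
    """
    if not 0.0 <= reserved < 1.0:
        raise ValueError("reserved must be in [0, 1)")
    available = 1.0 - reserved
    # Add-one smoothing keeps a region with no clicks visible.
    smoothed = {name: count + 1 for name, count in counts.items()}
    total = sum(smoothed.values())
    return {name: available * value / total for name, value in smoothed.items()}
```

With `{"404C-b": 89, "404C-c": 9}` and the default reserve, region 404C-b receives about 54% of the popup and region 404C-c about 6%.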
In the above embodiment, the shape of the resource control is changed according to the number of resource link requests corresponding to the resource control in the term popup box.
In one embodiment, as shown in FIG. 5, a text processing apparatus 500 is provided, including: a to-be-processed text acquisition module 502, configured to acquire text to be processed; a target term matching module 504, configured to input the text to be processed into a preset text matching model to obtain a matched target term and the term position of the target term in the text to be processed; a term information determining module 506, configured to determine, according to the text to be processed, the term information corresponding to the target term; a hyperlink generating module 508, configured to convert the target term in the text to be processed into a hyperlink according to the term information and the term position; and a term information sending module 510, configured to send the term information to the terminal when a lookup instruction for the hyperlink sent by the terminal is received.
In one embodiment, the target term matching module 504 is further configured to input the text to be processed into the preset text matching model, where the text matching model is composed of a preset term tree and an execution function; and to match the text to be processed against the preset term tree according to the execution function, obtaining the matched target term and the term position of the target term in the text to be processed.
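The term tree and execution function can be illustrated with a character trie and a longest-match scan. This is a sketch under assumptions: the application does not specify the tree layout or the matching policy, so the nested-dictionary trie, the `END` sentinel, and the function names `build_term_tree` and `match_terms` are hypothetical.

```python
from typing import Dict, List, Tuple

END = "$end"  # sentinel key marking a complete term; never equals a single character


def build_term_tree(terms: List[str]) -> Dict:
    """Build a character trie (the 'term tree') from the preset terms."""
    root: Dict = {}
    for term in terms:
        node = root
        for ch in term:
            node = node.setdefault(ch, {})
        node[END] = term
    return root


def match_terms(text: str, tree: Dict) -> List[Tuple[str, int, int]]:
    """Scan the text and return (term, start, end) for each longest match."""
    matches: List[Tuple[str, int, int]] = []
    i = 0
    while i < len(text):
        node, j, longest = tree, i, None
        # Walk down the tree while characters keep matching.
        while j < len(text) and text[j] in node:
            node = node[text[j]]
            j += 1
            if END in node:
                longest = (node[END], i, j)
        if longest:
            matches.append(longest)
            i = longest[2]  # continue after the matched term
        else:
            i += 1
    return matches
```

Each result carries the start position and the end position (exclusive) of the matched term, which is the information the hyperlink generation step needs.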
In one embodiment, the term information determining module 506 is further configured to determine whether the target term corresponds to term information of multiple topic categories; if so, to determine the topic category of the target term according to the term position; and to use the term information belonging to the determined topic category as the term information corresponding to the target term.
In one embodiment, the term information determining module 506 is further configured to obtain, according to the term position, the preceding sentence and the subsequent sentence of the target term in the text to be processed; to extract the sentence keywords in the preceding sentence and the subsequent sentence; and to determine the topic category to which the sentence keywords belong and use the determined topic category as the topic category of the target term.
In one embodiment, the term position includes a start position and an end position. The hyperlink generating module 508 is further configured to insert a start tag, according to the term information, at the start position corresponding to the target term; to insert an end tag at the end position corresponding to the target term; and to convert the target term into a hyperlink according to the start tag and the end tag, where the hyperlink is used to link to the term information.
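The tag insertion can be sketched with HTML anchor tags. The choice of `<a>` elements, the `href` values, and the function name `to_hyperlinks` are assumptions for illustration; the application only speaks of a start tag and an end tag. The sketch also assumes that matches do not overlap.

```python
import html
from typing import List, Tuple


def to_hyperlinks(text: str, matches: List[Tuple[int, int, str]]) -> str:
    """Wrap each (start, end, href) span of `text` in a start tag and an end tag.

    The output is assembled left to right, so the original offsets stay valid
    even though the inserted tags make the result longer than the input.
    """
    parts: List[str] = []
    cursor = 0
    for start, end, href in sorted(matches):
        parts.append(html.escape(text[cursor:start]))
        parts.append('<a href="%s">' % html.escape(href, quote=True))  # start tag
        parts.append(html.escape(text[start:end]))
        parts.append("</a>")  # end tag
        cursor = end
    parts.append(html.escape(text[cursor:]))
    return "".join(parts)
```

Escaping the surrounding text is a precaution added by the sketch so that characters such as `<` in the text to be processed cannot be mistaken for markup.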
In one embodiment, the term information sending module 510 is further configured to send the term information to the terminal, where the term information is used to generate a corresponding term popup box at the terminal.
In one embodiment, the term popup box contains a resource control. The term information sending module 510 is further configured to count the number of resource link requests sent by the terminal within a preset duration, and to generate a resource link response according to the number of instructions, where the resource link response is used to instruct the terminal to change the shape of the resource control according to the number of instructions.
For the specific limitations of the text processing apparatus, refer to the limitations of the text processing method above; they are not repeated here. Each module in the text processing apparatus may be implemented in whole or in part by software, hardware, or a combination of the two. The modules may be embedded in, or independent of, a processor in the computer device in hardware form, or may be stored in a memory of the computer device in software form, so that the processor can invoke and execute the operations corresponding to each module.
The text processing apparatus described above may be implemented in the form of computer-readable instructions, which can run on a computer device as shown in FIG. 6.
In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure may be as shown in FIG. 6. The computer device includes a processor, a memory, a network interface, and a database connected through a system bus. The processor of the computer device provides computing and control capabilities. The memory of the computer device includes a non-volatile computer-readable storage medium and an internal memory. The non-volatile computer-readable storage medium stores an operating system, computer-readable instructions, and a database. The internal memory provides an environment for running the operating system and the computer-readable instructions stored in the non-volatile computer-readable storage medium. The database of the computer device stores data such as the text to be processed and the term tree. The network interface of the computer device is used to communicate with external terminals over a network connection. For example, the computer device can receive the text to be processed sent by a terminal and can also send term information to the terminal. When executed by the processor, the computer-readable instructions implement a text processing method.
Those skilled in the art will understand that the structure shown in FIG. 6 is only a block diagram of the part of the structure related to the solution of the present application and does not limit the computer device to which the solution is applied. A specific computer device may include more or fewer components than shown in the figure, combine certain components, or have a different arrangement of components.
In one embodiment, a computer device is provided, including a memory and one or more processors. The memory stores computer-readable instructions that, when executed by the processor, implement the steps of the text processing method provided in any embodiment of the present application.
In one embodiment, one or more non-volatile computer-readable storage media storing computer-readable instructions are provided. When executed by one or more processors, the computer-readable instructions cause the one or more processors to implement the steps of the text processing method provided in any embodiment of the present application.
Those of ordinary skill in the art will understand that all or part of the processes in the methods of the above embodiments can be completed by computer-readable instructions directing the relevant hardware. The computer-readable instructions may be stored in a non-volatile computer-readable storage medium and, when executed, may include the processes of the embodiments of the methods described above. Any reference to memory, storage, a database, or another medium used in the embodiments provided in the present application may include non-volatile and/or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM).
The technical features of the above embodiments may be combined arbitrarily. For brevity, not all possible combinations of these technical features are described; however, as long as a combination of these technical features contains no contradiction, it should be considered within the scope of this specification.
The embodiments described above express only several implementations of the present application. Their description is relatively specific and detailed, but it should not be construed as limiting the scope of the invention patent. It should be noted that those of ordinary skill in the art can make a number of variations and improvements without departing from the concept of the present application, and these all fall within the protection scope of the present application. Therefore, the protection scope of this patent application shall be subject to the appended claims.

Claims (20)

  1. A text processing method, comprising:
    acquiring text to be processed;
    inputting the text to be processed into a preset text matching model to obtain a matched target term and a term position of the target term in the text to be processed;
    determining, according to the text to be processed, term information corresponding to the target term;
    converting the target term in the text to be processed into a hyperlink according to the term information and the term position; and
    sending the term information to a terminal when a lookup instruction for the hyperlink sent by the terminal is received.
  2. The method according to claim 1, wherein inputting the text to be processed into the preset text matching model to obtain the matched target term and the term position of the target term in the text to be processed comprises:
    inputting the text to be processed into the preset text matching model, the text matching model being composed of a preset term tree and an execution function; and
    matching the text to be processed against the preset term tree according to the execution function to obtain the matched target term and the term position of the target term in the text to be processed.
  3. The method according to claim 1, wherein determining, according to the text to be processed, the term information corresponding to the target term comprises:
    determining whether the target term corresponds to term information of multiple topic categories;
    if so, determining a topic category of the target term according to the term position; and
    using the term information belonging to the determined topic category as the term information corresponding to the target term.
  4. The method according to claim 3, wherein determining the topic category of the target term according to the term position comprises:
    obtaining, according to the term position, a preceding sentence and a subsequent sentence of the target term in the text to be processed;
    extracting sentence keywords from the preceding sentence and the subsequent sentence; and
    determining the topic category to which the sentence keywords belong, and using the determined topic category as the topic category of the target term.
  5. The method according to any one of claims 1 to 4, wherein the term position comprises a start position and an end position, and converting the target term in the text to be processed into a hyperlink according to the term information and the term position comprises:
    inserting a start tag, according to the term information, at the start position corresponding to the target term;
    inserting an end tag at the end position corresponding to the target term; and
    converting the target term into a hyperlink according to the start tag and the end tag, the hyperlink being used to link to the term information.
  6. The method according to any one of claims 1 to 4, wherein sending the term information to the terminal comprises:
    sending the term information to the terminal, the term information being used to generate a corresponding term popup box at the terminal.
  7. The method according to claim 6, wherein the term popup box contains a resource control, and after sending the term information to the terminal, the method further comprises:
    counting the number of resource link requests sent by the terminal within a preset duration; and
    generating a resource link response according to the number of instructions, the resource link response being used to instruct the terminal to change the shape of the resource control according to the number of instructions.
  8. A text processing apparatus, comprising:
    a to-be-processed text acquisition module, configured to acquire text to be processed;
    a target term matching module, configured to input the text to be processed into a preset text matching model to obtain a matched target term and a term position of the target term in the text to be processed;
    a term information determining module, configured to determine, according to the text to be processed, term information corresponding to the target term;
    a hyperlink generating module, configured to convert the target term in the text to be processed into a hyperlink according to the term information and the term position; and
    a term information sending module, configured to send the term information to a terminal when a lookup instruction for the hyperlink sent by the terminal is received.
  9. A computer device, comprising a memory and one or more processors, the memory storing computer-readable instructions that, when executed by the one or more processors, cause the one or more processors to perform the following steps:
    acquiring text to be processed;
    inputting the text to be processed into a preset text matching model to obtain a matched target term and a term position of the target term in the text to be processed;
    determining, according to the text to be processed, term information corresponding to the target term;
    converting the target term in the text to be processed into a hyperlink according to the term information and the term position; and
    sending the term information to a terminal when a lookup instruction for the hyperlink sent by the terminal is received.
  10. The computer device according to claim 9, wherein the step of inputting the text to be processed into the preset text matching model to obtain the matched target term and the term position of the target term in the text to be processed comprises performing the following steps:
    inputting the text to be processed into the preset text matching model, the text matching model being composed of a preset term tree and an execution function; and
    matching the text to be processed against the preset term tree according to the execution function to obtain the matched target term and the term position of the target term in the text to be processed.
  11. The computer device according to claim 9, wherein the step of determining, according to the text to be processed, the term information corresponding to the target term comprises performing the following steps:
    determining whether the target term corresponds to term information of multiple topic categories;
    if so, determining a topic category of the target term according to the term position; and
    using the term information belonging to the determined topic category as the term information corresponding to the target term.
  12. The computer device according to claim 11, wherein the step of determining the topic category of the target term according to the term position comprises performing the following steps:
    obtaining, according to the term position, a preceding sentence and a subsequent sentence of the target term in the text to be processed;
    extracting sentence keywords from the preceding sentence and the subsequent sentence; and
    determining the topic category to which the sentence keywords belong, and using the determined topic category as the topic category of the target term.
  13. The computer device according to any one of claims 9 to 12, wherein the term position comprises a start position and an end position, and the step of converting the target term in the text to be processed into a hyperlink according to the term information and the term position comprises performing the following steps:
    inserting a start tag, according to the term information, at the start position corresponding to the target term;
    inserting an end tag at the end position corresponding to the target term; and
    converting the target term into a hyperlink according to the start tag and the end tag, the hyperlink being used to link to the term information.
  14. The computer device according to any one of claims 9 to 12, wherein the step of sending the term information to the terminal comprises performing the following steps:
    sending the term information to the terminal, the term information being used to generate a corresponding term popup box at the terminal, the term popup box containing a resource control;
    and, after sending the term information to the terminal, further comprises:
    counting the number of resource link requests sent by the terminal within a preset duration; and
    generating a resource link response according to the number of instructions, the resource link response being used to instruct the terminal to change the shape of the resource control according to the number of instructions.
  15. One or more non-volatile computer-readable storage media storing computer-readable instructions that, when executed by one or more processors, cause the one or more processors to perform the following steps:
    acquiring text to be processed;
    inputting the text to be processed into a preset text matching model to obtain a matched target term and a term position of the target term in the text to be processed;
    determining, according to the text to be processed, term information corresponding to the target term;
    converting the target term in the text to be processed into a hyperlink according to the term information and the term position; and
    sending the term information to a terminal when a lookup instruction for the hyperlink sent by the terminal is received.
  16. The storage medium according to claim 15, wherein the step of inputting the text to be processed into the preset text matching model to obtain the matched target term and the term position of the target term in the text to be processed comprises performing the following steps:
    inputting the text to be processed into the preset text matching model, the text matching model being composed of a preset term tree and an execution function; and
    matching the text to be processed against the preset term tree according to the execution function to obtain the matched target term and the term position of the target term in the text to be processed.
  17. The storage medium according to claim 15, wherein the step of determining, according to the text to be processed, the term information corresponding to the target term comprises performing the following steps:
    determining whether the target term corresponds to term information of multiple topic categories;
    if so, determining a topic category of the target term according to the term position; and
    using the term information belonging to the determined topic category as the term information corresponding to the target term.
  18. The storage medium according to claim 17, wherein the step of determining the topic category of the target term according to the term position comprises performing the following steps:
    obtaining, according to the term position, a preceding sentence and a subsequent sentence of the target term in the text to be processed;
    extracting sentence keywords from the preceding sentence and the subsequent sentence; and
    determining the topic category to which the sentence keywords belong, and using the determined topic category as the topic category of the target term.
  19. The storage medium according to any one of claims 15 to 18, wherein the term position comprises a start position and an end position, and the step of converting the target term in the text to be processed into a hyperlink according to the term information and the term position comprises performing the following steps:
    inserting a start tag, according to the term information, at the start position corresponding to the target term;
    inserting an end tag at the end position corresponding to the target term; and
    converting the target term into a hyperlink according to the start tag and the end tag, the hyperlink being used to link to the term information.
  20. The storage medium according to any one of claims 15 to 18, wherein the step of sending the term information to the terminal comprises performing the following steps:
    sending the term information to the terminal, the term information being used to generate a corresponding term pop-up box on the terminal, the term pop-up box containing a resource control;
    after the sending of the term information to the terminal, the steps further comprise:
    counting the number of resource link requests sent by the terminal within a preset duration; and
    generating a resource link response according to the number of instructions, the resource link response being used to instruct the terminal to change the shape of the resource control according to the number of instructions.
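
The counting step of claim 20 can be sketched with a sliding time window. The sketch treats the claim's "number of instructions" as the counted number of resource link requests, which is an assumption. The class name, the threshold values, and the shape labels (`"small"`, `"medium"`, `"large"`) are likewise hypothetical, since the claim does not say how the shape depends on the count.

```python
import time
from collections import deque


class ResourceLinkCounter:
    """Counts resource link requests received within a preset duration."""

    def __init__(self, window_seconds=60.0):
        self.window_seconds = window_seconds
        self._timestamps = deque()

    def record(self, now=None):
        """Register one request and return the count inside the window."""
        now = time.monotonic() if now is None else now
        self._timestamps.append(now)
        # Drop requests that are older than the preset duration.
        while self._timestamps and now - self._timestamps[0] > self.window_seconds:
            self._timestamps.popleft()
        return len(self._timestamps)


def build_response(count, thresholds=((10, "large"), (3, "medium"))):
    """Build a response telling the terminal which control shape to use.

    Thresholds are checked from highest to lowest; both the thresholds
    and the shape names are illustrative assumptions.
    """
    for minimum, shape in thresholds:
        if count >= minimum:
            return {"request_count": count, "control_shape": shape}
    return {"request_count": count, "control_shape": "small"}
```

A server would call `record()` for each incoming resource link request and pass the returned count to `build_response()`; the terminal would then redraw the resource control in the indicated shape.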
PCT/CN2018/099965 2018-02-07 2018-08-10 Text processing method, apparatus, computer device and storage medium WO2019153685A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810124552.1 2018-02-07
CN201810124552.1A CN108399150B (en) 2018-02-07 2018-02-07 Text processing method and device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2019153685A1 true WO2019153685A1 (en) 2019-08-15

Family

ID=63095401

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/099965 WO2019153685A1 (en) 2018-02-07 2018-08-10 Text processing method, apparatus, computer device and storage medium

Country Status (2)

Country Link
CN (1) CN108399150B (en)
WO (1) WO2019153685A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021093333A1 (en) * 2019-11-14 2021-05-20 网易(杭州)网络有限公司 Audio playback method, electronic device, and storage medium

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109657094B (en) * 2018-11-27 2024-05-07 平安科技(深圳)有限公司 Audio processing method and terminal equipment
WO2020133186A1 (en) * 2018-12-28 2020-07-02 深圳市世强元件网络有限公司 Document information extraction method, storage medium, and terminal
CN110532440A (en) * 2019-08-21 2019-12-03 深圳追一科技有限公司 Information processing method, device, equipment and storage medium
CN110515530B (en) * 2019-08-28 2021-04-20 北京思维造物信息科技股份有限公司 Method, device and equipment for processing character operation and storage medium
CN113051925B (en) * 2019-12-26 2024-06-18 中国移动通信集团有限公司 Time identification method, device, equipment and computer storage medium
CN111597324B (en) * 2020-05-20 2023-10-03 北京搜狗科技发展有限公司 Text query method and device
CN112257388A (en) * 2020-10-19 2021-01-22 深圳市大成天下信息技术有限公司 Content display method, mobile terminal and system
CN112307175B (en) * 2020-12-02 2021-11-02 龙马智芯(珠海横琴)科技有限公司 Text processing method, text processing device, server and computer readable storage medium
CN113222492B (en) * 2021-03-29 2024-05-03 北京中交兴路信息科技有限公司 Method and device for discriminating type of vehicle driving line, storage medium and terminal
CN113139033A (en) * 2021-05-13 2021-07-20 平安国际智慧城市科技股份有限公司 Text processing method, device, equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101777054A (en) * 2009-01-14 2010-07-14 北京搜狗科技发展有限公司 Method for displaying attribute information of entry and device thereof
CN105893422A (en) * 2015-12-03 2016-08-24 乐视网信息技术(北京)股份有限公司 Method and system for creating hyperlink in video play page and server
CN107273549A (en) * 2017-07-24 2017-10-20 山东中创软件工程股份有限公司 A kind of entry method for transformation and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103853544A (en) * 2012-12-07 2014-06-11 华为技术有限公司 Method and device for generating hyperlinks as well as terminal
CN107391667B (en) * 2017-07-20 2020-02-21 维沃移动通信有限公司 Entry processing method and mobile terminal

Also Published As

Publication number Publication date
CN108399150B (en) 2020-05-22
CN108399150A (en) 2018-08-14

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18905135

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A 03.12.2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18905135

Country of ref document: EP

Kind code of ref document: A1