CN110866085A - Data feedback method and device - Google Patents

Data feedback method and device Download PDF

Info

Publication number
CN110866085A
CN110866085A CN201911064999.5A CN201911064999A CN110866085A CN 110866085 A CN110866085 A CN 110866085A CN 201911064999 A CN201911064999 A CN 201911064999A CN 110866085 A CN110866085 A CN 110866085A
Authority
CN
China
Prior art keywords
tables
target
data query
data
query information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911064999.5A
Other languages
Chinese (zh)
Inventor
苏萌
刘钰
代其锋
徐林杰
刘译璟
于帮付
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baifendian Information Science & Technology Co Ltd
Original Assignee
Beijing Baifendian Information Science & Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baifendian Information Science & Technology Co Ltd filed Critical Beijing Baifendian Information Science & Technology Co Ltd
Priority to CN201911064999.5A priority Critical patent/CN110866085A/en
Publication of CN110866085A publication Critical patent/CN110866085A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • G06F16/319Inverted lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a data feedback method and device, and relates to the technical field of data query. The data query information is input into a multi-mode matching automaton, so that a second target keyword matched with the data query information is extracted from first target keywords in a plurality of tables, wherein the multi-mode matching automaton is constructed in advance according to the first target keywords in the plurality of tables; if tables comprising second target keywords are matched according to a pre-established inverted index, recording the number of the second target keywords in each table, wherein the inverted index is constructed according to the first target keywords and the table IDs of the plurality of tables in advance; the tables with the number sorted to the top N are fed back to the terminal equipment, so that the worksheets do not need to be traversed one by one manually, the efficiency of data question answering is greatly improved, and the accuracy is high.

Description

Data feedback method and device
Technical Field
The present application relates to the field of data query technologies, and in particular, to a data feedback method and apparatus.
Background
BI is a business analysis service that provides insight to make quick and informed decisions, to convert data into visual objects, and to share with colleagues on any device. The BI includes a large amount of interactive shared worksheets, and the interactive worksheets are worksheets which are used for visually browsing and analyzing local data and cloud data in one view, collaborating and sharing the interactive worksheets.
In the prior art, when a user asks and answers data to a BI by using natural language, the user needs to manually traverse worksheets one by one until the worksheet meeting the requirement is selected, however, the data asking and answering mode is low in efficiency and needs to spend a large amount of time and cost.
Disclosure of Invention
The embodiment of the application provides a data feedback method and device, which aim to solve the problem of low efficiency of data question-answer query table.
In a first aspect, an embodiment of the present application provides a data feedback method, including:
responding a data access request sent by the terminal equipment, wherein the data access request carries data query information;
inputting the data query information into a multi-mode matching automaton to extract a second target keyword matched with the data query information from first target keywords in a plurality of tables, wherein the multi-mode matching automaton is constructed in advance according to the first target keywords in the plurality of tables;
if tables comprising second target keywords are matched according to a pre-established inverted index, recording the number of the second target keywords in each table, wherein the inverted index is constructed according to the first target keywords and the table IDs of the plurality of tables in advance;
and feeding back the table with the number sorted as the top N to the terminal equipment.
In a second aspect, an embodiment of the present application further provides a data feedback apparatus, including:
the request response unit is configured to respond to a data access request sent by the terminal equipment, wherein the data access request carries data query information;
an information extraction unit configured to input the data query information to a multi-pattern matching automaton to extract a second target keyword matched with the data query information from first target keywords in a plurality of tables, wherein the multi-pattern matching automaton is constructed in advance according to the first target keywords in the plurality of tables;
an information recording unit configured to record the number of tables including the second target keyword for each table if the table including the second target keyword is matched according to a pre-established inverted index, wherein the inverted index is previously constructed according to the first target keyword and the table ID of the plurality of tables;
and the information feedback unit is configured to feed back the table with the number of the top N to the terminal equipment.
The embodiment of the application adopts at least one technical scheme which can achieve the following beneficial effects: the data query information is input into a multi-mode matching automaton, so that a second target keyword matched with the data query information is extracted from first target keywords in a plurality of tables, wherein the multi-mode matching automaton is constructed in advance according to the first target keywords in the plurality of tables; if tables comprising second target keywords are matched according to a pre-established inverted index, recording the number of the second target keywords in each table, wherein the inverted index is constructed according to the first target keywords and the table IDs of the plurality of tables in advance; the tables with the number sorted to the top N are fed back to the terminal equipment, so that the worksheets do not need to be traversed one by one manually, the efficiency of data question answering is greatly improved, and the accuracy is high.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
fig. 1 is a flowchart of a data feedback method according to an embodiment of the present application;
fig. 2 is a schematic interaction diagram of a terminal device and a server according to an embodiment of the present application;
fig. 3 is a flowchart of a data feedback method according to an embodiment of the present application;
FIG. 4 is a flowchart of a data feedback method according to an embodiment of the present disclosure;
FIG. 5 is a flowchart of a data feedback method according to an embodiment of the present disclosure;
FIG. 6 is a functional block diagram of a data feedback device according to an embodiment of the present disclosure;
fig. 7 is a functional unit block diagram of a data feedback method provided in one implementation manner of an embodiment of the present application;
fig. 8 is a functional unit block diagram of a data feedback method provided in one implementation manner of an embodiment of the present application;
fig. 9 is a block diagram of a server according to an implementation manner of the embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the technical solutions of the present application will be described in detail and completely with reference to the following specific embodiments of the present application and the accompanying drawings. It should be apparent that the described embodiments are only some of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The technical solutions provided by the embodiments of the present application are described in detail below with reference to the accompanying drawings.
Referring to fig. 1, an embodiment of the present application provides a data feedback method, where the data feedback method is applied to a server 101, and as shown in fig. 2, the server 101 is communicatively connected to a plurality of terminal devices 102 for data interaction. The method comprises the following steps:
s11: and responding to a data access request sent by the terminal equipment 102, wherein the data access request carries data query information.
The user may input data query information in the application program of the terminal device 102, and may send a data access request to the server 101 by clicking "query". The data query information may be "population of each province", "sales amount of each province", and the like, which are merely examples.
S12: and inputting the data query information into the multi-mode matching automaton so as to extract a second target keyword matched with the data query information from the first target keywords in the plurality of tables.
Specifically, the first target keyword may include a field name, an enumerated value, and an entity tag. Wherein the field name comprises an enumerated value field and a non-enumerated value field. When the field name is an enumeration value field, the enumeration value field is subordinate with an enumeration value. For example, the enumerated value field name may be "province," and the enumerated value may be "Sichuan," "Hunan," "Hebei," "Guangdong," and so on. The entity tag may be "person name", "place name", "organization name", "time", "mailbox", "phone", and the like.
The multi-mode matching automaton is constructed in advance according to first target keywords in a plurality of tables. Specifically, the process of constructing the multi-pattern matching automaton is as follows: the first target keyword of the table is identified, and the first target keyword is taken as a field name and an enumerated value for example. For example, when a field is a "name", it is recognized that values of the field greater than 50% are all human names, the field is determined to be a human name field, a repetition rate is checked for the field name of the text type, and if the repetition rate of the field name of the text type is greater than a preset threshold (e.g., 80%), the field is determined to be an enumerated value field, and the enumerated value of the field is extracted. The enumerated values, the field names and the entity labels of all the tables are placed in a list, and then the enumerated values, the field names and the entity labels are built according to a building rule of a multi-mode matching automaton, namely a prefix tree mode, wherein each word is a node in the prefix tree. During the construction process, failure pointers on the nodes of the prefix tree need to be constructed according to the strategy of the kmp algorithm. And finally, all enumerated values, field names and words on the entity labels are contained in the prefix tree with the failure pointers, so that the construction of the multi-mode matching automaton is completed. Wherein the enumerated value of the field is in the same position as the field name.
After the data query information is input into the multi-mode matching automaton, matching is carried out in the multi-mode matching automaton word by word until the matching fails when a node representing successful matching is found or a pointer cannot be found for transferring, and therefore a second target keyword matched with the data query information can be efficiently extracted. For example, the data query information includes field names of "population" and "identity", and the first target keyword in the table includes field names of "population", "identity", "expense", "income", "goods", and the like, and the matched second target keyword is "population" and "identity".
S13: judging whether a table including the second target keyword is matched according to a pre-established inverted index, if so, executing S14, wherein the inverted index is constructed according to the first target keyword and table ID of a plurality of tables in advance, and optionally, if not, executing S41.
Specifically, the manner of constructing the inverted index may be: the method is constructed according to field names, enumerated values, entity labels and table IDs of a plurality of tables. It will be appreciated that a table ID containing the second target key may be derived from the inverted index described above.
S14: the number of second target keywords included in each table is recorded.
For example, if table a includes 2 field names, 3 enumerated values, and 0 entity tags, then table a includes 5 second target keywords, and if table B includes 1 field name, 1 enumerated value, and 2 entity tags, then table a includes 4 second target keywords.
S15: the table with the number sorted to the top N is fed back to the terminal device 102.
It will be appreciated that the greater the number of second target keywords included in the table, the more matching the data access request. The table with the number ranked as the top N may be most matched with the requirement of the user, and therefore, the table with the number ranked as the top N is fed back to the terminal device 102 for the user to select. Wherein N may be equal to 1, 2, 3, etc., and is not limited herein.
According to the data feedback method provided by the embodiment of the application, data query information is input into the multi-mode matching automaton, so that a second target keyword matched with the data query information is extracted from first target keywords in a plurality of tables, wherein the multi-mode matching automaton is constructed in advance according to the first target keywords in the plurality of tables; if tables comprising second target keywords are matched according to a pre-established inverted index, recording the number of the second target keywords in each table, wherein the inverted index is constructed according to the first target keywords and the table IDs of the plurality of tables in advance; the tables with the number sorted to the top N are fed back to the terminal device 102, so that the worksheets do not need to be traversed one by one manually, the efficiency of data question answering is greatly improved, and the accuracy is high.
Optionally, the table with the first quantity rank is fed back to the terminal device 102. The table with the first rank in number matches best with the user's requirements.
Specifically, when only the table with the first number rank is fed back to the terminal apparatus 102, as shown in fig. 3, S15 includes:
s31: if the first quantity-ordered table includes at least two, determining a jaccard similarity coefficient of the data query information and the first target keyword in each first quantity-ordered table.
The Jaccard similarity coefficient (Jaccard similarity coeffient) is used to compare similarity and difference between a finite sample set. The larger the Jaccard coefficient value, the higher the sample similarity. The Jaccard coefficient J (A, B) is defined as the ratio of the size of the intersection of A and B to the size of the union of A and B, and is defined as follows:
Figure BDA0002259043910000061
s32: and feeding back the table with the largest jaccard similarity coefficient to the terminal equipment 102.
It is understood that the larger the Jaccard similarity factor, the higher the match to the data access request. The table with the largest jaccard similarity coefficient is fed back to the terminal device 102, so that the requirements of the user are met.
The method further comprises the following steps:
s41: the data query information is converted into a first text vector and the first target keywords in each table are converted into a second text vector.
Specifically, the model data query information may be generated using a word vector to convert to a first text vector and to convert the first target keyword in each table to a second text vector. Wherein the Word vector generation model may be a Word2vec model.
S42: and determining the similarity of the first text vector and the second text vector corresponding to each table respectively.
In this embodiment of the present application, a cosine similarity calculation method may be adopted to determine the similarity between the first text vector and the second text vector corresponding to each table.
S43: the table with the maximum similarity is fed back to the terminal device 102.
It can be understood that the table with the greater similarity is matched with the data access request with the greater matching degree, and the table with the greatest similarity is fed back to the terminal device 102, so as to meet the requirements of the user.
Optionally, as shown in fig. 5, the method further includes:
s51: and changing the first target key words in the inverted index.
For example, when the field name or the enumerated value of the table is changed, the field name or the enumerated value in the inverted index is dynamically changed.
Referring to fig. 6, the embodiment of the present application further provides a data feedback apparatus 600, it should be noted that the basic principle and the generated technical effect of the data feedback apparatus 600 provided in the embodiment of the present application are the same as those of the above embodiment, and for a brief description, corresponding contents in the above embodiment may be referred to where the embodiment of the present application is not mentioned in part. The apparatus 600 includes a request response unit 601, an information extraction unit 602, an information recording unit 603, and an information feedback unit 604, wherein,
the request response unit 601 is configured to respond to a data access request sent by the terminal device 102, where the data access request carries data query information.
The information extraction unit 602 is configured to input the data query information to the multi-pattern matching automaton to extract a second target keyword matching the data query information from the first target keywords in the plurality of tables.
The multi-mode matching automaton is constructed in advance according to first target keywords in a plurality of tables.
The information recording unit 603 is configured to record the number of tables including the second target keyword each if the table including the second target keyword is matched according to a previously established inverted index, which is previously constructed from the first target keyword and the table ID of the plurality of tables;
the information feedback unit 604 is configured to feed back the table with the number sorted to the top N to the terminal device 102.
The data feedback apparatus 600 provided in the embodiment of the present application can implement the following functions when executed: the data query information is input into a multi-mode matching automaton, so that a second target keyword matched with the data query information is extracted from first target keywords in a plurality of tables, wherein the multi-mode matching automaton is constructed in advance according to the first target keywords in the plurality of tables; if tables comprising second target keywords are matched according to a pre-established inverted index, recording the number of the second target keywords in each table, wherein the inverted index is constructed according to the first target keywords and the table IDs of the plurality of tables in advance; the tables with the number sorted to the top N are fed back to the terminal device 102, so that the worksheets do not need to be traversed one by one manually, the efficiency of data question answering is greatly improved, and the accuracy is high.
Optionally, the information feedback unit 604 is specifically configured to feed back the table with the first quantity rank to the terminal device 102.
Optionally, the information feedback unit 604 is specifically configured to determine a jaccard similarity coefficient between the data query information and the first target keyword in each quantity-sorted first table if the quantity-sorted first table includes at least two tables; and feeding back the table with the largest jaccard similarity coefficient to the terminal equipment 102.
Optionally, as shown in fig. 7, the apparatus 600 further includes:
a vector conversion unit configured to convert the data query information into a first text vector and convert the first target keyword in each table into a second text vector if the table including the second target keyword is not matched according to the pre-established inverted index.
And the similarity determining unit is configured to determine the similarity of the first text vector and the second text vector corresponding to each table respectively.
The information feedback unit 604 is configured to feed back the table with the largest similarity to the terminal device 102.
Optionally, as shown in fig. 8, the apparatus 600 further includes:
an information changing unit 701 configured to change the first target keyword in the inverted index.
It should be noted that the execution subjects of the steps of the method provided in embodiment 1 may be the same device, or different devices may be used as the execution subjects of the method. For example, the execution subject of steps 11 and 12 may be device 1, and the execution subject of step 13 may be device 2; for another example, the execution subject of step 11 may be device 1, and the execution subjects of step 12 and step 13 may be device 2; and so on.
The foregoing description has been directed to specific embodiments of this disclosure. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims may be performed in a different order than in the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing may also be possible or may be advantageous.
Fig. 9 is a schematic structural diagram of an electronic device according to an embodiment of the present application, where the electronic device may be the server described above. Referring to fig. 9, at a hardware level, the electronic device includes a processor, and optionally further includes an internal bus, a network interface, and a memory. The memory may include a memory, such as a Random-access memory (RAM), and may further include a non-volatile memory, such as at least 1 disk memory. Of course, the electronic device may also include hardware required for other services.
The processor, the network interface, and the memory may be connected to each other via an internal bus, which may be an ISA (Industry Standard Architecture) bus, a PCI (peripheral component Interconnect) bus, an EISA (Extended Industry Standard Architecture) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one double-headed arrow is shown in FIG. 9, but this does not indicate only one bus or one type of bus.
And the memory is used for storing programs. In particular, the program may include program code comprising computer operating instructions. The memory may include both memory and non-volatile storage and provides instructions and data to the processor.
The processor reads the corresponding computer program from the nonvolatile memory into the memory and then runs the computer program to form the data feedback device on the logic level. The processor is used for executing the program stored in the memory and is specifically used for executing the following operations:
responding a data access request sent by terminal equipment, wherein the data access request carries data query information;
inputting the data query information into a multi-mode matching automaton to extract a second target keyword matched with the data query information from first target keywords in a plurality of tables, wherein the multi-mode matching automaton is constructed in advance according to the first target keywords in the plurality of tables;
if tables comprising the second target keywords are matched according to a pre-established inverted index, recording the number of the second target keywords in each table, wherein the inverted index is constructed according to the first target keywords and the table IDs of the tables in advance;
and feeding back the table with the number sorted as the top N to the terminal equipment.
The method performed by the data feedback device according to the embodiment shown in fig. 1 of the present application may be applied to or implemented by a processor. The processor may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in a processor or instructions in the form of software. The Processor may be a general-purpose Processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; but also Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete gates or transistor logic devices, discrete hardware components. The various methods, steps, and logic blocks disclosed in the embodiments of the present application may be implemented or performed. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in connection with the embodiments of the present application may be directly implemented by a hardware decoding processor, or implemented by a combination of hardware and software modules in the decoding processor. The software module may be located in ram, flash memory, rom, prom, or eprom, registers, etc. storage media as is well known in the art. The storage medium is located in a memory, and a processor reads information in the memory and completes the steps of the method in combination with hardware of the processor.
The electronic device may also execute the method shown in fig. 1 and implement the functions of the data feedback apparatus in the embodiments shown in fig. 1 and fig. 2, which are not described herein again in this embodiment of the present application.
Of course, besides the software implementation, the electronic device of the present application does not exclude other implementations, such as a logic device or a combination of software and hardware, and the like, that is, the execution subject of the following processing flow is not limited to each logic unit, and may also be hardware or a logic device.
Embodiments of the present application also provide a computer-readable storage medium storing one or more programs, where the one or more programs include instructions, which when executed by a portable electronic device including a plurality of application programs, enable the portable electronic device to perform the method of the embodiment shown in fig. 1, and are specifically configured to:
responding a data access request sent by terminal equipment, wherein the data access request carries data query information;
inputting the data query information into a multi-mode matching automaton to extract a second target keyword matched with the data query information from first target keywords in a plurality of tables, wherein the multi-mode matching automaton is constructed in advance according to the first target keywords in the plurality of tables;
if tables comprising the second target keywords are matched according to a pre-established inverted index, recording the number of the second target keywords in each table, wherein the inverted index is constructed according to the first target keywords and the table IDs of the tables in advance;
and feeding back the table with the number sorted as the top N to the terminal equipment.
In short, the above description is only a preferred embodiment of the present application, and is not intended to limit the scope of the present application. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present application shall be included in the protection scope of the present application.
The systems, devices, modules or units illustrated in the above embodiments may be implemented by a computer chip or an entity, or by a product with certain functions. One typical implementation device is a computer. In particular, the computer may be, for example, a personal computer, a laptop computer, a cellular telephone, a camera phone, a smartphone, a personal digital assistant, a media player, a navigation device, an email device, a game console, a tablet computer, a wearable device, or a combination of any of these devices.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The embodiments in the present specification are described in a progressive manner, and the same and similar parts among the embodiments are referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for the system embodiment, since it is substantially similar to the method embodiment, the description is simple, and for the relevant points, reference may be made to the partial description of the method embodiment.

Claims (10)

1. A data feedback method, comprising:
responding a data access request sent by terminal equipment, wherein the data access request carries data query information;
inputting the data query information into a multi-mode matching automaton to extract a second target keyword matched with the data query information from first target keywords in a plurality of tables, wherein the multi-mode matching automaton is constructed in advance according to the first target keywords in the plurality of tables;
if tables comprising the second target keywords are matched according to a pre-established inverted index, recording the number of the second target keywords in each table, wherein the inverted index is constructed according to the first target keywords and the table IDs of the tables in advance;
and feeding back the table with the number sorted as the top N to the terminal equipment.
2. The method of claim 1, wherein feeding back the table with the number sorted as top N to the terminal device comprises:
and feeding back the table with the first quantity sequence to the terminal equipment.
3. The method of claim 2, wherein feeding back the first quantity-ordered table to the terminal device comprises:
if the first quantity-sorted table comprises at least two, determining the jaccard similarity coefficient of the data query information and the first target keyword in each first quantity-sorted table;
and feeding back the table with the maximum jaccard similarity coefficient to the terminal equipment.
4. The method of claim 1, further comprising:
if the tables including the second target keywords are not matched according to the pre-established inverted index, converting the data query information into a first text vector and converting the first target keywords in each table into a second text vector;
determining the similarity between the first text vector and a second text vector corresponding to each table respectively;
and feeding back the table with the maximum similarity to the terminal equipment.
5. The method of claim 1, further comprising:
and changing the first target key words in the inverted index.
6. A data feedback apparatus, comprising:
the request response unit is configured to respond to a data access request sent by terminal equipment, wherein the data access request carries data query information;
an information extraction unit configured to input the data query information to a multi-pattern matching automaton to extract a second target keyword matched with the data query information from first target keywords in a plurality of forms, wherein the multi-pattern matching automaton is constructed in advance according to the first target keyword in the plurality of forms;
an information recording unit configured to record the number of tables including the second target keyword for each table if the table including the second target keyword is matched according to a pre-established inverted index, wherein the inverted index is previously constructed according to the first target keyword and the table ID of a plurality of tables;
an information feedback unit configured to feed back the table with the number sorted as top N to the terminal device.
7. The apparatus according to claim 6, wherein the information feedback unit is specifically configured to feed back a table with a first number rank to the terminal device.
8. The apparatus according to claim 7, wherein the information feedback unit is specifically configured to determine a jaccard similarity coefficient of the data query information with the first target keyword in each quantity-sorted first table if the quantity-sorted first table includes at least two; and feeding back the table with the maximum jaccard similarity coefficient to the terminal equipment.
9. The apparatus of claim 6, further comprising:
a vector conversion unit configured to convert the data query information into a first text vector and convert the first target keyword in each table into a second text vector if the table including the second target keyword is not matched according to a pre-established inverted index;
a similarity determining unit configured to determine similarity of the first text vector with a second text vector corresponding to each table, respectively;
the information feedback unit is configured to feed back the table with the maximum similarity to the terminal device.
10. The apparatus of claim 6, further comprising:
an information changing unit configured to change the first target keyword in the inverted index.
CN201911064999.5A 2019-11-04 2019-11-04 Data feedback method and device Pending CN110866085A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911064999.5A CN110866085A (en) 2019-11-04 2019-11-04 Data feedback method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911064999.5A CN110866085A (en) 2019-11-04 2019-11-04 Data feedback method and device

Publications (1)

Publication Number Publication Date
CN110866085A true CN110866085A (en) 2020-03-06

Family

ID=69654820

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911064999.5A Pending CN110866085A (en) 2019-11-04 2019-11-04 Data feedback method and device

Country Status (1)

Country Link
CN (1) CN110866085A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112948545A (en) * 2021-02-25 2021-06-11 平安国际智慧城市科技股份有限公司 Duplicate checking method, terminal equipment and computer readable storage medium
CN114090760A (en) * 2022-01-20 2022-02-25 阿里巴巴达摩院(杭州)科技有限公司 Data processing method of table question and answer, electronic equipment and readable storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101192237A (en) * 2006-11-30 2008-06-04 国际商业机器公司 Method and system for inquiring multiple information
CN101477568A (en) * 2009-02-12 2009-07-08 清华大学 Integrated retrieval method for structured data and non-structured data
CN103164408A (en) * 2011-12-09 2013-06-19 阿里巴巴集团控股有限公司 Information storage and query method based on vertical search engine and device thereof
CN106909647A (en) * 2017-02-21 2017-06-30 福建榕基软件股份有限公司 A kind of data retrieval method and device
CN107491547A (en) * 2017-08-28 2017-12-19 北京百度网讯科技有限公司 Searching method and device based on artificial intelligence
US20180165354A1 (en) * 2016-12-09 2018-06-14 Salesforce.Com, Inc. Augmenting match indices
CN110019647A (en) * 2017-10-25 2019-07-16 华为技术有限公司 A kind of keyword search methodology, device and search engine
US20190236178A1 (en) * 2018-01-31 2019-08-01 Salesforce.Com, Inc. Trie-based normalization of field values for matching
CN110347722A (en) * 2019-07-11 2019-10-18 软通智慧科技有限公司 Data acquisition method, device, equipment and storage medium based on HBase

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101192237A (en) * 2006-11-30 2008-06-04 国际商业机器公司 Method and system for inquiring multiple information
CN101477568A (en) * 2009-02-12 2009-07-08 清华大学 Integrated retrieval method for structured data and non-structured data
CN103164408A (en) * 2011-12-09 2013-06-19 阿里巴巴集团控股有限公司 Information storage and query method based on vertical search engine and device thereof
US20180165354A1 (en) * 2016-12-09 2018-06-14 Salesforce.Com, Inc. Augmenting match indices
CN106909647A (en) * 2017-02-21 2017-06-30 福建榕基软件股份有限公司 A kind of data retrieval method and device
CN107491547A (en) * 2017-08-28 2017-12-19 北京百度网讯科技有限公司 Searching method and device based on artificial intelligence
CN110019647A (en) * 2017-10-25 2019-07-16 华为技术有限公司 A kind of keyword search methodology, device and search engine
US20190236178A1 (en) * 2018-01-31 2019-08-01 Salesforce.Com, Inc. Trie-based normalization of field values for matching
CN110347722A (en) * 2019-07-11 2019-10-18 软通智慧科技有限公司 Data acquisition method, device, equipment and storage medium based on HBase

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112948545A (en) * 2021-02-25 2021-06-11 平安国际智慧城市科技股份有限公司 Duplicate checking method, terminal equipment and computer readable storage medium
CN114090760A (en) * 2022-01-20 2022-02-25 阿里巴巴达摩院(杭州)科技有限公司 Data processing method of table question and answer, electronic equipment and readable storage medium
CN114090760B (en) * 2022-01-20 2022-04-26 阿里巴巴达摩院(杭州)科技有限公司 Data processing method of table question and answer, electronic equipment and readable storage medium

Similar Documents

Publication Publication Date Title
CN108763952B (en) Data classification method and device and electronic equipment
CN111898643B (en) Semantic matching method and device
US9524526B2 (en) Disambiguating authors in social media communications
US10037381B2 (en) Apparatus and method for searching information based on Wikipedia's contents
US20230205755A1 (en) Methods and systems for improved search for data loss prevention
CN110276009B (en) Association word recommendation method and device, electronic equipment and storage medium
US20220129491A1 (en) Method for supporting agent and computing device thereof
CN111898380A (en) Text matching method and device, electronic equipment and storage medium
CN113535817A (en) Method and device for generating characteristic broad table and training business processing model
CN110866085A (en) Data feedback method and device
US20120059786A1 (en) Method and an apparatus for matching data network resources
CN108228869B (en) Method and device for establishing text classification model
CN111144098B (en) Recall method and device for extended question
US20180157744A1 (en) Comparison table automatic generation method, device and computer program product of the same
CN110427496B (en) Knowledge graph expansion method and device for text processing
CN111177093A (en) Method, device and medium for sharing scientific and technological resources
CN110704423A (en) Excitation information acquisition method and device, storage medium and electronic equipment
CN110018844B (en) Management method and device of decision triggering scheme and electronic equipment
CN111143203A (en) Machine learning method, privacy code determination method, device and electronic equipment
CN111897910A (en) Information pushing method and device
CN112101308B (en) Method and device for combining text boxes based on language model and electronic equipment
CN117851608B (en) Case map generation method, device, equipment and medium
CN114417808B (en) Article generation method and device, electronic equipment and storage medium
CN114048394A (en) Target object determination method and device, electronic equipment and storage medium
CN116306651A (en) Multi-label qualitative word recognition method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: No.27, 1st floor, building 14, Haidian Road, zhongjiancai District, Beijing 100096

Applicant after: Beijing PERCENT Technology Group Co.,Ltd.

Address before: 100096 101 / F, building 14, 27 Jiancai Middle Road, Haidian District, Beijing

Applicant before: BEIJING BAIFENDIAN INFORMATION SCIENCE & TECHNOLOGY Co.,Ltd.

RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200306