CN116089459B - Data retrieval method, device, electronic equipment and storage medium - Google Patents

Publication number
CN116089459B
Authority
CN
China
Prior art keywords
search
data
type
retrieval
localization
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310369548.2A
Other languages
Chinese (zh)
Other versions
CN116089459A (en)
Inventor
李登高
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lianren Healthcare Big Data Technology Co Ltd
Original Assignee
Lianren Healthcare Big Data Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lianren Healthcare Big Data Technology Co Ltd
Priority to CN202310369548.2A
Publication of CN116089459A
Application granted
Publication of CN116089459B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2453 Query optimisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2455 Query execution
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462 Approximate or statistical queries
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a data retrieval method, a device, electronic equipment and a storage medium. The method comprises the following steps: receiving a search text to be processed, and determining at least one search keyword to be used that corresponds to the search text to be processed; determining search information corresponding to the at least one search keyword to be used, based on the at least one search keyword to be used and a pre-constructed search data table; and determining a target search result based on the search information. The technical scheme provided by the embodiments of the invention enables mixed retrieval of localized data and standardized data that are both included in the same search data table, which improves retrieval efficiency and the user experience.

Description

Data retrieval method, device, electronic equipment and storage medium
Technical Field
The present invention relates to the field of data management technologies, and in particular, to a data retrieval method, a data retrieval device, an electronic device, and a storage medium.
Background
With the development of information technology, various emerging information technologies such as the Internet, cloud service, the Internet of things and the like are fused with the medical industry, wherein the application of a big data system brings unprecedented technological breakthroughs for the health medical industry. With the continuous development of big data technology, the status of data management in the medical industry is also becoming more and more important.
In the process of medical data aggregation, governance and application, it is sometimes necessary to search all databases. However, the data in the original databases may contain words that cannot be standardized, in which case not all matching data can be retrieved.
Currently, in the prior art, when a search text input by a user includes non-standardized content, the non-standardized content is generally converted into standardized content, and the search result is then determined based on the converted text. The disadvantage of this approach is that content which cannot be standardized cannot be matched to the corresponding search data, which reduces search efficiency and degrades the user experience.
Disclosure of Invention
The invention provides a data retrieval method, a device, electronic equipment and a storage medium, which enable mixed retrieval of localized data and standardized data based on the same search data table, improving retrieval efficiency and the user experience.
According to an aspect of the present invention, there is provided a data retrieval method, the method comprising:
receiving a search text to be processed, and determining at least one search keyword to be used corresponding to the search text to be processed;
determining search information corresponding to the at least one search keyword to be used based on the at least one search keyword to be used and a pre-constructed search data table; wherein the search data table is constructed based on a first type search table constructed based on at least one localization language type and a second type search table constructed based on a standard language type, the second type search table is constructed based on the first type search table, and the at least one localization language type corresponds to the standard language type;
and determining a target retrieval result based on the retrieval information.
According to another aspect of the present invention, there is provided a data retrieval apparatus comprising:
a to-be-used search keyword determining module, configured to receive a search text to be processed and determine at least one search keyword to be used corresponding to the search text to be processed;
a search information determining module, configured to determine search information corresponding to the at least one search keyword to be used based on the at least one search keyword to be used and a search data table constructed in advance; wherein the search data table is constructed based on a first type search table constructed based on at least one localization language type and a second type search table constructed based on a standard language type, the second type search table is constructed based on the first type search table, and the at least one localization language type corresponds to the standard language type;
and a target search result determining module, configured to determine a target search result based on the search information.
According to another aspect of the present invention, there is provided an electronic apparatus including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein
the memory stores a computer program executable by the at least one processor to enable the at least one processor to perform the data retrieval method of any one of the embodiments of the present invention.
According to another aspect of the present invention, there is provided a computer readable storage medium storing computer instructions for causing a processor to execute a data retrieval method according to any one of the embodiments of the present invention.
According to the technical scheme of the embodiments of the invention, a search text to be processed is received and at least one search keyword to be used corresponding to it is determined; search information corresponding to the at least one search keyword to be used is then determined based on the at least one search keyword to be used and a pre-constructed search data table; finally, a target search result is determined based on the search information. This solves the prior-art problem that, when searching with a search text input by a user, content that cannot be standardized cannot be matched to the corresponding search data, which reduces search efficiency. When the search text includes words that cannot be standardized, the corresponding search information can be determined directly, without conversion to a standardized language, which improves search efficiency and the user experience.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the invention or to delineate the scope of the invention. Other features of the present invention will become apparent from the description that follows.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is apparent that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flow chart of a data retrieval method according to a first embodiment of the present invention;
FIG. 2 is a flow chart of a data retrieval method according to a second embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a data retrieval device according to a third embodiment of the present invention;
FIG. 4 is a schematic structural diagram of an electronic device implementing a data retrieval method according to an embodiment of the present invention.
Detailed Description
In order that those skilled in the art may better understand the present invention, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. Apparently, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without inventive effort shall fall within the scope of protection of the present invention.
It should be noted that the terms "first," "second," and the like in the description and the claims of the present invention and the above figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the invention described herein may be implemented in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
It can be appreciated that, before the technical solutions disclosed in the embodiments of the present disclosure are used, the user should be informed of the type, scope of use, usage scenarios, etc. of the personal information involved in the present disclosure, and the user's authorization should be obtained, in an appropriate manner according to relevant laws and regulations.
Example 1
Fig. 1 is a flowchart of a data retrieval method according to the first embodiment of the present invention. This embodiment is applicable to the case where a target search result is obtained without standardized conversion when a search text to be processed includes a localized search keyword. The method may be performed by a data retrieval device, which may be implemented in hardware and/or software and may be configured in a terminal and/or a server. As shown in fig. 1, the method includes:
S110, receiving a search text to be processed, and determining at least one search keyword to be used, which corresponds to the search text to be processed.
In this embodiment, the search text to be processed may be a search text input by the user through a terminal device, that is, text characterizing the user's search needs. By way of example, the search text to be processed may be "what medicine is mainly consumed by men in city A aged 40 suffering from a cold", or "what medicine is mainly consumed by men in city A aged 20 suffering from respiratory tract infection". Correspondingly, the search keywords to be used may be the words in the search text to be processed that are used to retrieve information for that text. For example, when the search text to be processed is "what medicine is mainly consumed by men in city A aged 40 suffering from a disease", the corresponding search keywords to be used may be "40 years old", "suffering from a disease", "men", and so on.
In practical application, the received search text to be processed input by the user may be split according to a preset word segmentation rule to obtain at least one search word. Each search word is then matched against preset keywords, and a search word that matches a preset keyword is taken as a search keyword to be used, so that the system can subsequently use the obtained search keywords to be used to retrieve information for the search text to be processed.
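The word segmentation and keyword matching described above can be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the embodiment does not specify the preset word segmentation rule, so whitespace tokenization with forward maximum matching is assumed here, and the names `segment`, `keywords_to_use` and `PRESET_KEYWORDS` are hypothetical.

```python
# Hypothetical preset keywords; the embodiment does not list them.
PRESET_KEYWORDS = {"aged 40", "suffering from a cold", "men"}


def segment(text, vocabulary, max_len=4):
    """Split text into search words using forward maximum matching.

    Assumed segmentation rule: tokens are whitespace-separated, and the
    longest run of up to max_len tokens found in the vocabulary is kept
    as one search word; otherwise a single token is emitted.
    """
    tokens = text.lower().split()
    words = []
    i = 0
    while i < len(tokens):
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            candidate = " ".join(tokens[i:i + n])
            if n == 1 or candidate in vocabulary:
                words.append(candidate)
                i += n
                break
    return words


def keywords_to_use(text, preset_keywords=PRESET_KEYWORDS):
    """Keep only the search words that match a preset keyword."""
    vocabulary = {k.lower() for k in preset_keywords}
    return [w for w in segment(text, vocabulary) if w in vocabulary]


query = "what medicine is mainly consumed by men in city A aged 40 suffering from a cold"
print(keywords_to_use(query))
```

Search words that match no preset keyword (for example "medicine" or "city") are simply dropped, which mirrors the matching step described in this embodiment.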
S120, determining search information corresponding to at least one search keyword to be used based on the at least one search keyword to be used and a pre-constructed search data table.
In this embodiment, the search data table may be a data table constructed in advance for retrieving information for search texts that include both a localized language and a standardized language. The search data table may be constructed based on a first type search table, which is constructed based on a localization language type, and a second type search table, which is constructed based on a standard language type. The localization language type corresponds to the standard language type; that is, the search data table includes both localization data and standardized data. Here, a localized language may be language as expressed by a natural person, and a standardized language may be language recognizable by a computer. The first type search table may be a table storing localization data, including localization codes and localization values; the localization values may include spoken words that cannot be standardized. Illustratively, the first type search table may be the data lake of a data storage system, i.e. the ODS (Operational Data Store) layer, which may store the most primitive data acquired from each service system and is the source data for the other layers. The second type search table may be a table storing standardized data, including standardized codes and standardized values. Illustratively, the second type search table may be a primitive distributed database of the data storage system, i.e. the DWD (Data Warehouse Detail) layer, which mainly cleans the ODS-layer data synchronized from the service databases and integrates it into corresponding event tables.
It should be noted that the second type search table is constructed based on the first type search table. Specifically, if a piece of localization data in the first type search table is detected to include an element that can be standardized, the corresponding standardized data can be generated through governance by means of object recognition or standardized conversion; each piece of standardized data is then stored in a table to obtain the second type search table.
In practical application, after at least one search keyword to be used is determined, each search keyword to be used can be looked up in the pre-constructed search data table to determine the search information corresponding to the at least one search keyword to be used. The search information may be the search data in the search data table to which a search keyword to be used belongs.
Optionally, determining the search information corresponding to the at least one search keyword to be used includes: for each search keyword to be used, determining a localization value corresponding to the current search keyword to be used based on a search data table, and taking a search record to which the localization value belongs as search information.
In this embodiment, the localization value may be the text of the localization data stored in the first type search table. Illustratively, the localization values may be "disease A", "disease B", "suffering from disease A", "disease C", "disease D", and so on. The search record to which a localization value belongs may be the piece of data in the search data table that includes that localization value.
In a specific implementation, after obtaining at least one search keyword to be used, for each search keyword to be used, information matched with the current search keyword to be used may be searched in a search data table, and if a localization value corresponding to the current search keyword to be used is detected, a search record to which the localization value belongs may be used as search information, so that a search result required by a user may be determined based on the search information.
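As a rough sketch of this lookup, the search data table can be modeled as a list of records indexed by localization value. The record layout, the field names, the exact-match rule and the sample rows below are assumptions made for illustration; the embodiment only states that the record containing the matching localization value is taken as the search information.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SearchRecord:
    """One row of the search data table (hypothetical layout)."""
    localization_value: str
    ods_id: str                       # index number in the first type table
    standardized_code: Optional[str] = None
    dwd_id: Optional[str] = None      # index number in the second type table


# Made-up sample rows; the pairing of index numbers is illustrative only.
SEARCH_DATA_TABLE = [
    SearchRecord("cold", "ODSID00003", "J06.903", "DWDID00002"),
    SearchRecord("feeling unwell", "ODSID00004"),
]


def find_search_information(keywords, table):
    """Return the records whose localization value equals a keyword."""
    by_value = {}
    for record in table:
        by_value.setdefault(record.localization_value.lower(), []).append(record)
    found = []
    for keyword in keywords:
        found.extend(by_value.get(keyword.lower(), []))
    return found


print(find_search_information(["cold", "men"], SEARCH_DATA_TABLE))
```

Note that "feeling unwell" has no standardized code, yet it can still be found by its localization value, which is the situation this embodiment is designed for.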
S130, determining a target search result based on the search information.
In this embodiment, the target search result may be search feedback information corresponding to the search text to be processed, which satisfies the user requirement.
In the practical application process, after determining the search information, that is, determining the search record corresponding to the search keyword to be used in the search data table, the target search result may be determined according to the information corresponding to other fields included in the search record.
Optionally, determining the target search result based on the search information includes: and determining a target search result corresponding to the search text to be processed based on at least one target index number contained in the search information.
In this embodiment, the target index number may be an identification for characterizing a specific location of the corresponding search information in the first type of search table and/or the second type of search table. Illustratively, the target index number may be "ODSID00003" or "DWDID00002", or the like. In the practical application process, the retrieval data table is constructed based on the first type retrieval table and the second type retrieval table, and the retrieval data table comprises a localization value in the first type retrieval table and an index number corresponding to the localization value, and a standard code in the second type retrieval table and an index number corresponding to the standard code. When the search information is determined in the search data table, the first type search table and/or the second type search table can be positioned according to at least one target index number included in the search information, so that a target search result corresponding to the search text to be processed can be determined.
Optionally, determining, based on at least one target index number included in the search information, a target search result corresponding to the search text to be processed includes: for each target index number, a target search table associated with the current target index number is determined to determine a target search result corresponding to the search text to be processed based on the current target index number and the target search table.
In this embodiment, the target search table is a first type search table or a second type search table.
In the practical application process, after determining at least one target index number included in the search information, for each target index number, a target search table corresponding to the target index number can be determined according to the information included in the current target index number, further, search data corresponding to the current target index number can be obtained based on the target search table, and thus a target search result corresponding to the to-be-processed search text can be obtained.
For example, when the search text to be processed is "what medicine is mainly consumed by men in city A aged 40 suffering from a cold", the corresponding search keywords to be used may be "40 years old", "suffering from a cold", "men", and so on, and the search information corresponding to "suffering from a cold" and "men" may be determined based on the search data table. Further, when the target index number included in the search information is determined to be "ODSID00003", it can be determined from "ODSID00003" that the target search table corresponding to the search information is the search table belonging to the ODS layer, i.e. the first type search table. The first type search table is then searched according to "ODSID00003" to determine the target search result corresponding to the search information.
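The routing step in this example can be sketched as below. It assumes that the prefix of the index number ("ODSID" or "DWDID") identifies the target search table, which is one plausible reading of "the information included in the current target index number"; the table contents and the name `resolve_target` are hypothetical.

```python
# Hypothetical contents of the two search tables, keyed by index number.
FIRST_TYPE_TABLE = {
    "ODSID00003": {"localization_code": "820", "localization_value": "cold"},
}
SECOND_TYPE_TABLE = {
    "DWDID00002": {
        "standardized_code": "J06.903",
        "standardized_value": "upper respiratory tract infection",
    },
}

# Assumption: the prefix of the index number identifies the table.
TABLE_BY_PREFIX = {
    "ODSID": ("first type", FIRST_TYPE_TABLE),
    "DWDID": ("second type", SECOND_TYPE_TABLE),
}


def resolve_target(index_number):
    """Return (table name, row) for a target index number."""
    for prefix, (table_name, table) in TABLE_BY_PREFIX.items():
        if index_number.startswith(prefix):
            return table_name, table.get(index_number)
    raise ValueError(f"unknown index number prefix: {index_number!r}")


print(resolve_target("ODSID00003"))
print(resolve_target("DWDID00002"))
```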
It should be noted that there may be one or more target index numbers. In practical application, when no element that can be standardized is detected in a piece of localization data, the localization value and its corresponding index number can be stored directly in the search data table. In this case, if the search information is a search record that includes only a localization value and its corresponding index number, that index number is the target index number. If an element that can be standardized is detected in the localization data, the localization data may be standardized: each item of information in the localization data is converted into a proper noun recognizable by a computer, and the standardized code corresponding to each proper noun is determined. The standardized data corresponding to the localization data is thus obtained, an index number corresponding to the standardized data is determined, and the standardized data and the index number are stored in the corresponding search table. In this case, if the finally determined search information is a search record that includes a localization value and a corresponding standardized code, both the index number corresponding to the localization value and the index number corresponding to the standardized code may be used as target index numbers.
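The two cases above (one target index number or two) can be illustrated with a small helper. The dictionary keys `ODSID`, `DWDSID` and `standardized_code` follow the field names mentioned in this document, but the record layout itself and the function name are assumptions.

```python
def target_index_numbers(record):
    """Collect the target index numbers carried by one search record.

    A record with only a localization value yields its first-table index
    number; a record that also has a standardized code yields both.
    """
    numbers = [record["ODSID"]]
    has_standard = (
        record.get("standardized_code") is not None
        and record.get("DWDSID") is not None
    )
    if has_standard:
        numbers.append(record["DWDSID"])
    return numbers


localized_only = {"localization_value": "feeling unwell", "ODSID": "ODSID00004",
                  "standardized_code": None, "DWDSID": None}
mixed = {"localization_value": "cold", "ODSID": "ODSID00003",
         "standardized_code": "J06.903", "DWDSID": "DWDID00002"}

print(target_index_numbers(localized_only))
print(target_index_numbers(mixed))
```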
According to the technical scheme of this embodiment, a search text to be processed is received and at least one search keyword to be used corresponding to it is determined; search information corresponding to the at least one search keyword to be used is then determined based on the at least one search keyword to be used and a pre-constructed search data table; finally, a target search result is determined based on the search information. This solves the prior-art problem that, when searching with a search text input by a user, content that cannot be standardized cannot be matched to the corresponding search data, which reduces search efficiency. Mixed retrieval of localized data and standardized data that are both included in the same search data table is achieved, and when the search text includes words that cannot be standardized, the corresponding search information can be determined directly, without conversion to a standardized language, which improves search efficiency and the user experience.
Example 2
Fig. 2 is a flowchart of a data retrieval method according to a second embodiment of the present invention. On the basis of the foregoing embodiment, a search data table may be constructed based on the first type search table and the second type search table, so that the corresponding search information is determined based on the constructed search data table. For the specific implementation, see the technical scheme of this embodiment. Technical terms that are the same as or correspond to those in the above embodiment are not repeated here.
As shown in fig. 2, the method includes:
S210, for each piece of localization data in the first type of retrieval table, if at least one standardized element is detected to be included in the current localization data, standardized data corresponding to the current localization data is generated.
In this embodiment, the data stored in the first type search table may be data obtained by aggregating the localized data stored in each service system. A standardized element may be a field that can be standardized. Optionally, the standardized elements may include space, time, subject, object, resource, and the like. The space may be a specific geographic location, for example, department XX of medical institution XX in district XX of city XX; the subject may be a user performing an item, for example, a consultation user, a pharmacist or a laboratory technician; the object may be the object of the activity relative to the subject, for example, a visiting user; the resource may be a resource employed in executing any item.
In practical application, after the first type search table is acquired, each piece of localization data in it can be traversed based on a preset standardized-field screening criterion to determine whether the current localization data includes a standardized element. If at least one standardized element is detected in the current localization data, the current localization data can be converted: the localization value and localization code it includes are converted into a standardized value and a standardized code, which yields the standardized data corresponding to the current localization data. For example, if the localization value included in the localization data is "cold" and the corresponding localization code is "820", standardized conversion yields standardized data whose standardized value is "upper respiratory tract infection" and whose standardized code is "J06.903".
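The conversion in this example can be sketched as a dictionary lookup. This is a deliberate simplification: the embodiment detects standardized elements with a preset screening criterion that is not specified, whereas the sketch treats "has an entry in a hypothetical mapping table" as the detection result. The names `STANDARD_MAPPING`, `standardize` and `source_ODSID` are introduced here for illustration.

```python
# Hypothetical mapping from (localization code, localization value) to
# (standardized code, standardized value); only the pair from the
# example in the text is included.
STANDARD_MAPPING = {
    ("820", "cold"): ("J06.903", "upper respiratory tract infection"),
}


def standardize(localization_record):
    """Return the standardized data for one localization record.

    Returns None when the record has no entry in the mapping, which
    stands in for "no standardized element detected".
    """
    key = (
        localization_record["localization_code"],
        localization_record["localization_value"],
    )
    mapped = STANDARD_MAPPING.get(key)
    if mapped is None:
        return None
    standardized_code, standardized_value = mapped
    return {
        "standardized_code": standardized_code,
        "standardized_value": standardized_value,
        # Assumed back-reference to the source row in the first type table.
        "source_ODSID": localization_record["ODSID"],
    }


cold = {"ODSID": "ODSID00003", "localization_code": "820", "localization_value": "cold"}
print(standardize(cold))
```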
S220, constructing a second type retrieval table based on the standardized data, and constructing a retrieval data table based on the first type retrieval table and the second type retrieval table.
In this embodiment, after processing the localization data including the standardized element and obtaining the corresponding standardized data, the second type search table may be constructed according to each standardized data, that is, each standardized data is correspondingly stored according to the template corresponding to the second type search table, so as to obtain the second type search table.
Further, when the second type of search table is obtained, the search data table can be constructed according to the first type of search table and the second type of search table.
In the practical application process, when information retrieval is performed based on a retrieval text input by a user, if the user is a user in a professional field, in order to realize quick retrieval, the text input by the user may include a standard code so as to perform information retrieval based on the standard code; if the user is a non-professional user and does not know the standard codes corresponding to certain information, the text input by the user may include localization values, so as to retrieve information based on the localization values. In order to achieve the effect of performing mixed search for both localized data and standardized data based on the same data search table, a search data table may be constructed based on a first type search table including localized data and a second type search table including standardized data.
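A minimal sketch of such a mixed search is shown below: the same table is queried whether the keyword is a standardized code (professional user) or a localization value (non-professional user). The row layout, the sample row and the exact-match rule are assumptions made for illustration.

```python
# Made-up sample row; field names follow those mentioned in this document.
SEARCH_DATA_TABLE = [
    {"localization_value": "cold", "ODSID": "ODSID00003",
     "standardized_code": "J06.903", "DWDSID": "DWDID00002"},
]


def mixed_lookup(keyword, table=SEARCH_DATA_TABLE):
    """Match a keyword against either the localization value or the code."""
    key = keyword.strip().lower()
    if not key:
        return []
    return [
        row for row in table
        if row["localization_value"].lower() == key
        or (row["standardized_code"] or "").lower() == key
    ]


print(mixed_lookup("J06.903"))  # keyword given as a standardized code
print(mixed_lookup("cold"))     # keyword given as a localization value
```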
Optionally, constructing the search data table based on the first type search table and the second type search table includes: based on the first type search table, obtaining the localization value and the first preset index number included in each piece of localization data; based on the second type search table, obtaining the standardized code and the second preset index number included in each piece of standardized data; and constructing the search data table based on each standardized code, first preset index number, localization value and second preset index number.
In this embodiment, the standardized code may be an identification corresponding to the standardized information. The standardized code may be composed of symbols, letters, and/or numbers. Illustratively, when the standardized information is gender A, the corresponding standardized code may be 1; when the standardized information is gender B, the corresponding standardized code may be 2; when the standardized information is a disease, the corresponding standardized code may be J06.903 or the like. The first preset index number may be the identification number assigned when the localization data is stored in the first type search table. For example, the field name corresponding to the first preset index number may be ODSID. Correspondingly, the second preset index number may be the identification number assigned when the standardized data is stored in the second type search table. For example, the field name corresponding to the second preset index number may be DWDSID.
In a specific implementation, for each piece of localization data in the first type of retrieval table, the localization value and the first preset index number included in the current localization data can be extracted from the first type of retrieval table. Likewise, for each piece of standardized data in the second type of retrieval table, the standardized code and the second preset index number included in the current standardized data can be extracted from the second type of retrieval table. The standardized code corresponding to each localization value is then determined, and the localization values, the standardized codes, the first preset index numbers and the second preset index numbers are stored in correspondence in the same data table, thereby obtaining the retrieval data table.
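The construction step can be sketched as follows. This is a minimal illustration rather than the patented implementation: the field names ODSID and DWDSID come from the text, while the row layout, the `source_odsid` link from a standardized row back to its localized row, the function name `build_retrieval_table`, and the sample values are all assumptions.

```python
from typing import Dict, List, Optional


def build_retrieval_table(first_type_rows: List[dict],
                          second_type_rows: List[dict]) -> List[dict]:
    """Join localized and standardized rows into one retrieval data table."""
    # Index standardized rows by the localized row they were derived from
    # (the "source_odsid" link is an assumption of this sketch).
    std_by_source: Dict[int, dict] = {
        row["source_odsid"]: row for row in second_type_rows
    }
    table = []
    for local in first_type_rows:
        std: Optional[dict] = std_by_source.get(local["ODSID"])
        if std is None:
            continue  # rows without a standardized counterpart are skipped here
        table.append({
            "localization_value": local["value"],
            "ODSID": local["ODSID"],          # first preset index number
            "standardized_code": std["code"],
            "DWDSID": std["DWDSID"],          # second preset index number
        })
    return table


# Made-up sample rows for illustration only.
first_type = [
    {"ODSID": 101, "value": "cold"},
    {"ODSID": 102, "value": "dizzy"},
]
second_type = [{"DWDSID": 9001, "code": "J06.903", "source_odsid": 101}]

retrieval_table = build_retrieval_table(first_type, second_type)
print(retrieval_table)
```

Keeping both index numbers in each row is what later allows a hit to be traced back to either source table.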
It should be noted that some localization data in the first type of search table may not include any standardized element, so that the standardized data corresponding to that localization data cannot be determined. In this case, the localization data may be stored directly in the search data table, so that when a search text including the localization value is received, a target search result can still be determined based on the corresponding localization value.
Based on the above, the above technical solution further includes: for each piece of localization data in the first type of retrieval table, if no standardized element is detected in the current localization data, determining a localization value and a first preset index number corresponding to the localization data, so as to update the retrieval data table based on the localization value and the first preset index number.
In the practical application process, each piece of localization data in the first type of retrieval table is traversed based on a preset standardized field screening criterion to determine whether the current localization data includes a standardized element. If no standardized element is detected in the current localization data, the localization value and the first preset index number included in the current localization data are extracted directly from the first type of retrieval table and written into the retrieval data table, and the standardized code and the second preset index number corresponding to that localization value are set to a preset field value. The preset field value may be any value and, optionally, may be a null value.
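A small sketch of this update step, under stated assumptions: the screening criterion is modelled as a caller-supplied predicate, the preset field value is taken to be null (`None`), and the row layout, function name, and sample values are hypothetical.

```python
PRESET_FIELD_VALUE = None  # the text allows any preset value; null is one option


def add_unstandardized_rows(retrieval_table, first_type_rows, has_standardized_element):
    """Append localized rows that contain no standardized element."""
    for row in first_type_rows:
        if has_standardized_element(row):
            continue  # such rows are stored together with their standardized code
        retrieval_table.append({
            "localization_value": row["value"],
            "ODSID": row["ODSID"],                    # first preset index number
            "standardized_code": PRESET_FIELD_VALUE,  # no code could be determined
            "DWDSID": PRESET_FIELD_VALUE,             # no second preset index number
        })
    return retrieval_table


# Hypothetical screening criterion: values known to map to a standardized code.
STANDARDIZABLE_VALUES = {"cold"}

first_type = [
    {"ODSID": 101, "value": "cold"},
    {"ODSID": 102, "value": "dizzy"},
]
table = add_unstandardized_rows(
    [], first_type, lambda row: row["value"] in STANDARDIZABLE_VALUES
)
print(table)
```

Because the unstandardized row is still present in the table, a search text containing its localization value can be matched without any conversion to a standardized term.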
S230, receiving the search text to be processed, and determining at least one search keyword to be used corresponding to the search text to be processed.
S240, determining search information corresponding to at least one search keyword to be used based on the at least one search keyword to be used and a pre-constructed search data table.
S250, determining a target search result based on the search information.
According to the technical solution of this embodiment, the search text to be processed is received and at least one search keyword to be used corresponding to it is determined; search information corresponding to the at least one search keyword to be used is then determined based on the at least one search keyword to be used and the pre-constructed search data table; finally, the target search result is determined based on the search information. This solves the problem in the prior art that, when searching based on a search text input by a user, content that cannot be standardized cannot be matched to corresponding search data, which affects search efficiency. It achieves mixed search over localized data and standardized data that are both included in the same search data table and, when the search text includes non-standardized words, allows the corresponding search information to be determined directly without standardized language conversion, thereby improving search efficiency and the user experience.
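Steps S230 to S250 can be sketched end to end as below. The text does not fix how keywords are extracted, so whitespace tokenization is assumed here; matching a keyword against either the localization value or the standardized code is likewise an assumption made to illustrate mixed search, and all names and sample rows are hypothetical.

```python
def extract_keywords(search_text):
    """S230: split the search text into keywords (whitespace split is assumed)."""
    return search_text.split()


def find_search_info(keywords, retrieval_table):
    """S240: collect every record whose localization value or code equals a keyword."""
    hits = []
    for keyword in keywords:
        for record in retrieval_table:
            if keyword in (record["localization_value"], record["standardized_code"]):
                hits.append(record)
    return hits


def target_index_numbers(search_info):
    """S250 (first half): pull out the index numbers used to fetch final results."""
    return [{"ODSID": r["ODSID"], "DWDSID": r["DWDSID"]} for r in search_info]


# Made-up retrieval data table mixing a standardized and an unstandardized record.
retrieval_table = [
    {"localization_value": "cold", "ODSID": 101,
     "standardized_code": "J06.903", "DWDSID": 9001},
    {"localization_value": "dizzy", "ODSID": 102,
     "standardized_code": None, "DWDSID": None},
]

keywords = extract_keywords("J06.903 dizzy")
search_info = find_search_info(keywords, retrieval_table)
index_numbers = target_index_numbers(search_info)
print(index_numbers)
```

In this sketch a professional user's code and a lay user's wording are resolved by the same lookup, which is the mixed-search effect described above.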
Example III
Fig. 3 is a schematic structural diagram of a data retrieval device according to a third embodiment of the present invention. As shown in fig. 3, the apparatus includes:
The search keyword determining module 310 is configured to receive a search text to be processed and determine at least one search keyword to be used corresponding to the search text to be processed;
a search information determining module 320, configured to determine search information corresponding to the at least one search keyword to be used based on the at least one search keyword to be used and a search data table constructed in advance; wherein the search data table is constructed based on a first type search table constructed based on at least one localization language type and a second type search table constructed based on a standard language type, the second type search table is constructed based on the first type search table, and the at least one localization language type corresponds to the standard language type;
the target search result determining module 330 is configured to determine a target search result based on the search information.
According to the technical solution of this embodiment, the search text to be processed is received and at least one search keyword to be used corresponding to it is determined; search information corresponding to the at least one search keyword to be used is then determined based on the at least one search keyword to be used and the pre-constructed search data table; finally, the target search result is determined based on the search information. This solves the problem in the prior art that, when searching based on a search text input by a user, content that cannot be standardized cannot be matched to corresponding search data, which affects search efficiency. It achieves mixed search over localized data and standardized data that are both included in the same search data table and, when the search text includes non-standardized words, allows the corresponding search information to be determined directly without standardized language conversion, thereby improving search efficiency and the user experience.
Optionally, the apparatus further includes: the standardized element generation module and the retrieval data table construction module.
The standardized element generation module is used for generating standardized data corresponding to the current localized data if at least one standardized element is detected to be included in the current localized data for each piece of localized data in the first type retrieval table;
and the retrieval data table construction module constructs a second type retrieval table based on each piece of standardized data, and constructs a retrieval data table based on the first type retrieval table and the second type retrieval table.
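The behaviour of these two modules can be illustrated with a short sketch. The detection of a standardized element is modelled here as a dictionary lookup from localized value to standardized code; that lookup, the sequential DWDSID numbering, the `source_odsid` back-reference, and the sample values are assumptions of this example and are not specified in the text.

```python
def build_second_type_table(first_type_rows, code_lookup, start_dwdsid=1):
    """Generate standardized rows for localized rows that contain a standardized element."""
    second_type_rows = []
    next_dwdsid = start_dwdsid
    for row in first_type_rows:
        code = code_lookup.get(row["value"])
        if code is None:
            continue  # no standardized element detected in this localized row
        second_type_rows.append({
            "DWDSID": next_dwdsid,          # second preset index number
            "code": code,                   # standardized code
            "source_odsid": row["ODSID"],   # assumed link back to the localized row
        })
        next_dwdsid += 1
    return second_type_rows


# Made-up localized rows and a hypothetical value-to-code mapping.
first_type = [
    {"ODSID": 101, "value": "cold"},
    {"ODSID": 102, "value": "dizzy"},
]
code_lookup = {"cold": "J06.903"}

second_type = build_second_type_table(first_type, code_lookup, start_dwdsid=9001)
print(second_type)
```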
Optionally, the search data table construction module includes: a standardized code determining unit, a localization value determining unit and a retrieval data table constructing unit.
A localization value determining unit, configured to obtain, based on the first type search table, a localization value and a first preset index number included in each piece of localization data;
a standardized code determining unit, configured to obtain, based on the second type search table, a standardized code and a second preset index number included in each piece of standardized data;
and the search data table construction unit is used for constructing the search data table based on each standardized code, the first preset index number, the localization value and the second preset index number.
Optionally, the apparatus further includes a retrieval data table updating module.
The search data table updating module is configured to, for each piece of localization data in the first type search table, if it is detected that the current localization data does not include a standardized element, determine a localization value and a first preset index number corresponding to the localization data, so as to update the search data table based on the localization value and the first preset index number.
Optionally, the search information determining module 320 is specifically configured to determine, for each of the search keywords to be used, a localization value corresponding to the search keyword to be used currently based on the search data table, and use a search record to which the localization value belongs as the search information.
Optionally, the target search result determining module 330 includes a target search result determining unit.
The target search result determining unit is configured to determine a target search result corresponding to the to-be-processed search text based on at least one target index number contained in the search information.
Optionally, the target search result determining unit is specifically configured to determine, for each target index number, a target search table associated with a current target index number, so as to determine, based on the current target index number and the target search table, a target search result corresponding to the to-be-processed search text; wherein the target retrieval table comprises the first type retrieval table and/or the second type retrieval table.
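One way to picture this unit is a dispatch from index-number field to source table. The sketch below assumes each source table is a dictionary keyed by its index number and that a missing index number is stored as `None`; the function name and sample data are hypothetical.

```python
def resolve_target_results(search_info, first_type_table, second_type_table):
    """Look up each target index number in the table it is associated with."""
    # ODSID values index the first type table, DWDSID values the second type table.
    target_tables = {"ODSID": first_type_table, "DWDSID": second_type_table}
    results = []
    for record in search_info:
        for field, table in target_tables.items():
            index_number = record.get(field)
            if index_number is not None and index_number in table:
                results.append((field, table[index_number]))
    return results


# Made-up source tables keyed by their index numbers.
first_type_table = {101: "cold", 102: "dizzy"}
second_type_table = {9001: "J06.903"}

search_info = [
    {"ODSID": 101, "DWDSID": 9001},
    {"ODSID": 102, "DWDSID": None},
]
results = resolve_target_results(search_info, first_type_table, second_type_table)
print(results)
```

A record with both index numbers yields results from both tables, while an unstandardized record yields a result from the first type table only, matching the "first type retrieval table and/or second type retrieval table" wording.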
The data retrieval device provided by the embodiment of the invention can execute the data retrieval method provided by any embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method.
Example IV
Fig. 4 shows a schematic diagram of the structure of an electronic device 10 that may be used to implement an embodiment of the invention. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. Electronic devices may also represent various forms of mobile devices, such as personal digital assistants, cellular telephones, smartphones, wearable devices (e.g., helmets, glasses, watches, etc.), and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the invention described and/or claimed herein.
As shown in fig. 4, the electronic device 10 includes at least one processor 11, and a memory, such as a Read Only Memory (ROM) 12, a Random Access Memory (RAM) 13, etc., communicatively connected to the at least one processor 11, in which the memory stores a computer program executable by the at least one processor, and the processor 11 may perform various appropriate actions and processes according to the computer program stored in the Read Only Memory (ROM) 12 or the computer program loaded from the storage unit 18 into the Random Access Memory (RAM) 13. In the RAM 13, various programs and data required for the operation of the electronic device 10 may also be stored. The processor 11, the ROM 12 and the RAM 13 are connected to each other via a bus 14. An input/output (I/O) interface 15 is also connected to bus 14.
Various components in the electronic device 10 are connected to the I/O interface 15, including: an input unit 16 such as a keyboard, a mouse, etc.; an output unit 17 such as various types of displays, speakers, and the like; a storage unit 18 such as a magnetic disk, an optical disk, or the like; and a communication unit 19 such as a network card, modem, wireless communication transceiver, etc. The communication unit 19 allows the electronic device 10 to exchange information/data with other devices via a computer network, such as the internet, and/or various telecommunication networks.
The processor 11 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of processor 11 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various specialized Artificial Intelligence (AI) computing chips, various processors running machine learning model algorithms, Digital Signal Processors (DSPs), and any suitable processor, controller, microcontroller, etc. The processor 11 performs the various methods and processes described above, such as the data retrieval method.
In some embodiments, the data retrieval method may be implemented as a computer program tangibly embodied on a computer-readable storage medium, such as storage unit 18. In some embodiments, part or all of the computer program may be loaded and/or installed onto the electronic device 10 via the ROM 12 and/or the communication unit 19. When the computer program is loaded into RAM 13 and executed by processor 11, one or more steps of the data retrieval method described above may be performed. Alternatively, in other embodiments, the processor 11 may be configured to perform the data retrieval method in any other suitable way (e.g., by means of firmware).
Various implementations of the systems and techniques described above may be implemented in digital electronic circuitry, integrated circuit systems, Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems On Chip (SOCs), Complex Programmable Logic Devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various implementations may include implementation in one or more computer programs that may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, and which may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
A computer program for carrying out methods of the present invention may be written in any combination of one or more programming languages. These computer programs may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the computer programs, when executed by the processor, cause the functions/acts specified in the flowchart and/or block diagram block or blocks to be implemented. The computer program may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of the present invention, a computer-readable storage medium may be a tangible medium that can contain, or store a computer program for use by or in connection with an instruction execution system, apparatus, or device. The computer readable storage medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. Alternatively, the computer readable storage medium may be a machine readable signal medium. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on an electronic device having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) through which a user can provide input to the electronic device. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: Local Area Networks (LANs), Wide Area Networks (WANs), blockchain networks, and the internet.
The computing system may include clients and servers. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also called a cloud computing server or a cloud host, and is a host product in a cloud computing service system, so that the defects of high management difficulty and weak service expansibility in the traditional physical hosts and VPS service are overcome.
It should be appreciated that steps may be reordered, added, or deleted using the various forms of flow shown above. For example, the steps described in the present invention may be performed in parallel, sequentially, or in a different order, so long as the desired results of the technical solution of the present invention are achieved; no limitation is imposed herein.
The above embodiments do not limit the scope of the present invention. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the scope of the present invention.

Claims (7)

1. A data retrieval method, comprising:
receiving a search text to be processed, and determining at least one search keyword to be used corresponding to the search text to be processed;
determining search information corresponding to the at least one search keyword to be used based on the at least one search keyword to be used and a pre-constructed search data table; wherein the search data table is constructed based on a first type search table constructed based on at least one localization language type and a second type search table constructed based on a standard language type, the second type search table is constructed based on the first type search table, and the at least one localization language type corresponds to the standard language type;
determining a target retrieval result based on the retrieval information;
the method further comprises the steps of:
for each piece of localization data in the first type of retrieval table, if at least one standardized element is detected to be included in the current localization data, standardized data corresponding to the current localization data is generated;
constructing a second type of retrieval table based on each piece of standardized data, and constructing a retrieval data table based on the second type of retrieval table and the first type of retrieval table;
wherein said constructing a search data table based on said first type search table and said second type search table comprises:
based on the first type retrieval table, obtaining a localization value and a first preset index number included in each piece of localization data;
based on the second type search table, obtaining a standardized code and a second preset index number included in each piece of standardized data;
constructing the retrieval data table based on each standardized code, the first preset index number, the localization value and the second preset index number;
the method further comprises the steps of:
for each piece of localization data in the first type of retrieval table, if the fact that the current localization data does not comprise standardized elements is detected, determining a localization value and a first preset index number corresponding to the localization data, and updating the retrieval data table based on the localization value and the first preset index number.
2. The method of claim 1, wherein the determining the search information corresponding to the at least one search keyword to be used comprises:
for each search keyword to be used, determining a localization value corresponding to the current search keyword to be used based on the search data table, and taking a search record to which the localization value belongs as the search information.
3. The method of claim 1, wherein the determining a target search result based on the search information comprises:
and determining a target retrieval result corresponding to the to-be-processed retrieval text based on at least one target index number contained in the retrieval information.
4. A method according to claim 3, wherein said determining a target search result corresponding to the search text to be processed based on at least one target index number contained in the search information comprises:
for each target index number, determining a target retrieval table associated with a current target index number, so as to determine a target retrieval result corresponding to the to-be-processed retrieval text based on the current target index number and the target retrieval table; wherein the target retrieval table comprises the first type retrieval table and/or the second type retrieval table.
5. A data retrieval apparatus, comprising:
the search keyword to be used determining module is used for receiving the search text to be processed and determining at least one search keyword to be used corresponding to the search text to be processed;
a search information determining module, configured to determine search information corresponding to the at least one search keyword to be used based on the at least one search keyword to be used and a search data table constructed in advance; wherein the search data table is constructed based on a first type search table constructed based on at least one localization language type and a second type search table constructed based on a standard language type, the second type search table is constructed based on the first type search table, and the at least one localization language type corresponds to the standard language type;
the target search result determining module is used for determining a target search result based on the search information;
the standardized element generation module is used for generating standardized data corresponding to the current localized data if at least one standardized element is detected to be included in the current localized data for each piece of localized data in the first type retrieval table;
the search data table construction module is used for constructing a second type search table based on each piece of standardized data and constructing a search data table based on the first type search table and the second type search table;
wherein, the search data table construction module comprises: a localization value determining unit, a standardized code determining unit and a search data table constructing unit;
the localization value determining unit is configured to obtain, based on the first type search table, a localization value and a first preset index number included in each piece of localization data;
the standardized code determining unit is configured to obtain, based on the second type search table, a standardized code and a second preset index number included in each piece of standardized data;
the search data table construction unit is configured to construct the search data table based on each of the standardized codes, the first preset index number, the localization value, and the second preset index number;
the apparatus further comprises: a retrieval data table updating module;
the search data table updating module is configured to determine, for each piece of localization data in the first type of search table, a localization value and a first preset index number corresponding to the localization data if it is detected that the current localization data does not include a standardized element, so as to update the search data table based on the localization value and the first preset index number.
6. An electronic device, the electronic device comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein
the memory stores a computer program executable by the at least one processor to enable the at least one processor to perform the data retrieval method of any one of claims 1-4.
7. A computer readable storage medium storing computer instructions for causing a processor to perform the data retrieval method of any one of claims 1-4.
Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310369548.2A CN116089459B (en) 2023-04-10 2023-04-10 Data retrieval method, device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310369548.2A CN116089459B (en) 2023-04-10 2023-04-10 Data retrieval method, device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN116089459A (en) 2023-05-09
CN116089459B (en) 2023-06-27

Family

ID=86202913

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310369548.2A Active CN116089459B (en) 2023-04-10 2023-04-10 Data retrieval method, device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116089459B (en)

Also Published As

Publication number Publication date
CN116089459A (en) 2023-05-09

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant