CN114579839A - Data processing system based on webpage - Google Patents

Data processing system based on webpage Download PDF

Info

Publication number
CN114579839A
CN114579839A CN202210265110.5A CN202210265110A CN114579839A CN 114579839 A CN114579839 A CN 114579839A CN 202210265110 A CN202210265110 A CN 202210265110A CN 114579839 A CN114579839 A CN 114579839A
Authority
CN
China
Prior art keywords
character string
data
initial
target
string
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210265110.5A
Other languages
Chinese (zh)
Inventor
周琦
方毅
俞锋锋
吕繁荣
尹祖勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Yunshen Technology Co ltd
Original Assignee
Hangzhou Yunshen Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Yunshen Technology Co ltd filed Critical Hangzhou Yunshen Technology Co ltd
Priority to CN202210265110.5A priority Critical patent/CN114579839A/en
Publication of CN114579839A publication Critical patent/CN114579839A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The invention discloses a data processing system based on web pages, which comprises a plurality of original web page system interfaces, a database, a processor and a memory for storing computer programs, wherein the database comprises: the raw data set comprises a number of raw data lists, which when executed by the processor, perform the following steps: acquiring a target data list; according to any original data list, obtaining an initial data list corresponding to any target character string in a target data list and constructing an initial data set corresponding to the target character string, according to the target character string and the intermediate character string, obtaining a mapping data table corresponding to the target character string and presenting the mapping data table according to the data in the mapping data table corresponding to the target character string; therefore, the method and the device can acquire the related data from different webpage systems to display, avoid the situation that all the required data cannot be displayed, simplify the complex work flow and improve the work efficiency.

Description

Data processing system based on webpage
Technical Field
The invention relates to the technical field of webpage data processing, in particular to a data processing system based on a webpage.
Background
With the continuous development of internet technology and the continuous increase of internet users, a web page system has developed vigorously, wherein the web page system comprises a web page server, an application server and a database, and the web page system is mainly used for inquiring or displaying a large amount of information required by the users.
In the prior art, when a large amount of information required by a user is queried or displayed from a webpage system, a keyword or information needs to be input into the corresponding webpage system for querying or displaying, so that different information required by the user is queried through different webpage systems and cannot be integrated to realize uniform display, and the work flow is complex and the efficiency is low.
Disclosure of Invention
In order to solve the problems in the prior art, an embodiment of the present invention provides a data processing system based on a web page, where the technical solution is as follows:
the invention provides a data processing system based on a webpage, which comprises: a plurality of raw web page system interfaces, a database, a processor, and a memory storing a computer program, the database comprising: original data set H ═ H1,H2,……,Hz},HxThe method refers to an xth original data list, wherein x is 1 … … z, and z is the number of original data lists, wherein the original data list refers to a data list acquired from an original webpage system which is docked through an original webpage system interface, and when the computer program is executed by a processor, the following steps are implemented:
s100, obtain target data list a ═ { a ═ a1,A2,……,Am},AiThe method refers to the ith target character string, wherein i is 1 … … m, and m is the number of the target character strings;
s200, according to HxObtaining AiCorresponding initial data list and construct AiCorresponding initial data set Ci={Ci1,Ci2,……,Cin},Cij={Ci 1,Ci 2,……,Ci gj},Ci rIs referred to as AiThe corresponding ith initial character string in the jth initial data list, j is 1 … … n, n is the number of the initial data lists, r is 1 … … gj,gjIs CijThe number of the initial character strings;
s300, obtaining CijCorresponding to Bij={B1 ij,B2 ij,……,Bp ij},Bq ijIs referred to as CijCorresponding q-th intermediate character string, q-1 … … pj,pjThe following conditions are met:
Figure BDA0003551382840000021
wherein the middle character string is at CiInternal removal of CijAny initial character string in other initial data lists;
s400, according to Cr ijAnd Bq ijObtaining BiCorresponding mapping data table TiAnd will TiAnd performing presentation.
The data processing system based on the webpage has the following technical effects:
the system of the invention comprises: a plurality of raw web page system interfaces, a database, a processor, and a memory storing a computer program, the database comprising: the original data set comprises a plurality of original data lists, wherein the original data lists are data lists acquired from an original webpage system which is in butt joint through an original webpage system interface, and when the computer program is executed by a processor, the following steps are realized:
acquiring a target data list; according to the original data list, obtaining an initial data list corresponding to any target character string in the target data list and constructing an initial data set corresponding to the target character string, wherein the initial data set comprises a plurality of initial data lists, and each initial data list comprises a plurality of initial character strings; acquiring an intermediate data list corresponding to the initial data list, wherein any intermediate character string refers to any initial character string in other initial data lists except the initial data list in the initial data set; acquiring a mapping data table corresponding to the target character string according to the target character string and the intermediate character string, and presenting the mapping data table according to the data in the mapping data table corresponding to the target character string; therefore, the method and the device can acquire the relevant data from different webpage systems for displaying, avoid the situation that all the required data cannot be displayed, and on the other hand, can perform unified display on the relevant data acquired from different webpage systems, simplify the work flow and improve the work efficiency.
In addition, the target field names of the mapping data table obtained in the present invention include: the system comprises a first target field name, a second target field name and a third target field name, wherein the first target field name, the second target field name and the third target field name are respectively determined according to an initial character string and an intermediate character string; when only the webpage system is updated, the mapping data table is updated only by updating the mapping data table, namely the mapping data table is updated based on the field name and the data of the original data list in the updated webpage system, so that the data required by a user can be updated without acquiring the data from the webpage system again, and the working efficiency is improved.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings required to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the description below are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a flowchart illustrating a program executed by a data processing system based on a web page according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or server that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
Example one
This embodiment provides a data processing system based on web page, the system includes: a plurality of raw web page system interfaces, a database, a processor, and a memory storing a computer program, the database comprising: original data set H ═ H1,H2,……,Hz},HxReferring to the xth original data list, where x is 1 … … z, and z is the number of original data lists, where the original data list refers to a data list acquired in an original web page system interfaced through an original web page system interface, and when the computer program is executed by a processor, as shown in fig. 1, the following steps are implemented:
s100, obtain target data list a ═ { a ═ a1,A2,……,Am},AiThe number of the ith target character string is defined, i is 1 … … m, and m is the number of the target character strings.
Specifically, the step S100 further includes the following steps of obtaining the target character string:
s101, obtaining a target language segment, wherein the target language segment is preset by a user or is obtained from a current webpage system, and the current webpage system is different from an original webpage system in interface butt joint with the original webpage system.
S103, performing word segmentation processing on the target language fragment to obtain a target character string; those skilled in the art know that any word segmentation process in the prior art is adopted to obtain a character string, and the description is omitted here.
Further, AiAnd A is removed fromiBesides, other target character strings meet the regular expression conditions, wherein the regular expression conditions are set by technical personnel in the field according to actual requirements, so that the interference among different target characters can be avoided, and the accuracy of data for display is improved.
S200, according to HxObtaining AiCorresponding initial data list is constructed as AiCorresponding initial data set Ci={Ci1,Ci2,……,Cin},Cij={Ci 1,Ci 2,……,Ci gj},Ci rIs referred to as AiThe corresponding ith initial character string in the jth initial data list, j is 1 … … n, n is the number of the initial data lists, r is 1 … … gj,gjIs CijThe number of initial strings.
Specifically, the initial data list in step S200 means that any one of H includes aiThe raw data list of (2) can be understood as: hxIncluding AiThen H is determinedxIs AiA corresponding initial data list.
Preferably, the initial character string can be understood as any record corresponding to the target character string under each field name in the initial data list.
S300, obtaining CijCorresponding to Bij={Bi 1,Bi 2,……,Bi pj},Bi qIs referred to as CijCorresponding q (th) ofMiddle character string, q 1 … … pj,pjIs referred to as CijCorresponding number of intermediate strings, pjThe following conditions are met:
Figure BDA0003551382840000051
wherein the middle character string is at CiInternal removing CijAny initial character string in other initial data lists.
S400, according to Cr ijAnd Bq ijObtaining AiCorresponding mapping data table TiAnd will TiAnd performing presentation.
Specifically, the step S400 further includes the steps of:
s401, when Cr ij=Bq ijWhen it is, C isr ijAs AiA corresponding first key string; the data with repeated field names can be effectively deleted.
Specifically, the database further includes: h corresponds to the priority list F ═ { F ═ F1,F2,……,Fz},FxIs referred to as HxA corresponding priority; those skilled in the art know to set the priority of the original data list according to actual requirements, so as to reflect the importance degree of the original data list.
Further, F1>F2>……>Fz(ii) a It can be understood that: the larger the priority of the original data list is, the more important the importance degree of the user requirements is, and the method can be favorable for removing repeated initial character strings or initial field names under the condition that the initial character strings or the initial field names are the same, and simplifying the displayed data volume.
S403, obtaining Cr ijCorresponding initial field name Dr ijAnd Bq ijCorresponding middle field name Uq ijWhen C is presentr ij≠Bq ijAnd Dr ij=Uq ijWill be driven from Dr ijCorresponding priority and Uq ijObtaining the initial character string corresponding to the maximum priority in the corresponding priority as AiA corresponding second key string; the method can effectively delete the data with repeated field names, and avoid inaccurate data caused by the fact that records under the initial field names in a partial original data list are not updated timely.
In particular, Dr ijCorresponding priority is Cr ijThe priority of the corresponding initial data list, wherein: and the priority of the initial data list is the same as that of the original data list corresponding to the target character string.
Further, the priorities corresponding to all the initial field names in the initial data list are consistent.
Preferably, the determination method of the priority corresponding to the intermediate field name is consistent with the determination method of the priority corresponding to the initial field name, and is not described herein again.
S405, when Cr ij≠Bq ijAnd Dr ij≠Uq ijThen, obtain AiA corresponding third key string.
Specifically, the step S405 further includes the steps of:
when C isr ij=Bq ijWhen it is, obtain Cr ijAll corresponding intermediate character strings are taken as Cr ijA corresponding first designated string;
when C isr ij≠Bq ijAnd Dr ij=Uq ijWhen D isr ijAll the corresponding middle character strings under the names of all the middle fields are taken as Dr ijA corresponding second designated string;
when C is presentr ij≠Bq ijAnd Dr ij≠Uq ijThen, according to the first specified character string and the second specified character string, A is obtainediThe corresponding third key character string, namely AiThe corresponding third key character string is indicated at CiOther initial character strings except the first specified character string and the second specified character string; repeated data can be removed, and the quantity of the displayed data is simplified.
S407, according to AiCorresponding first key string, AiCorresponding second key string and AiCorresponding third key character string, obtaining AiCorresponding mapping data table TiAnd will TiAnd performing presentation.
In a specific embodiment, TiCorresponding field name List { Ti 1,Ti 2,……,Ti kIn which T isi yThe name of the ith target field is shown, y is 1 … … k, and k is the number of the target field names.
Further, Ti 1Corresponding priority is greater than or equal to Ti 2Corresponding priority is not less than … … not less than Ti kThe corresponding priority.
Preferably, the target field name includes: a first target field name, a second target field name and a third target field name, wherein, when C isr ij=Bq ijFrom C to Cr ijMiddle field names and C of all corresponding middle character stringsr ijThe field name with the highest priority determined in the corresponding initial field names is used as the first target field name when Cr ij≠Bq ijAnd Dr ij=Uq ijWhen D isr ijThe field name with the highest priority among all the corresponding intermediate field names is used as the second target field name when Cr ij≠Bq ijAnd Dr ij≠Uq ijWhen, AiTaking the field name of the corresponding third key character string as a third target field name; when only the webpage system is updated, only the mapping data table needs to be updated, namely, the mapping data table is updated based on the field names and the data of the original data list in the updated webpage systemAnd updating is carried out, so that the data required by the user is updated without acquiring the data from the webpage system again, and the working efficiency is improved.
The embodiment provides a data processing system based on a webpage, which comprises: a plurality of raw web page system interfaces, a database, a processor, and a memory storing a computer program, the database comprising: the original data set comprises a plurality of original data lists, wherein the original data lists are data lists acquired from an original webpage system which is in butt joint through an original webpage system interface, and when the computer program is executed by a processor, the following steps are realized:
acquiring a target data list; according to the original data list, obtaining an initial data list corresponding to any target character string in the target data list and constructing an initial data set corresponding to the target character string, wherein the initial data set comprises a plurality of initial data lists, and each initial data list comprises a plurality of initial character strings; acquiring an intermediate data list corresponding to the initial data list, wherein any intermediate character string refers to any initial character string in other initial data lists except the initial data list in the initial data set; acquiring a mapping data table corresponding to the target character string according to the target character string and the intermediate character string, and presenting the mapping data table according to the data in the mapping data table corresponding to the target character string; therefore, the method and the device can acquire the related data from different webpage systems for displaying, avoid the situation that all the required data cannot be displayed, and on the other hand, can perform unified display on the related data acquired from different webpage systems, simplify the work flow and improve the work efficiency.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (8)

1. A web-based data processing system, the system comprising: system interface, database and processor for several original web pagesAnd a memory storing a computer program, the database comprising: original data set H ═ H1,H2,……,Hz},HxThe method refers to an xth original data list, wherein x is 1 … … z, and z is the number of the original data lists, wherein the original data list refers to a data list acquired from an original webpage system which is docked through an original webpage system interface, and when the computer program is executed by a processor, the following steps are implemented:
s100, obtain target data list a ═ { a ═ a1,A2,……,Am},AiThe method refers to the ith target character string, wherein i is 1 … … m, and m is the number of the target character strings;
s200, according to HxObtaining AiCorresponding initial data list and construct AiCorresponding initial data set Ci={Ci1,Ci2,……,Cin},Cij={Ci 1,Ci 2,……,Ci gj},Ci rIs referred to as in AiThe corresponding ith initial character string in the jth initial data list, j is 1 … … n, n is the number of the initial data lists, r is 1 … … gj,gjIs CijThe number of the initial character strings;
s300, acquiring CijCorresponding to Bij={B1 ij,B2 ij,……,Bp ij},Bq ijIs referred to as CijCorresponding qth intermediate string, q 1 … … pj,pjThe following conditions are met:
Figure FDA0003551382830000011
wherein the intermediate character string is at CiInternal removing CijAny initial character string in other initial data lists;
s400, according to Cr ijAnd Bq ijObtaining AiCorresponding mapping data table TiAnd will beTiAnd performing presentation.
2. A web-based data processing system according to claim 1, wherein the step S100 further comprises the steps of obtaining the target character string:
s101, acquiring a target language fragment, wherein the target language fragment is preset by a user or acquired from a current webpage system, and the current webpage system is different from an original webpage system in interface butt joint with the original webpage system;
s103, performing word segmentation processing on the target language fragment to obtain a target character string.
3. A web-based data processing system according to claim 1, wherein aiAnd A is removed fromiOther target character strings meet the regular expression condition.
4. The web-based data processing system of claim 1, wherein in step S200, the initial data list is any one of H including aiThe list of the original data of (a) is,
5. a web-based data processing system according to claim 1,
s401, when Cr ij=Bq ijWhen it is, Cr ijAs AiA corresponding first key string; deleting data with repeated field names effectively;
s403, obtaining Cr ijCorresponding initial field name Dr ijAnd Bq ijCorresponding middle field name Uq ijWhen C is presentr ij≠Bq ijAnd Dr ij=Uq ijWill be from Dr ijCorresponding priority and Uq ijAcquiring the initial character string corresponding to the maximum priority from the corresponding priorities as BiA corresponding second key string;
s405, when Cr ij≠Bq ijAnd Dr ij≠Uq ijThen, obtain AiA corresponding third key character string;
s407, according to AiCorresponding first key string, AiCorresponding second key string and AiCorresponding third key character string, obtaining AiCorresponding mapping data table TiAnd will TiAnd performing presentation.
6. A web-based data processing system according to claim 5, further comprising in said database: h corresponds to the priority list F ═ { F ═ F1,F2,……,Fz},FxIs referred to as HxCorresponding priority and F1>F2>……>Fz
7. The web-based data processing system of claim 5, wherein the step S405 further comprises the steps of:
when C is presentr ij=Bq ijThen, obtain Cr ijAll corresponding intermediate character strings are taken as Cr ijA corresponding first designated string;
when C is presentr ij≠Bq ijAnd Dr ij=Uq ijWhen D isr ijAll the corresponding middle character strings under the names of all the middle fields are taken as Dr ijA corresponding second designated string;
when C is presentr ij≠Bq ijAnd Dr ij≠Uq ijThen, according to the first specified character string and the second specified character string, B is obtainediA corresponding third key string.
8.A Web-based data processing system according to claim 7, characterized in that B isiThe corresponding third key character string is indicated at CiOther initial character strings than the first specified character string and the second specified character string.
CN202210265110.5A 2022-03-17 2022-03-17 Data processing system based on webpage Pending CN114579839A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210265110.5A CN114579839A (en) 2022-03-17 2022-03-17 Data processing system based on webpage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210265110.5A CN114579839A (en) 2022-03-17 2022-03-17 Data processing system based on webpage

Publications (1)

Publication Number Publication Date
CN114579839A true CN114579839A (en) 2022-06-03

Family

ID=81781278

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210265110.5A Pending CN114579839A (en) 2022-03-17 2022-03-17 Data processing system based on webpage

Country Status (1)

Country Link
CN (1) CN114579839A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114785505A (en) * 2022-06-22 2022-07-22 中科雨辰科技有限公司 Data processing system for acquiring abnormal equipment
CN115840742A (en) * 2023-02-13 2023-03-24 每日互动股份有限公司 Data cleaning method, device, equipment and medium
CN117474392A (en) * 2023-10-30 2024-01-30 北京香田智能科技有限公司 Grower potential analysis system

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114785505A (en) * 2022-06-22 2022-07-22 中科雨辰科技有限公司 Data processing system for acquiring abnormal equipment
CN114785505B (en) * 2022-06-22 2022-08-23 中科雨辰科技有限公司 Data processing system for acquiring abnormal equipment
CN115840742A (en) * 2023-02-13 2023-03-24 每日互动股份有限公司 Data cleaning method, device, equipment and medium
CN115840742B (en) * 2023-02-13 2023-05-12 每日互动股份有限公司 Data cleaning method, device, equipment and medium
CN117474392A (en) * 2023-10-30 2024-01-30 北京香田智能科技有限公司 Grower potential analysis system

Similar Documents

Publication Publication Date Title
CN114579839A (en) Data processing system based on webpage
US9846729B1 (en) Attribute category enhanced search
US8661015B2 (en) Identification of name entities via search, determination of alternative searches, and automatic integration of data across a computer network for dynamic portal generation
US8555182B2 (en) Interface for managing search term importance relationships
CN101882149B (en) Reorder and improve the dependency of Search Results
DK177142B1 (en) Procedure for presenting a dataset using search, computer-readable medium and computer
US7552400B1 (en) System and method for navigating within a graphical user interface without using a pointing device
JP4962967B2 (en) Web page search server and query recommendation method
US9171132B1 (en) Electronic note management system and user-interface
US20050251513A1 (en) Techniques for correlated searching through disparate data and content repositories
US20050203888A1 (en) Method and apparatus for improved relevance of search results
US20090265330A1 (en) Context-based document unit recommendation for sensemaking tasks
JP2008538149A (en) Rating method, search result organizing method, rating system, and search result organizing system
NO337464B1 (en) Filtering user interface for a data summary table
JP5225004B2 (en) Content visualization apparatus and content visualization method
US20070094293A1 (en) Filtering search results by grade level readability
CN110134970B (en) Header error correction method and apparatus
CN102750081A (en) Information processing apparatus, information processing method, and program
JP2008243050A (en) Web page retrieval program, method, and program
CN112069783A (en) Medical record input method and input system thereof
CN111143422A (en) Data retrieval method, data retrieval device, storage medium, and electronic device
CN114201615B (en) Scientific research data change review method and server based on data snapshot
US10877970B1 (en) Identifying relevant data sources for a data visualization application
US10699451B1 (en) Generating digital graphical representations reflecting multiple data series utilizing dynamic y-axes
EP2071476A1 (en) Search device, search method and search program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination