CN107077505A - Automatic mode mismatches detection - Google Patents

Automatic mode mismatches detection Download PDF

Info

Publication number
CN107077505A
CN107077505A CN201580062833.3A CN201580062833A CN107077505A CN 107077505 A CN107077505 A CN 107077505A CN 201580062833 A CN201580062833 A CN 201580062833A CN 107077505 A CN107077505 A CN 107077505A
Authority
CN
China
Prior art keywords
schema elements
data set
component
data
working space
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201580062833.3A
Other languages
Chinese (zh)
Inventor
P·阿迪拉
C·斯托姆
A·J·配亚科克
A·内兹
C·科里斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC filed Critical Microsoft Technology Licensing LLC
Publication of CN107077505A publication Critical patent/CN107077505A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/20Software design
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/0486Drag-and-drop
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/30Creation or generation of source code
    • G06F8/34Graphical or visual programming

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • User Interface Of Digital Computer (AREA)
  • Stored Programmes (AREA)

Abstract

Mismatch between auto-id data collection and the schema elements of operation.It is configured as the interactive visual working space for the diagram creation for supporting data conversion streamline to be visually rendered in addition, the mismatch can be combined.After data set is connected to operation, one or more mismatches can be determined and be present in the context of working space.In addition, schema elements can be reconfigured by way of the visual representation with schema elements is interacted, to solve to mismatch.

Description

Automatic mode mismatches detection
Background technology
Change data first is related to collect valuable opinion to the processing of substantial amounts of data or so-called big data.It is logical The establishment, scheduling and execution of one or more operations are crossed, data are converted into for by business intelligence end points (such as instrument board) The available form announced or used.In this context, operation is to include the work of one or more map functions in data Unit.Generally, operation is by hand-codings such as data mining personnel, data framework teacher, business intelligence architects.In addition, exploit person Member or similar individual task are to ensure that data used in operation are constructed in the acceptable mode of operation.
The content of the invention
The content of the invention of simplification presented below, so as to provide to subject some in terms of basic comprehension.This hair Bright content is not extensive general introduction.It is not intended to mark key/critical elements or describes the scope of theme claimed.Its Sole purpose is some concepts are presented in simplified form, is used as the preamble of the embodiment presented later.
Briefly describe, this theme, which is disclosed, is related to automatic mode mismatch detection.Being connected to data set in response to user can The operation on working space depending on creating interface, pattern matching process is initiated.Pattern matching process mark data integrated mode with Between the element for matching and detecting dataset schema and the desired pattern of operation between the element of the desired pattern of operation Mismatch.Mismatch can be detected based on the measurement of corresponding relation intensity and predetermined threshold is indicated.The mismatch detected can With the user being presented in the part at visual creation interface.In addition, user can be able to be solved with to mismatch Mode and the mismatched interaction detected.
For the realization of foregoing and related purpose, claimed theme is described herein in conjunction with following description and accompanying drawing Some illustrative aspects.These aspects are indicated can be in the various modes of practical matter, and all these modes are intended to be in and wanted In the range of the theme for asking protection.When considered in conjunction with the accompanying drawings, other advantages and novel feature can from following detailed description To become apparent.
Brief description of the drawings
Fig. 1 is the block diagram of visual authoring system.
Fig. 2 is the block diagram of representative mode matching component.
Fig. 3 is the block diagram of representative matching identification part.
Fig. 4 is the screenshot capture that exemplary visual creates interface.
Fig. 5 is to create the screenshot capture at interface with the exemplary visual for showing unmatched panel.
Fig. 6 is the screenshot capture that the exemplary visual with the panel for showing unmatched solution creates interface.
Fig. 7 is the flow chart of detection and the unmatched method of solution pattern.
Fig. 8 is the flow chart of the method for pattern match.
Fig. 9 is the flow chart for the method classified to pattern match.
Figure 10 is to show the schematic block diagram suitable for the operating environment of each side disclosed in this theme.
Embodiment
Following details relates generally to automatic mode and mismatches detection and solve.Streamline includes wherein the first operation Output alternatively provides the set of one or more related operations of input to the second operation.For example, one or more input numbers The operation using input data set is may be connected to according to collection, data map function is performed, and produce output data set.Input number Can be different with the pattern of operation (or, in other words, the desired pattern of the operation) according to the pattern of collection.In successful execution operation Before, it is necessary to solve these differences.As provided herein, data set mould can be automatically determined after data source is connected to operation Difference between formula and work pattern.With reference to interactive visual working space, data source diagrammatically is connected into operation to send out Schema elements can be categorized as matching or unmatched pattern matching process by rising.The element of pattern whether with the desired mould of operation Formula, which is matched or mismatched, can be based on corresponding relation intensity and predetermined threshold, and wherein corresponding relation intensity is as one or more The function of factor and the confidence metric calculated, one or more factors include but not limited to element data type and name Claim.Mismatch can be presented in the context with visual working space, and allow users to solve the mismatch. Visually distinguished not with coupling element for example, the schema elements associated with both data set and operation can be presented to have Coupling element.User can then use one or more postures with data source and work pattern Match of elemental composition graphically to solve Never match.Specify for input data set and matched on the contrary, passing through using all of schema elements of operation with requiring user Displaying pattern mismatches and allows it readily to correct, and therefore pattern match is simplified or more efficiently carries out.
Various aspects disclosed in this theme are more fully described with reference now to accompanying drawing, wherein through accompanying drawing identical accompanying drawing mark Note generally refers to identical or corresponding element.It will be appreciated, however, that accompanying drawing and associated detailed description are not intended as institute Claimed theme is limited to particular forms disclosed.Theme claimed is fallen on the contrary, it is intended to be to cover All modifications, equivalent and alternatives in spirit and scope.
With reference first to Fig. 1, visual authoring system 100 is shown.Visual authoring system 100 include working space component 110, Source component 120, target element 130 and pattern matching components 140.Working space component 110 is configured as interactive by providing The diagram of visual working space or painting canvas to enable operation and streamline is created, and wherein streamline includes the defeated of wherein the first operation Go out alternatively to provide the set of one or more related operations of input to the second operation.For example, data set can be represented as Cylinder and the operation that modified data set is connected to using data set and produced by arrow.Substantially, user Can be with the graph of a relation between drawing data collection and operation.This causes intuitively to experience, and it is saved on understanding relation and finally referred to The time of constant current waterline.Mismatched in addition, pattern matching components 140 are configured as automatic markers, and make it possible to solution The certainly mismatch identified between data source and operation.
Source component 120 is configured as producing available data sources or the data set (set for including data) for being used for that operation to be created Visual representation.Arbitrary data collection can be obtained by source component 120 and it is used, including substantially any form is (for example, table Lattice, file, stream ...) or structure (for example, structuring, non-structured, semi-structured) local data source and be based on The data source of cloud.In other words, source component 120 is configured as showing heterogeneous data source.Can be by being provided by source component 120 Search and import feature enable the set of data to be used.In addition, source component 120 can be configured as monitoring user or entity account automatically Family etc. and addressable data source is set to be used.The data source rendered by source component 120 is interactive, and is used as For the input of one or more operations.For example, using posture (such as drag and drop), the data source from source region can be added To working space.
Target element 130 is configured to supply viewing position with display final data after being employed in all conversion Collection.These data sets then can be announced or used by application (such as analysis is applied).A series of result of operation or operations can To be dragged and dropped to from working space in target viewing area.
Working space component 110 is configured as enabling operation including one or more map functions and including wherein first The output of operation alternatively provides the visual wound of the streamline of the set of one or more related operations of input to the second operation Make.Especially, working space component 110 is configured as by way of the figure on working space promoting operation and streamline structure Make.For example, user can be by the way that the visual representation of data source to be dragged and dropped into the working space pane or panel of user interface from source To obtain data source.For example by drawing arrow to operation from data source with indicate data source provide the input that uses of operation and One or more data map functions (for example, sequence, packet, axle turn, segmentation, filtering ...), the data source are performed thereon It may be connected to the operation (for example, being automatically created using data preview and/or hand-coding) previously created.In addition, The expression of transformed output can be linked to the expression of the operation on working space.As a result, defeated from data sources The figure for entering and exporting the operation of the source of new data of the application of one or more map functions of reflection operation is shown.
Pattern matching components 140 are configured as the matching and not of the schema elements between mark data source and operation Match somebody with somebody.Matching and mismatch the confidence metric that is based on the function of one or more factors to calculate and with one or many Individual predetermined threshold is automatically determined compared to relatively.In an example, it can be obtained for data set and the schema elements of operation The factor of such as element type and element term.Based on this, the intensity of " matching " is represented or more generally between schema elements The confidence measure of corresponding relation intensity can be calculated.Then confidence metric can be with indicating to mismatch, matching or both it Between one or more threshold values of certain situation be compared.Pass through the comparison with one or more threshold values, the pattern of data source Element can be classified on the schema elements associated with operation.If for example, the highest confidence between two schema elements Degree measurement is less than percent 50 (50%), then mismatching to be identified.By contrast, if highest confidence metric is higher than Percent 50 (50%), then matching can be identified.Certainly, the 3rd selection can be it is uncertain with the presence or absence of mismatch or Match somebody with somebody.For example, higher than the confidence metric of percent 70 (70%), then matching may be considered that presence, less than 50 percent (50%) then mismatch and may be considered that presence, and can be classified between 50 percent and 70 percent (50-70%) To be uncertain etc..
Fig. 2 depicts representative pattern matching components 140.Pattern matching components include pattern acquiring component 210, matching mark Know component 220, configuration component 230 and visualization component 240.Pattern acquiring component 210 from data source and operation obtaining mode, its Middle work pattern corresponds to the desired pattern of operation.Dataset schema and work pattern can be received or examined from source or operation Rope, or determined alternatively by analyze data source or operation.Matching identification component 220 is configured as one group of schema elements It is categorized as matching or mismatches.
Short attention span is gone into Fig. 3, representative matching identification component 220 is shown.Matching identification component 220 includes Type component 310, title component 320, learning object 330 and confidence component 340.Type component 310 is configured to determine that mould Type matching and mismatch between formula element.More specifically, type component 310, which is configured as mark, includes fundamental type (example Such as, integer, real number, boolean ...) and compound type (for example, array, set, record, object ...) schema elements data Type, and determine the matching of which schema elements based on type and mismatch.Title component 320 is configured as based on pattern member Plain title determines pattern match and mismatch.If schema elements include same or analogous title (for example, in synonym Listed in dictionary), then schema elements can be considered to be name-matches.If on the contrary, schema elements include different titles, Schema elements are considered title mismatch.Learning object 330 is configured as based on the friendship from one or more users Mutually learn to match and mismatch.If for example, user previously indicate the first element matched with second element, then ought be again When running into the first element and second element, learning element 330 could be used to indicate that matching.Confidence component 340 is configured as Generate the confidence metric of the corresponding relation intensity between acquisition mode element.In an example, intermediate scheme can be produced Percentage match or unmatched value between element.For example, first element and second element are with 70 percent (70%) confidence match.In addition, confidence component 340 can be obtained and using from type in generation confidence metric The input of component 310, title component 320 and learning object 330.If for example, two schema elements have identical data class Type and title, the then high likelihood that there is Match of elemental composition.By contrast, if one or more of data type and title no It is similar, then there is the property of may be less likely to and unmatched high probability of matching.
Fig. 2 is returned to, component 230 is corrected and is configured as promoting unmatched solution.If existed not between schema elements Matching, then correcting component 230 and providing allows user to correct these unmatched mechanism so that the whole pattern of data set and operation Desired pattern match.According to embodiment, mismatch can be presented, and user can be by first by the pattern of data source Element graphically corrects mismatch with the schema elements maps mutually of the operation matched.This can be by being positioned relative to each other Completed with schema elements (for example, in mutually colleague of table) or by drawing the line of matching connection element.It is alternatively possible to logical The mode of code specification is crossed to perform this mapping.
Visualization component 240 is configured as presenting mismatches inspection on the pattern in the context of visual authoring system 100 The visualization surveyed and solved.In an example, visualization component 240 can with render mode Match of elemental composition and it is unmatched can Depending on expression.Approached and the potential data set matched of work pattern element for example, visualization component 240 can be generated and presented Schema elements.In addition, graph-based can indicate that two schema elements are matching or mismatch.For example, green point and/ Or tick boxes can serve to indicate that the high probability of matching, and red point and/or tick boxes can be used to indicate that unmatched height Probability.Of course, it is possible to which additional figure is presented come at least one middle ground between representing to match and mismatching, wherein confidence Measurement is between matching or unmatched confidence metric, and additional figure is for example represented by the point and/or question mark of yellow.Another In one example, visualization component 240 can provide interactive visual mechanism to correct mismatch.As an example, schema elements can To be chosen, drag and be placed into the new position corresponding with the schema elements matched.It is alternatively possible to draw line The schema elements of matching connection.In addition, when being received from user for solving unmatched input, can be with generation patterns element Visual display to reflect made change, and the instruction of matching can be shown.Therefore, user can iteratively solution be never Match somebody with somebody, until in the absence of mismatch.
Fig. 4-6 is to show the various visualization sides associated with the visual authoring system 100 including pattern matching components 140 The exemplary screen shots in face.These screenshot captures are intended to aid in being aware and understood in terms of the disclosure, and not purport Limiting theme claimed.It should be appreciated that the screenshot capture provided depict only a realization.Graphic element and Various other combinations of text and arrangement are conceived to and are intended to fall under in scope of the following claims.Moreover, it will be appreciated that Using various sound user can also be aided in the unmatched mark of creation streamline including pattern and in solving.
Fig. 4 is can be by visually creating the screenshot capture at the visual creation interface 400 that interface 100 is produced.As illustrated, should Interface includes three panels, source panel 410, working space panel 420 and announcement panel 430.Source panel 410 presents multiple available Data source 412, and source is added or is removed from it.It should be appreciated that the data source described in source panel 410 412 can be arbitrary source.For example, some data sources 412 can be associated with local data, and other data sources and network Or cloud data repository is associated.In addition, data source 412 can have substantially any structure or form.Working space panel 420 provide data source and the interactive graphical diagram of operation.As illustrated, the operation for such as providing purchase suggestion is represented as standing Cube 422.Cube 422 is connected to the data set for being represented as the first cylinder 424.According to a realization, data set table Showing can be dragged and dropped from source panel 410.Line with arrow the first cylinder 424 is connected to indicate from source to operation from a left side To the cube of right data flow.In addition, the output of operation is represented as the second cylinder 426, and with from cube 422 to Line and the arrow connection of second cylinder 426, depict the output that the second cylinder represents operation.Performing all desired changes After alternatively, announce pane 430 provide come forth or workable data source visual representation.In the first circle by data set is represented When cylinder is connected to the cube 422 for representing the operation in working space panel 420, such as by drawing the first cylinder 424 are connected to the line 428 of cube 422, and pattern matching process can be initiated and Fig. 5 screenshot capture can be produced.
Fig. 5 is the screenshot capture at the visual creation interface 500 that can be produced by visual authoring system 100.Visual creation circle The similarity in face 500 and visual creation interface 400 is that it includes source panel 410 as previously described and working space face Plate 420.In addition, data source is connected to after operation in user, context of the pattern match panel 510 in working space 420 In or original place be presented.Data source can be triggered on data source schema and by the desired pattern of operation to the connection of operation Matching and unmatched determination, and the generation of pattern match panel 510 is at least to show result.Pattern match panel 510 Including table 520 and button 530 to receive mapping.Table 520 includes three row.First row corresponds to data source schema element.3rd row Corresponding to work pattern element, and secondary series captures correspondence relationship information.Using dataset schema element, work pattern element and Schema elements are the visual indicators 522 of mismatch or matching to fill row.
Here, first row corresponds to the schema elements of " population in the world statistics " data set, and the 3rd row are corresponding to being directed to The expected and acceptable pattern of " purchase is recommended in game " operation.Include schema elements title and data type per a line, Together with the finger that there is mismatch (being represented by the letter " x " surrounded by line) or matching (being represented by the tick boxes surrounded by line) Show.In the row shown by five, two rows show the mismatch between schema elements.Especially, " integration " is illustrated as and " game Score " is mismatched, and " score " is illustrated as mismatching with " Xbox integrations ", all these to have style number.In such case Under, " integration " is matched with " Xbox integrations ", and " score " is matched with " game points ".In order to correct the mismatch, Yong Huke To select " integration " element as shown in 540, drag and drop include the element in the row of " Xbox integrations " element.The result quilt of the action There is provided in figure 6.
Fig. 6 is the screenshot capture at the visual creation interface 600 that can be produced by visual authoring system 100.With Fig. 5 interface 500 is similar, and interface 600 includes source panel 410, working space panel 420 and pattern match panel 510.Here, pattern match face Plate by the way that " integration " element is dragged and dropped into the position occupied by " score " element in the row including " Xbox integrations " element and Produce.Note, placement of " integration " element in as the point occupied by " score " element causes " score " element to replace including " trip " Xbox integrations " element in the row of play score ".Present table 520, which is updated to reflect, causes all the new of schema elements matching to be reflected Penetrate.Therefore, using a simple posture (i.e. drag and drop), mismatch is solved.Then, user can be connect with select button 530 Mapped, and enabled by being successfully processed that the operation is carried out by this.
Aforementioned system, framework, environment etc. are described on the interaction between several components.It should be appreciated that such System and component can include some components in those components or sub-component, specified component or sub-component for wherein specifying Or sub-component, and/or add-on assemble.Sub-component can also be implemented as being communicably coupled to other assemblies without being included in Component in parent component.In addition, one or more assemblies and/or sub-component can be combined into single component to provide polymerization work( Energy.Communication between system, component and/or sub-component can be according to pushing away and/or draw model is realized.Component can also with order to Do not specifically describe herein for purpose of brevity and but one or more of the other component interaction well known by persons skilled in the art.
In addition, the various pieces of system disclosed above and following method can be included or using artificial intelligence, machine Component, sub-component, process, means, method or the mechanism of device study or knowledge based or rule are (for example, SVMs, god Through network, expert system, bayesian belief network, fuzzy logic, data fusion engines, grader ...).Such component is outstanding It can make some mechanism thus performed or cross process automation, with cause a part for system and method have more adaptability and It is efficiently and intelligent.It is unrestricted as example, learning object 330 can using such mechanism come based on previous interaction and Other contextual informations are mismatched or matched to determine or infer.
In view of above-mentioned example sexual system, may be referred to Fig. 7-9 flow chart, more fully understand can according to disclosed theme In the method for realization.While for purposes of simplicity of explanation, method is shown and described as a series of frame, it is to be appreciated that and , it is realized that theme claimed is not limited by the order of the blocks because some frames can occur in a different order and/or Occur simultaneously with other frames depicted and described herein.Furthermore, it is possible to not need all frames shown to be described below to realize Method.
With reference to Fig. 7, detection and the unmatched method 700 of solution pattern are shown.At reference marker 710, receive and indicate Signal of the data set (for example, separate data source, output ... of operation) to the connection of operation.Authoring environment is illustrated in interactive mode In, when the expression of data set is connected to the expression of operation by user (such as by draw connection both line), signal can be with It is generated and is then received.
After signal is received, schema elements matching and mismatch can be determined.More specifically, by data set mould The element of formula and the element of work pattern are compared, and work pattern description is for the desired mould of the data inputted to operation Formula.It can be compared between the feature (for example, title, type ...) of element, it is optimal between element to attempt to find " matching " or corresponding relation.The element of confidence measure (intensity of the corresponding relation between expressive element) with higher than threshold value can To be classified as matching.The element of confidence metric with less than threshold value can be classified as mismatch.In other words, pattern Matching between element may be performed that the intensity based on corresponding relation come the best match between markers element, and If the intensity of subsequent corresponding relation is less than the predetermined threshold for matching, matching can be re-classified as mismatching.
At mark 730, mismatch and optional matching is shown., can be in working space according to one embodiment The display of in context or original place is mismatched so that user need not be by context or focus from a windows exchange to another window Mouthful to check that flowing water line chart and pattern mismatch both.According to specific embodiment, can delivery mode information in a tabular form, its Middle first row includes the element of dataset schema, and the 3rd row are included between the element of work pattern, and first row and the 3rd row Secondary series indicator element whether match.Indicate to match based on various factors in addition, confidence metric can be shown with Intensity instruction.
At reference marker 740, receive and change input on unmatched.In other words, receive with carrying out at least one Change and mismatch associated signal to remedy.In one embodiment, it can be presented in interactive graphics user interface Pattern is mismatched so that can be changed to solve to mismatch.For example, user can be by schema elements from first position drag and drop To the second place corresponding with matching.It is alternatively possible to draw line with the schema elements of matching connection.Regardless of realizing, User can use simple posture to carry out execution pattern matching, such as in the case where Auto-matching is not carried out.This simplifies The process of pattern match, because to eliminate extensive work for the automatic mode of user.If unmatched problem is present, User can be showed to be solved mismatch.
The method that Fig. 8 depicts pattern match 800.At reference marker 810, receive, the pattern associated with data set Received, retrieve or otherwise obtain or obtain.If it is available, then dataset schema can be acquired from source.Alternatively Ground, can be automatically determined based on the analysis of data set or infer the pattern.At mark 820, for the work pattern of input Received, retrieve or otherwise obtain or obtain.Work pattern capture is for the desired mould of the data inputted to operation Formula.The pattern can be acquired from operation or be automatically determined or infer from operation.The mark data integrated mode at 830 With the coupling element of work pattern.The shape or structure of pattern definition data, and schema elements are the parts of pattern, it can be with Including title and type etc..Coupling element is included based on various schema elements feature or characteristic (such as element term) come at two Same or analogous element is identified between pattern.Similitude can be measured in a variety of ways, and similarity can be with Change.It therefore, it can set up threshold value, it defines when it is matching (such as when similarity is more than predetermined value).In reference marker At 840, mismatch element and be identified.Mismatch the member that element is the threshold value of the similitude or confidence level that do not meet matching presence Element.Therefore, the matching for not meeting predetermined threshold is identified as to mismatch.
Fig. 9 is the flow chart for the method 900 classified to pattern match.At reference marker 910, type is performed Match somebody with somebody.Here, including same data type, the element for example from dataset schema and work pattern is determined.With this side Formula, can perform the citation form of matching based on data types such as character string, numeral, date-times.By Fig. 4 and figure In example shown in 5 screenshot capture, " integration " may be matched with " game points ", because they all have style number.
At mark 920, name-matches are performed.In this case, the element from two patterns is analyzed, to identify tool There are the schema elements of same or similar title.Whether synonymicon can be created and for understanding two elements because of them It is related with identical title or title synonym.In the screenshot capture on Fig. 4 and Fig. 5, " Xbox integrations " is " product Point " synonym, and " game points " are the synonyms of " score ".Using the name-matches in addition to type matching with only adopting With only a kind of matching in type matching or name-matches compared to the more fine-grained matching identification of offer.
At reference marker 930, using machine learning come auxiliary matched.Machine learning can be learned based on previous interaction Practise the relation between element.If for example, user previously with system interaction or is otherwise indicated that the schema elements of data set Or field " score " is mapped to the schema elements or field " game points " of operation, then this can be recorded and then when identical User or different user are sought to be utilized when data source is connected into operation.In addition, learning object can summarize the fact so that " score " and " game points " is interpreted synonym, or even further to include any schema elements of " score " Title is considered as synonym.
At reference marker 940, based on predetermined confidence threshold value by element classification is matching or mismatches.More specifically, It can determine to be assigned to the plain confidence metric of every constituent element based on one or more in type, title or machine learning. For example, have a case that the elements of same type and same names with element only share one of type or title compared to will with Higher levels of confidence level with presence.It is less than title furthermore, it is possible to be assigned to the element of title that is shared similar but differing The confidence metric of confidence metric in the case of identical.Similarly, can to shared related or equivalent data type and It is not that the element of same data type assigns the confidence metric for being less than the confidence metric for being assigned to same type.Generally, Confidence metric or score can be assigned for each " matching " element set.Then, it can be incited somebody to action based on predetermined confidence threshold value Element set is categorized as matching or mismatched.For example, the element of the confidence metric with percent 70 (70%) or bigger Matching can be classified as to (meaning that system is that percent 70 (70%) is determined or more determines that matching is present), and had Confidence metric less than 70 percent to that will be classified as mismatch.In other words, the intensity of the matching between element Determine that it is considered as matching or mismatched.Certainly, other classification are possible, and threshold value can change.As an example, Element indicates that can be assigned the color of confidence level measurement, including wherein confidence metric is more than percent 80 (80%) Green, yellow of its vacuum metrics between percent 80 (80%) and percent 50 (50%), and its vacuum metrics are less than The red of percent 50 (50%).
Operation is connected to perform conversion on data set with data set.However, the pattern of data set and the desired frame of operation Generally had differences between structure.These differences can not be solved, Job execution may fail or the wrong result of generation.This can be with Solved by calculating the measurement of the corresponding relation intensity between data set and the schema elements of operation.Then, corresponding relation Intensity can be compared with one or more predetermined thresholds, matched and mismatched with automatic mark.Then can be to user Notification mode is mismatched.Further it is provided that a kind of mechanism, enables a user to solve the mismatch between schema elements.Example Such as, user can be interacted by the expression of the schema elements with being presented using graphic user interface come correctly mapped mode member Element.As a result, never match pattern comes into force for match pattern state change.
Theme, which is disclosed, to be supported to perform or be configured as the various products for performing the various actions for mismatching detection on pattern And process.Followed by one or more illustrative methods and system.
A kind of method is included in the expression of the data set presented on the display in the Part I at interface on working space, The working space is configured as support and streamline is created using chart, and wherein output of the streamline including wherein the first operation can Selection of land provides the set of one or more related operations of input to the second operation;In response to the table of the data set on working space Show that the connection of the expression of operation mismatches the execution detected to initiate pattern;And the display in the Part II at interface The upper one or more patterns presented between data set and operation are unmatched to be represented.This method is also included by by data set The measurement of corresponding relation intensity between schema elements and the schema elements of operation is compared to execution pattern with predetermined threshold Mismatch detection.This method is additionally included on display is presented the one or more of dataset schema in the Part III at interface The expression of one or more elements of element and work pattern.This method is additionally included in the display in the Part III of interface The first element in upper one or more elements that dataset schema is presented and one or more elements from work pattern The visual instruction of corresponding relation intensity between second element.This method is also included based on corresponding relation intensity and predetermined threshold Compare unmatched visual instruction is presented.This method also includes being based on corresponding relation intensity and predetermined threshold or different predetermined thresholds The comparison of value is presented the visual instruction of matching.This method also includes receiving corrects one or many with the Part II on interface One of individual mismatch associated signal.This method also includes the signal for receiving the unmatched first mode element of selection, by the One schema elements are dragged from its home position, and first mode element is placed on to the target position occupied by second mode element In putting.This method also includes the home position that second mode element is automatically moved to first mode element.
A kind of method, which includes using, to be configured as performing at least the one of the computer executable instructions of storage in memory Individual processor performs following action:Detected input data set to the company of operation by means of graphic user interface working space Connect, graphic user interface working space is configured as supporting the diagram creation of streamline, streamline includes wherein the first operation Output alternatively to the second operation provide input one or more related operations set;By by data set and operation The measurement of corresponding relation intensity between schema elements is compared to determine the pattern member of data set and operation with predetermined threshold One or more mismatches between element;And one or more mismatches are presented in the context of working space.This method Also include based on data type or title relatively at least one determine the corresponding relation intensity between schema elements.The party Method also includes relatively identifying input data set and operation based on corresponding relation intensity and predetermined threshold or different predetermined thresholds Schema elements between one or more matchings.This method also includes visually distinguishing matching with mismatching.This method is also wrapped Include reception signal the schema elements of input data set are assigned to the different mode element of operation.Methods described also includes receiving Selection, drag operation and placement operation on the schema elements of the visual representation of one of one or more mismatches.
System includes being coupled to the processor of memory, and the processor is configured as performing storage in memory following Computer can perform component:First assembly, is configured as that the visual working space for being used for diagrammatically creating streamline, the stream is presented The set for one or more related operations that the output that waterline includes wherein the first operation is alternatively inputted to the second operation offer; The connection of the expression of operation is arrived in second component, the expression for the data set being configured to respond on working space, based on data set The mould of the measurement of corresponding relation intensity between the schema elements of operation and the comparison of predetermined threshold, mark data collection and operation One or more patterns between formula element are mismatched;And the 3rd component, it is configured as presenting between data set and operation One or more patterns are mismatched.3rd component is additionally configured to present in the context of working space and mismatched.Second group Part is additionally configured to data type and title at least based on schema elements to determine the schema elements of data set and the mould of operation Corresponding relation intensity between formula element.The system also includes being configured as enabling mismatching graphically again referring to for schema elements 4th component of group.4th component is additionally configured to support the drag and drop of the visual representation with mismatching schema elements to interact.
System includes being coupled to the processor of memory, and the processor is configured as performing storage in memory following Computer can perform component:First assembly, is configured as detecting input data by means of graphic user interface working space Collect the connection of operation, graphic user interface working space is configured as supporting the diagram creation of streamline, and streamline includes The output of wherein the first operation alternatively provides the set of one or more related operations of input to the second operation;Second group Part, is configured as by the way that the measurement of the corresponding relation intensity between data set and the schema elements of operation and predetermined threshold are carried out Compare to determine one or more mismatches between data set and the schema elements of operation;And the 3rd component, it is configured as One or more mismatches are presented in the context of working space.System also includes being configured as being based on data type or title At least one compared determines the component of the corresponding relation intensity between schema elements.The system also includes being configured as base In corresponding relation intensity the schema elements of input data set and operation are identified from the comparison of predetermined threshold or different predetermined thresholds Between one or more matchings component.The system also includes being configured as receiving signal so that the pattern of input data set is first Element is assigned to the component of the different mode element of operation.The system also includes being configured as receiving mismatching on one or more One of the selection of schema elements of visual representation, the component of drag operation and placement operation.
Word " exemplary " or its various forms are used to mean to serve as example, example or explanation herein.It is described herein Any aspect or design for " exemplary " are not necessarily to be construed as preferably or more favourable than other aspects or design.In addition, example It is provided merely for the purpose being aware and understood, and is not intended to be limiting in any manner or constrains theme claimed Or the relevant portion of the disclosure.It should be appreciated that the countless additionally or alternatively examples of various scopes may be presented, but go out It has been omitted in succinct purpose.
As it is used herein, term " component " and " system " and its various forms are (for example, component, system, subsystem System ...) it is intended to refer to computer related entity, hardware, the combination of hardware and software, software or executory software.Example Such as, component can be but not limited to run on a processor process, processor, object, example, executable file, perform Thread, program and/or computer.As explanation, both the application run on computers and computer can be components.One Individual or multiple components be may reside within the thread of process and/or execution, and component can be localized in a computer Go up and/or be distributed between two or more computers.
The connection "or" used in this specification and in the appended claims be intended to mean to show the "or" that includes and It is not exclusive "or", unless otherwise dictated from context or clearly.In other words, " X " or " Y " be intended to mean " X " and Any inclusive arrangement of " Y ".If for example, " ' A ' use ' X ' ", " ' A ' uses ' Y ' " or " ' A ' use ' X ' and ' Y ' two Person ", then in the case of any of above, " ' A ' uses ' X ' or ' Y ' " is satisfied.
In addition, term " comprising ", "comprising", " having ", " containing " or variant with its form be used in detailed description or In the sense that in claims, so that when being used as transition word in the claims, " comprising " is explained with term " comprising " Similar fashion, these terms are intended to inclusive.
In order to provide the context for theme claimed, Figure 10 and following discussion are aimed to provide and wherein may be used With brief, the general description of the proper environment of the various aspects of realizing theme.However, suitable environment is only example, and It is not intended to imply that to any limitation using scope or function.
Although the general context of the computer executable instructions for the program that can be run on one or more computers Described in disclosed systems above and method, it will be recognized to those skilled in the art that these aspect can also be with other journeys Sequence module etc. combines to be implemented.Generally, program module includes performing particular task and/or realizes particular abstract data type Routine, program, component, data structure etc..Further, it will be understood by those skilled in the art that can be under unified central planning using various departments of computer science Put to implement the systems and methods, including uniprocessor, multiprocessor or polycaryon processor computer system, miniature calculating are set Standby, mainframe computer and personal computer, handheld computing device are (for example, personal digital assistant (PDA), phone, hand Table ...), based on microprocessor or programmable consumer or industrial electrical equipment etc..Aspect can also be in Distributed Calculation ring Implement in border, wherein task by the remote processing devices of communication network links by being performed.However, theme claimed Some (if not all) aspects can be implemented on stand-alone computers.In a distributed computing environment, program module can Be located locally with the one or both in remote memory storage devices.
With reference to Figure 10, show exemplary general computer or computing device 1002 (for example, desktop computer, on knee Computer, flat board, wrist-watch, server, hand-held, programmable consumer or industrial electrical equipment, set top box, games system, calculating Node ...).Computer 1002 includes one or more processors 1020, memory 1030, system bus 1040, Large Copacity and deposited Store up equipment 1050 and one or more interface modules 1070.System bus 1040 is at least communicatively coupled said system composition portion Point.It will be appreciated, however, that in its simplest form, computer 1002 can include be coupled to memory 1030 one or Multiple processors 1020, its component for performing the executable action of various computers, instructing and/or being stored in memory 1030.
Processor 1020 can using general processor, digital signal processor (DSP), application specific integrated circuit (ASIC), Field programmable gate array (FPGA) or other PLDs, discrete gate or transistor logic, discrete hardware components or It is designed to perform any combinations of functionality described herein to realize.General processor can be microprocessor, but standby Selection of land, processor can be any processor, controller, microcontroller or state machine.Processor 1020 can also be implemented as Combination, multi-microprocessor, polycaryon processor and the conjunction DSP core hearty cord of the combination of computing device, such as DSP and microprocessor The one or more microprocessors of conjunction or any other such configuration.In one embodiment, processor can be at figure Manage device.
Computer 1002 can include or otherwise interact to promote computer with various computer-readable mediums The one or more aspects of theme claimed are realized in 1002 control.Computer-readable medium can be can be by counting Any usable medium that calculation machine 1002 is accessed, and including volatibility and non-volatile media, and it is detachable and non-dismountable Medium.Computer-readable medium can include two kinds of different and mutually exclusive types, i.e. computer-readable storage medium and communication Medium.
Computer-readable storage medium is included for such as computer-readable instruction, data structure, program module or other numbers According to information storage any method or technique realize volatibility and non-volatile, detachable and non-dismountable medium.Meter Calculation machine storage medium includes storage device, and such as memory devices are (for example, random access memory (RAM), read-only storage (ROM), Electrically Erasable Read Only Memory (EEPROM) ...), magnetic storage apparatus (for example, hard disk, floppy disk, cassette, Tape ...), CD (for example, compact disk (CD), digital versatile disc (DVD) ...) and solid condition apparatus are (for example, solid-state is hard Disk (SSD), flash drive (for example, card, rod, key drive ...) ...) or any other similar medium, it is deposited Storage is (with transmitting or communicating relative) by the addressable desired information of computer 1002.Therefore, computer-readable storage medium is excluded Modulated data-signal.
Communication media embodies computer-readable instruction, data structure, program module or such as carrier wave or other transmission mechanisms Modulated data-signal in other data, and including any information transmitting medium.Term " modulated data letter Number " refer to the signal that makes one or more of its feature be set or change in the way of encoding information onto in the signal. Unrestricted as example, communication media includes the wire medium of such as cable network or direct wired connection, and such as sound , RF, the wireless medium of infrared and other wireless mediums.
Memory 1030 and mass-memory unit 1050 are the examples of computer-readable recording medium.Set depending on calculating Standby exact configuration and type, memory 1030 can be volatibility (for example, RAM), non-volatile (for example, ROM, dodge Deposit ...) or both certain combination.As an example, including for all elements as during start-up in computer 1002 Between the basic input/output (BIOS) of basic routine of transmission information can be stored in nonvolatile memory, And volatile memory can serve as external cache, with processing of promoting processor 1020 etc..
Mass-memory unit 1050 includes being used for storing the detachable/non-disconnectable of mass data relative to memory 1030 Unload, volatile/nonvolatile computer storage media.For example, mass-memory unit 1050 include but is not limited to one or Multiple equipment, such as disk or CD drive, floppy disk, flash memory, solid-state drive or memory stick.
Memory 1030 and mass-memory unit 1050 can include operating system 1060, one or more applications 1062nd, one or more program modules 1064 and data 1066, or it is stored in wherein.Operating system 1060 act as control The resource of system and distribution computer 1002.Include the one or both in system and application software using 1062, and can lead to Cross program module 1064 and be stored in the data 1066 in memory 1030 and/or mass-memory unit 1050 to utilize operation System 1060 is to the management of resource to perform one or more actions.Therefore, can be according to the logic thus provided using 1062 All-purpose computer 1002 is converted into special purpose machinery.
The all or part of theme claimed can use standard program and/or engineering technology with produce software, Firmware, hardware or its any combinations are realized, disclosed function is realized with control computer.It is unrestricted as example, Visual authoring system 100 or part thereof can be the part using 1062, or composition is using 1062 part, and including One or more of memory and/or mass-memory unit 1050 module 1064 and data 1066 are stored in, its function is worked as It can be implemented when being performed by one or more processors 1020.
According to a specific embodiment, processor 1020 can correspond to on-chip system (SOC) or similar architecture, Include on single integrated circuit substrate or in other words integrated hardware and software.Here, processor 1020 can include one or Multiple processors and it is at least similar to memory of processor 1020 and memory 1030 etc..Conventional processors include minimum Hardware and software, and extensively depend on external hardware and software.By contrast, the SOC realizations of processor are more powerful, because Hardware and software is embedded by it, hardware and software enable with to outside hardware and software it is minimum rely on or independent of The specific function of external hardware and software.For example, visual authoring system 100 and/or associated function can be embedded in SOC In the hardware of framework.
Computer 1002 also includes being communicably coupled to system bus 1040 and promoted and interact the one of computer 1002 Individual or multiple interface modules 1070.As an example, interface module 1070 can be port (for example, serial, parallel, PCMCIA, USB, FireWire ...) or interface card (for example, sound, video ...) etc..In an example implementation, interface module 1070 User's input/output interface can be embodied as, is enabled a user to by one or more input equipments (for example, such as The pointing device of mouse, trace ball, contact pilotage, touch pad, keyboard, microphone, control stick, handle, satellite antenna, scanner, phase Machine, other computers ...), it will for example be ordered by means of one or more postures or phonetic entry and be input to computer with information In 1002.In another example implementation, interface module 1070 can be embodied as peripheral interface, output is fed to aobvious Show device (for example, LCD, LED, plasma ...), loudspeaker, printer and/or other computers.Further, interface group Part 1070 can be embodied as network interface, to enable such as by wired or wireless communication link and other computing devices The communication of (not shown).
Content already described above includes the example of the aspect of theme claimed.Certainly, wanted for description Seek the purpose of the theme of protection and can not possibly describe each of component or method it is contemplated that combination, but ordinary skill people Member will recognize that many further combinations and permutations of disclosed theme are possible.Therefore, disclosed theme purport Covering to fall all such changes, modifications and variations in the spirit and scope of the appended claims.

Claims (10)

1. a kind of method, including:
Use be configured as performing at least one processor of storage computer executable instructions in memory perform with Lower action:
Detected input data set to the connection of operation, Graphic User circle by means of graphic user interface working space Face working space is configured as supporting the diagram creation of streamline, and the streamline includes the output of wherein the first operation alternatively The set of one or more related operations of input is provided to the second operation;
By the way that the measurement of the corresponding relation intensity between the data set and the schema elements of the operation is entered with predetermined threshold Row relatively determines one or more mismatches between the data set and the schema elements of the operation;And
One or more of mismatches are presented in the context of the working space.
2. according to the method described in claim 1, in addition to based on data type or title relatively at least one of determine Corresponding relation intensity between the schema elements.
3. according to the method described in claim 1, in addition to based on the corresponding relation intensity and the predetermined threshold or difference The comparison of predetermined threshold identifies one or more matchings between the input data set and the schema elements of the operation.
4. method according to claim 3, in addition to matching is visually distinguished with mismatching.
5. according to the method described in claim 1, in addition to receive signal with by the schema elements of the input data set assign Different mode element to the operation.
6. method according to claim 5, in addition to receive the visual table on one of one or more of mismatches Selection, drag operation and the placement operation for the schema elements shown.
7. a kind of system, including:
The processor of memory is coupled to, the processor is configured as performing the following computer being stored in the memory Executable component:
First assembly, is configured as that the visual working space for being used for diagrammatically creating streamline is presented, the streamline includes it In the first operation output alternatively to the second operation provide input one or more related operations set;
Second component, is configured to respond to the expression of the data set on the working space to the company of the expression of the operation Connect, measurement and the comparison of predetermined threshold based on the corresponding relation intensity between data set and the schema elements of operation, identify institute The one or more patterns stated between data set and the schema elements of the operation are mismatched;And
3rd component, is configured as presenting one or more of patterns between the data set and the operation and mismatches.
8. system according to claim 7, the 3rd component is additionally configured in the context of the working space Described mismatch is presented.
9. system according to claim 7, second component is additionally configured at least data class based on schema elements Type and title determine the corresponding relation between the schema elements of the data set and the schema elements of the operation Intensity.
10. system according to claim 7, in addition to the 4th component, are configured as enabling the figure for mismatching schema elements Shape is reassigned.
CN201580062833.3A 2014-11-21 2015-11-18 Automatic mode mismatches detection Pending CN107077505A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US14/550,856 2014-11-21
US14/550,856 US10684998B2 (en) 2014-11-21 2014-11-21 Automatic schema mismatch detection
PCT/US2015/061209 WO2016081531A1 (en) 2014-11-21 2015-11-18 Automatic schema mismatch detection

Publications (1)

Publication Number Publication Date
CN107077505A true CN107077505A (en) 2017-08-18

Family

ID=54754809

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580062833.3A Pending CN107077505A (en) 2014-11-21 2015-11-18 Automatic mode mismatches detection

Country Status (7)

Country Link
US (1) US10684998B2 (en)
EP (1) EP3221786A1 (en)
JP (1) JP2018507450A (en)
CN (1) CN107077505A (en)
BR (1) BR112017008453A2 (en)
RU (1) RU2017117425A (en)
WO (1) WO2016081531A1 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108713205B (en) * 2016-08-22 2022-11-11 甲骨文国际公司 System and method for automatically mapping data types for use with a data stream environment
US11093516B2 (en) 2016-09-20 2021-08-17 Microsoft Technology Licensing, Llc Systems and methods for data type identification and adjustment
US10681164B2 (en) * 2018-05-03 2020-06-09 Microsoft Technology Licensing, Llc Input and output schema mappings
US10860602B2 (en) 2018-06-29 2020-12-08 Lucid Software, Inc. Autolayout of visualizations based on contract maps
US10860603B2 (en) 2018-06-29 2020-12-08 Lucid Software, Inc. Visualization customization
US11232139B2 (en) * 2018-06-29 2022-01-25 Lucid Software, Inc. Custom interactions with visualizations
SG11202108731TA (en) 2019-02-22 2021-09-29 Lucid Software Inc Reversible data transforms
US11100173B2 (en) 2019-06-18 2021-08-24 Lucid Software, Inc. Autolayout of visualizations based on graph data
US11169671B2 (en) 2019-11-26 2021-11-09 Lucid Software, Inc. Alteration of a source data visualization based on user input
US11263105B2 (en) 2019-11-26 2022-03-01 Lucid Software, Inc. Visualization tool for components within a cloud infrastructure
US11080484B1 (en) * 2020-10-08 2021-08-03 Omniscient Neurotechnology Pty Limited Natural language processing of electronic records
US11567735B1 (en) * 2020-10-19 2023-01-31 Splunk Inc. Systems and methods for integration of multiple programming languages within a pipelined search query
US20220121640A1 (en) * 2020-10-21 2022-04-21 Western Digital Technologies, Inc. Emulation of relational data table relationships using a schema
US11625367B1 (en) * 2022-06-08 2023-04-11 Snowflake Inc. Schema evolution

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080021912A1 (en) * 2006-07-24 2008-01-24 The Mitre Corporation Tools and methods for semi-automatic schema matching
CN102279737A (en) * 2010-06-02 2011-12-14 埃森哲环球服务有限公司 System and method for analytic process design
CN102722542A (en) * 2012-05-23 2012-10-10 无锡成电科大科技发展有限公司 Resource description framework (RDF) graph pattern matching method
CN102792298A (en) * 2010-01-13 2012-11-21 起元技术有限责任公司 Matching metadata sources using rules for characterizing matches

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6483524B1 (en) 1999-10-01 2002-11-19 Global Graphics Software Limited Prepress workflow method using raster image processor
US6996268B2 (en) * 2001-12-28 2006-02-07 International Business Machines Corporation System and method for gathering, indexing, and supplying publicly available data charts
US7343377B1 (en) * 2003-07-07 2008-03-11 Unisys Corporation Method and system for verifying the integrity of a database
US20050262190A1 (en) 2003-08-27 2005-11-24 Ascential Software Corporation Client side interface for real time data integration jobs
US7403956B2 (en) * 2003-08-29 2008-07-22 Microsoft Corporation Relational schema format
US7415517B1 (en) * 2004-02-11 2008-08-19 Versata Development Group, Inc. Developing session context from nonlinear web site flow records
US7707218B2 (en) * 2004-04-16 2010-04-27 Mobot, Inc. Mobile query system and method based on visual cues
US20050257193A1 (en) * 2004-05-13 2005-11-17 Alexander Falk Method and system for visual data mapping and code generation to support data integration
US20060218158A1 (en) * 2005-03-23 2006-09-28 Gunther Stuhec Translation of information between schemas
US20140236722A1 (en) * 2005-04-08 2014-08-21 Marshall Feature Recognition Llc System And Method For Accessing Electronic Data Via An Image Search Engine
US7739292B2 (en) 2005-09-28 2010-06-15 Altova Gmbh System and method for modeling and managing enterprise architecture data and content models and their relationships
US8234312B2 (en) 2006-02-28 2012-07-31 Sap Ag Schema mapping and data transformation on the basis of layout and content
US20080126987A1 (en) 2006-09-19 2008-05-29 International Business Machines Corporation Graphical representation of compatible workflow steps
US8041746B2 (en) * 2007-10-30 2011-10-18 Sap Ag Mapping schemas using a naming rule
US20100235725A1 (en) 2009-03-10 2010-09-16 Microsoft Corporation Selective display of elements of a schema set
US8386493B2 (en) 2010-09-23 2013-02-26 Infosys Technologies Limited System and method for schema matching
US9406037B1 (en) * 2011-10-20 2016-08-02 BioHeatMap, Inc. Interactive literature analysis and reporting
US8583626B2 (en) 2012-03-08 2013-11-12 International Business Machines Corporation Method to detect reference data tables in ETL processes
GB2505938A (en) 2012-09-17 2014-03-19 Ibm ETL debugging

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080021912A1 (en) * 2006-07-24 2008-01-24 The Mitre Corporation Tools and methods for semi-automatic schema matching
CN102792298A (en) * 2010-01-13 2012-11-21 起元技术有限责任公司 Matching metadata sources using rules for characterizing matches
CN102279737A (en) * 2010-06-02 2011-12-14 埃森哲环球服务有限公司 System and method for analytic process design
CN102722542A (en) * 2012-05-23 2012-10-10 无锡成电科大科技发展有限公司 Resource description framework (RDF) graph pattern matching method

Also Published As

Publication number Publication date
WO2016081531A1 (en) 2016-05-26
US20160147796A1 (en) 2016-05-26
RU2017117425A (en) 2018-11-19
JP2018507450A (en) 2018-03-15
US10684998B2 (en) 2020-06-16
BR112017008453A2 (en) 2017-12-26
EP3221786A1 (en) 2017-09-27

Similar Documents

Publication Publication Date Title
CN107077505A (en) Automatic mode mismatches detection
US10719301B1 (en) Development environment for machine learning media models
US11182401B1 (en) Digital processing systems and methods for multi-board mirroring with automatic selection in collaborative work systems
Freitag et al. Strategies employed by citizen science programs to increase the credibility of their data
US11537506B1 (en) System for visually diagnosing machine learning models
US9886669B2 (en) Interactive visualization of machine-learning performance
US10417492B2 (en) Conversion of static images into interactive maps
BR102018009859A2 (en) METHOD AND SYSTEM FOR DATA-BASED OPTIMIZATION OF PERFORMANCE INDICATORS IN MANUFACTURING AND PROCESS INDUSTRIES
CN106204522A (en) The combined depth of single image is estimated and semantic tagger
US11093702B2 (en) Checking and/or completion for data grids
US9122995B2 (en) Classification of stream-based data using machine learning
US20180300333A1 (en) Feature subset selection and ranking
CN103810493A (en) Method and apparatus for identifying mathematical formula
US10853732B2 (en) Constructing new formulas through auto replacing functions
US20090204703A1 (en) Automated document classifier tuning
US20210279618A1 (en) System and method for building and using learning machines to understand and explain learning machines
US20210281469A1 (en) System for decomposing events that includes user interface
JP2015011641A (en) Apparatus and method of creating image processing filter
CN103294805A (en) Creation method and device for data warehouse personalized dimension table
US20240078473A1 (en) Systems and methods for end-to-end machine learning with automated machine learning explainable artificial intelligence
CN110362767A (en) Bury a processing method, device, system and computer readable storage medium
US11960794B2 (en) Seamless three-dimensional design collaboration
US20230297831A1 (en) Systems and methods for improving training of machine learning systems
CN117730347A (en) Automatic generation of one or more machine vision jobs based on a region of interest (ROI) of a digital image
CN111754486B (en) Image processing method, device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination