WO2002005209A9 - Method and apparatus for visualizing complex data sets - Google Patents

Method and apparatus for visualizing complex data sets

Info

Publication number
WO2002005209A9
WO2002005209A9 PCT/US2001/021807 US0121807W WO0205209A9 WO 2002005209 A9 WO2002005209 A9 WO 2002005209A9 US 0121807 W US0121807 W US 0121807W WO 0205209 A9 WO0205209 A9 WO 0205209A9
Authority
WO
WIPO (PCT)
Prior art keywords
virtual
spot
array
data
well
Prior art date
Application number
PCT/US2001/021807
Other languages
French (fr)
Other versions
WO2002005209A2 (en
WO2002005209A3 (en
Inventor
Seth Taylor
Eric J Stuckey
Original Assignee
Molecularware Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Molecularware Inc filed Critical Molecularware Inc
Priority to AU2001273343A priority Critical patent/AU2001273343A1/en
Publication of WO2002005209A2 publication Critical patent/WO2002005209A2/en
Publication of WO2002005209A3 publication Critical patent/WO2002005209A3/en
Publication of WO2002005209A9 publication Critical patent/WO2002005209A9/en

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/30Microarray design
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/20Drawing from basic elements, e.g. lines or circles
    • G06T11/206Drawing of charts or graphs
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B45/00ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks

Definitions

  • the invention relates to the field of data management and, more specifically, to the field of visualizing complex, voluminous data sets.
  • HTS high-throughput screening
  • microassay samples have volumes in a range of picoliters to milliliters.
  • experiments done in microassays require liquid-handling steps to mix reagents in strict quantities and specified orders.
  • these HTS liquid-handling robots are employed to automate the preparation of thousands of microassay samples, often using microarrays, microplates, or DNA Chips. Data is often collected in the form of images for the thousands of samples, further increasing the difficult of managing experiments.
  • the invention relates to a virtual microplate corresponding to a plurality of physical microplate.
  • the virtual microplate includes a first virtual well and a second virtual well.
  • the first virtual well is associated with a first physical well of a first physical microplate.
  • the second virtual well is associated with a second physical well of a second physical microplate.
  • the physical microplates to which the virtual wells correspond may be geographically remote from one another.
  • each virtual well has a number of attributes associated with it that indicates information corresponding to the physical well to which the virtual well corresponds.
  • the present invention relates to a method for visualizing data.
  • a first data array and a second data array are provided, each of the data arrays have a plurality of locations which includes a plurality of attributes associated with each location.
  • a virtual data array is generated which includes a plurality of locations, each location associated with a location from the first or second data and each location including a plurality of attributes associated with the location in the first or second data and each location including a plurality of attributes associated with the location in the first or second data array.
  • Fig. 1 is a highly diagrammatic block diagram of a research environment in which the present invention may be used.
  • Fig. 2 is a block diagram depicting one embodiment of a virtual microarray.
  • each individual sample in a microplate is referred to as a "well” and each microplate may have 96, 384, or 1536 wells.
  • Individual areas on a microarray 106 which receive biological sample material are referred to as "spots.” The number of spots on each microarray 106 is limited solely by the size of the physical media comprising the microarray 106 and the proximity with which biological samples may be placed to one another by the microarray robot 102.
  • software implementing the present invention may run directly on a microarray robot 102.
  • software implementing the present invention may run on a computing node (not shown) that is in communication with the microarray robot 102.
  • the computing node may be any personal computer (e.g., 286, 386, 486, Pentium, Pentium II, Macintosh computer), Windows-based terminal, Network Computer, wireless device, information appliance, RISC Power PC, X-device, workstation, mini computer, main frame computer or other computing device.
  • Windows-oriented platforms supported by the computing node can include Windows 3.x, Windows 95, Windows 98, Windows NT 3.51, Windows NT 4.0, Windows CE, Macintosh, Java, Unix and Windows 2000.
  • the computing node can include a display screen, a keyboard, memory for storing downloaded application programs, a processor, and a mouse.
  • the memory can provide persistent or volatile storage.
  • the computing node may be provided as a personal digital assistant (PDA), such as the Palm series of PDAs, manufactured by Palm, Inc. of Santa Clara, California.
  • PDA personal digital assistant
  • the computing node may communicate with the microarray robot 102 using infrared links.
  • the computing personal computer node is a personal computer including a PENTIUM III processor (or better) using MICROSOFT WINDOWS 4.0 or MICROSOFT WINDOWS 2000 (or better).
  • the computing node includes, or has access to, a database for storing information relating to individual spots and wells.
  • the database may be a relational database such as the Oracle 9i database, manufactured by Oracle Corporation of Breadwood Shores, California.
  • the database may be a flat filed database such as the JET database manufactured by Microsoft Corporation of Redmond, Washington.
  • the computing node may store information relating to wells and spots in a flat file or, delimited, -delimited format.
  • the database may be any OTBC-compliant database.
  • Fig. 2 depicts an embodiment of a virtual microarray 206. Each spot on virtual microarray 206 corresponds to a physical spot on a physical microarray 106.
  • the virtual microplates may also be constructed from one or more wells from a physical microplate 104.
  • the assignment of physical spots to virtual spots in virtual microarray 206 may be selected by the user, either by specifying a search criteria that operates on data representing the physical spots, or by manual selection.
  • the search results may form virtual microarray 206, which allows a user to combine physical spots from multiple physical microarrays 106 that share a common data characteristic, regardless of the geographic or temporal location of the physical spot.
  • virtual microarray 206 does not need to have the same geometric characteristics as the physical microarrays 106 from which it is constructed. Also as shown in Fig.
  • a base class representing a physical microarray 106 or microplate 104 provides: (1) an internal variable indicating whether the object represents a microplate 104 or a microarray 106; (2) one or more attributes of the physical microplate or microarray, which may be defined by the researcher; (3) information about the geometric layout of the physical microplate or microarray, such as the spacing between wells/spots, and whether wells/spots are grouped into sets;
  • geometrical information about each spot of each image associated with the rack such as detection status information, location, shape or contour, extents, compactness and confidence
  • a companion class may be created which represents virtual versions of physical microarrays or microplates.
  • the class representing the virtual microarrays or microplates is an extension of the base class described above.
  • the class representing the virtual microarrays or microplates is a separate class.
  • a class representing a virtual microarray 106 or microplate 104 provides: (1) an internal variable indicating whether the object represents a virtual microarray or a virtual microplate; (2) information about the geometric layout of the virtual microplate or microarray; (3) well/spot attributes for each virtual well/virtual spot, which includes information about the physical well/physical spot and physical microplate/physical microarray it represents; (4) the search criteria used to identify each well/spot.
  • a class representing a virtual microplate or microarray should support the following actions: creation; display of the virtual microplate or microarray; display of virtual spots or virtual wells; selection of virtual spots or virtual wells; search refinements; deletion of virtual wells and virtual spots; deletion of virtual microplates or microarrays; and saving and loading virtual microplates and microarrays to and from a database.
  • a virtual microplate or microarray should be created as the result of a selection of wells or spots either manually or by search process.
  • the dimensions of a virtual microplate or microarray i.e., rows and columns) should be computed (or recomputed) as necessary.
  • a virtual microplate or microarray should be displayed to the user in a fashion that is analogous to the display of a physical microarray or microplate or robot deck.
  • Individual spots and wells should be drawn to reflect a user-indicated properties through shading or color coding. For example, a researcher may have searched for all spots having an intensity above a minimum value and below a maximum value. Spots matching that search criteria can be displayed with a color map reflecting the actual intensity value.
  • Nirtual wells and virtual spots may be selected with standard mouse clicking and dragging. Attributes of those virtual wells or virtual spots should be displayed when a particular virtual well or virtual spot is selected.
  • virtual wells or virtual spots when virtual wells or virtual spots are selected they are highlighted and a graphical representation of the physical well or physical spot to which they correspond should also be displayed.
  • the researcher may use all or a subset of the virtual wells or virtual spots to refine a search and generate additional virtual microplates or microarrays as the result of the searches. For each search iteration, the search criteria and search results should be saved.
  • a researcher should be allowed to delete a virtual well or virtual spot by selecting a virtual well or virtual spot and indicating to the computing device that the virtual well or virtual spot should be deleted. When this occurs, the virtual well or virtual spot is removed from the virtual microplate or virtual microarray.
  • Nirtual microplates and microarrays should be able to be stored.
  • virtual microplate and microarrays are stored in a database.
  • a virtual array may have 1, 10, 20, 40, 80, 160, 250, 500, 1000, 10000, 100000, 1000000, 10000000 or more spots or wells.
  • a virtual array or plate contains a CArray (Microsoft Foundation Class construct) of spots or wells. C Arrays are indexed. by integers so in the 32-bit world the current maximum number of virtual spots or wells that we could have associated with a single virtual array or plate is I ⁇ TJViAX or 2147483647. In a 64-bit world, L ⁇ T_MAX for a virtual array is 4611686014132420609.
  • a virtual array may have 1, 10, 20, 40, 80, 160, 250, 500, 1000, 10000, 100000, 1000000, 10000000 or more attribute field linked to each spot or well of a virtual array.
  • By changing one line of code we can increase the number of attribute fields linked to a spot or well to practically any multiple of 32.
  • a virtual array may contain links back to spots or wells from 1, 10, 20, 40, 80, 160, 250, 500, 1000, 10000, 100000, 1000000, 10000000 or more different virtual arrays, microplates or microarrays.
  • Each spot or well in a virtual array may come from one well or spot of a linked virtual array, microarray or microplate.
  • a virtual array or plate contains a CArray (Microsoft Foundation Class construct) of spots or wells. CArrays are indexed by integers so in the 32-bit world the current maximum number of different microplates, microarrays or virtual arrays that we could have associated with a single virtual array or plate is iNT VIAX or 2147483647. In a 64-bit world, INT_MAX for a virtual array is 4611686014132420609.
  • a researcher should be allowed to order the numbering of the virtual wells or spots in a virtual array based on their attributes.
  • the ordering of the virtual spots or wells in a virtual array is based on a numerical value described in a common attribute field for each of the virtual spots or wells. For example, the researcher can order the virtual spots or wells in a virtual array from left to right starting with the highest numbers, first.
  • the ordering of the virtual spots or wells in a virtual array is based on an alphabetical ordering of a word or alphanumeric descriptor found in a common attribute field for each of the virtual spots or wells.
  • the researcher may order the virtual spots or wells in a virtual array from left to right starting with A and ending with Z based on a word found in a common attribute field.
  • the ordering of the virtual spots or wells in a virtual array may be based on a set of user-defined rules that use a word, number, or alphanumeric descriptor found in a common attribute field for each of the virtual spots or wells.
  • the ordering of the virtual spots or wells in a virtual array may be determined by user-defined rules that break the virtual spots or wells in a virtual array into groups based on a common attribute and then orders these groups from left to right.
  • the ordering of the virtual spots or wells in a virtual array may be determined by a word, number, or alphanumeric descriptor found in two or more common attribute fields in combination. For example, a set of virtual spots or wells in a virtual array is first ordered alphabetically from A to Z using a word found in the first attribute field, and then sorted numerically starting with one using a number found in the second attribute field.
  • Nirtual arrays may provide pop-up views of the data associated with a virtual well or spot when a mouse hovers over the virtual spot or well.
  • the pop-up view may display one or more attributes at the same time.
  • one or more attribute values may be displayed for a defined period of time, where the first attribute value is displayed followed by the next attribute value until all the attribute values have been displayed.
  • attribute values for a group of virtual spots or wells in a virtual array may be displayed in a pop-up view together when a mouse hovers over one virtual spot or well of the group.
  • the virtual spots or wells of a virtual array may be grouped based on a common attribute value, e.g. attribute Gene Name contains value of Spl, and all the virtual spots or wells with the specified attribute value display data from one or more attribute fields in a popup view when the mouse hovers over any virtual spot or well containing the specified attribute value.
  • the researcher may generate various tables, graphs and reports based on the data contained in one or more the attributes of a virtual array.
  • the attributes may be displayed in a table that contains one or more attributes from the virtual array in the rows or columns of the table.
  • the attributes may be displayed in a graph that displays the data graphically based on one or more attributes.
  • the data may be displayed in a scatter plot based on the data in one or more common attributes.
  • the researcher may generate a report that describes one or more attributes for a selected group of virtual spots or wells in a virtual array.
  • the report may be a template that contains fields that are filled out based on information contained in the attributes of a virtual array.
  • Nirtual arrays may provide a means to select attribute data in spots or wells of virtual arrays, microplates, and microarrays for automatic updating over the Web or Internet.
  • the attribute data of one or more virtual spots or wells of a virtual array are selected and the researcher then defines the source of the updates and the frequency of updating. The updates may arrive as files for import into the application or the updates may be written directly to the database.
  • the researcher may create links from one or more attribute fields of a spot or well in a virtual array, microplate, or microarray to a Web site that contains data.
  • the data on the Web site may be updated and viewed from the virtual array using software (e.g. an embedded Web browser such as ExplorerTM from Microsoft).
  • Nirtual arrays may provide for data export from or import to. spot and/or well attributes associated with virtual arrays, plates, and microarrays. Export and import may be performed to and from a web page, a Web-based search engine, a report, another software application, a robotic or other hardware interface, or another content management system.
  • the import and export of data is mediated by a standard data transfer format, e.g.
  • data from one or more attributes of a virtual array may be exported into and imported from an application for visualization of microarray data, e.g. Array ExplorerTM software from Spotfire of Cambridge, Massachusetts and GeneSpringTM software from Silicon Genetics of Redwood City, California.
  • a researcher may select data from one of more attributes in a virtual array for export to a visualization software package (e.g.
  • data from the virtual array will then be viewed in this visualization software by one or more researchers where the data from the virtual array may be modified, and then the modified data from the virtual array in the visualization software may be imported from the visualization software back into the original virtual array or used to create a new virtual array by a researcher.
  • data from one or more attributes of a virtual array may be exported to and imported from robotic control software to specify the aspiration and dispensing procedures for a robot, using a flat file format readable by both software packages (e.g. XML or text file formats).
  • a researcher may create a virtual array from spots or wells from microarrays, microplates and virtual arrays where data from one of more attributes met certain criteria, e.g. ratio data from two images generated from the microarray (e.g. Cy3 and Cy5) exceeded a threshold value, or the quality of an assay component in a microplate or microarray met or exceeded a certain value (e.g. including but not limited to components such as D ⁇ A, protein, chemicals, carbohydrates, buffers, dyes, isotopes).
  • the data from this virtual array may be used by the researcher to specify the manufacture of a new microarray using a robot, e.g.
  • data from one or more attributes in a virtual array may be exported to or imported from a report template generated in a Web page format (e.g.
  • GeneNetTM from Silicon Genetics of Redwood City, California
  • a flat file format readable by both software packages (e.g. XML or text file fo ⁇ nats).
  • a researcher may select data from one of more attributes in a virtual array for export to a Web page report format, the data from the virtual array will then be viewed in this report by one or more researchers where the data from the virtual array may be modified, and then the modified data from the virtual array in the Web page report format may be imported back into the original virtual array or used to create a new virtual array by a researcher.
  • the researcher may import to or export from a virtual array to a content management system (e.g.
  • Nirtual arrays may provide a data set for the application of mathematical and/or other statistical based analysis on information contained in attributes of spots and/or wells of virtual arrays, microplates and microarrays.
  • the mathematical and/or statistical analysis that may be applied to spots and/or wells of virtual arrays, microplates or microarrays includes but is not limited to data mining, data normalization, data reduction or data quality analysis.
  • the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays.
  • the information contained in the attributes of a virtual array e.g. data points, confidence analysis, information on controls and/or experimental replicates
  • Examples of data normalization statistical outcomes include but are not limited to normalized weighted average log data sets and normalized weighted average ratio data sets. These data sets can be used to derive statistical measures of data quality, including but not limited to outlier analysis and spot to spot variation, that may be used by the researcher to remove poor data points from further analysis.
  • the statistical normalization applied to a virtual array may allow the researcher to directly compare data from the spots or wells of the different microplates, microarrays and virtual arrays that comprise a virtual array and may enable the researcher to create a dataset or new virtual array for use in other mathematical and/or statistical techniques (e.g. data mining, data normalization, data reduction or data quality analysis).
  • the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays.
  • the information contained in spot or well attributes of virtual arrays, microplates or microarrays linked to the spots or wells of a primary virtual array e.g. information on controls or experimental replicates
  • the links between the virtual arrays, microplates and microarrays and the primary virtual array may be internal to the software and/or database, or mediated through a network (e.g. LAN, WAN, Internet).
  • Examples of data normalization statistical outcomes include but are not limited to normalized weighted average log data sets and normalized weighted average ratio data sets. These data sets can be used to derive statistical measures of data quality, including but not limited to outlier analysis and spot to spot variation, that may be used by the researcher to remove poor data points from further analysis.
  • the statistical normalization applied to a primary virtual array may allow the researcher to directly compare data from the spots or wells of the different microplates, microarrays and virtual arrays that comprise the primary virtual array and may enable the researcher to create a data set by filling attribute fields of the virtual array or by generating a new virtual array for use in other mathematical and/or statistical techniques (e.g. data mining, data normalization, data reduction or data quality analysis).
  • the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays.
  • the information contained in the attributes of a virtual array e.g.
  • Data points, confidence analysis, information on controls and/or experimental replicates may be used as a data set to perform statistical normalization with third party software applications.
  • Data from one or more attributes linked to the spots or wells of a virtual array may be exported in a form readable by to a software package (e.g. XML or text file formats), including but not limited to Statistical InformaticsTM from Imaging Research of Ontario, Canada, and the results of the normalization process may be imported back into the attribute fields of the original virtual array or into the attribute fields of a new virtual array for use with other statistical analysis and data mining procedures.
  • the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays.
  • the information contained in the attributes of a virtual array may be used to generate a data set to perform statistical data mining techniques.
  • Examples of statistical data mining techniques that may be applied to data sets generated from virtual arrays includes but is not limited to support vector machines, K-means analysis, hierarchical clustering analysis, principal component analysis, self-organizing maps, mutual relevancy networks, PLAID models, Bayesian networks, independent component analysis, etc.
  • the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays.
  • the information contained in the attributes of a virtual array e.g.
  • Data points, confidence analysis, information on controls and/or experimental replicates may be used as a data set to perform data mining with third party software applications.
  • Data from one or more attributes linked to the spots or wells of a virtual array may be exported in a format (e.g.
  • XML or text file formats readable by data mining software packages including but not limited to Statistical InformaticsTM from Imaging Research of Ontario, Canada; GenExploreTM from Applied Maths of Sint-Martens-Latem, Belgium, GeneSightTM from Biodiscovery of Los Angeles, California; Cluster/TreeView from Eisen-Stanford University of Stanford, California; Expressionist from GeneData of Basel, Switzerland; ResolverTM from Rosetta Inpharmatics of Kirkland, Washington; GenespringTM from Silicon Genetics of Redwood City, California; Spotfire Array Explorer from Spotfire of Cambridge, Masachusetts; etc.
  • the results of the data mining process may be imported back into the attribute fields of the original virtual array or into the attribute fields of a new virtual array for use with other statistical analysis and data mining procedures.
  • the researcher may generate a purchase or product ordering request (possibly through integration with an ERP system) based on the content contained in the virtual arrays, plates, spots and/or wells.
  • the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays.
  • the information contained in the attributes of a virtual array may be used to create a requisition request for one or more items including but not limited to reagents, chemicals, microarrays (e.g.
  • the requisition request may be generated in a format (e.g. XML or text file formats) readable by Enterprise Resource Planning (ERP) software used for making and tracking purchase requests and managing inventories.
  • ERP Enterprise Resource Planning
  • Examples of ERP software include but are not limited to EMAX Substance Manager and EMAX Reagent Manager from EMAX Solutions a subsidiary of SciQuest of Morrisville, NC.
  • the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays.
  • the information contained in the attributes of a virtual array may be used to create a requisition request for one or more items including but not limited to reagents, chemicals, microarrays (e.g. bioassay, reagent, DNA, protein, chemical, carbohydrate, etc.), microplates (e.g. bioassay, reagent, DNA, protein, chemical, carbohydrate, etc.), hardware, disposables.
  • the requisition request may be generated in a format (e.g. XML or text file formats) readable by e-commerce software that runs over the Internet. Examples of e- commerce software include but are not limited to SciQuest.com from SciQuest of Morrisville, NC.
  • exemplary attributes include:
  • Parent ID This attribute is used to keep track of which array or plate is parent and which is daughter; it will be set to "None” if there is no parent, and to the value of the parent's "Array/Plate ID” if there is a parent;
  • This field is read-only and displays the labels of all images associated with the selected array(s); Comments: This field stores comments; (5) X Offset ( ⁇ m): This field is read-only and specifies the horizontal offset of the top left well center from the Deck View's left edge;
  • Y Offset ( ⁇ m): This field is read-only and specifies the vertical offset of the top left well center from the Deck View's top edge; (7) Columns of Multi-Sets: This field is read-only and specifies the number of columns of secondary array combinations for the selected array(s) or plate(s);
  • Rows of Multi-Sets This field is read-only and specifies the number of rows of secondary array combinations for the selected array(s) or plate(s);
  • Multi-Set Column Spacing (9) Multi-Set Column Spacing ( ⁇ m): This field is read-only and specifies the horizontal center-to-center spacing between elements of all multi-sets within the selected array(s) or plate(s);
  • Multi-Set Row Spacing This field is read-only and specifies the vertical center-to-center spacing between elements of all multi-sets within the selected array(s) or plate(s);
  • (11) Columns of Sets This field is read-only and specifies the number of columns of secondary arrays for the selected array(s) or plate(s);
  • Rows of Sets This field is read-only and specifies the number of rows of secondary arrays for the selected array(s) or plate(s);
  • Rows per Set This field is read-only and displays the number of rows per set in the selected array(s) or plate(s);
  • Y Spacing between Wells ( ⁇ m): This field is read-only and specifies the vertical spacing between wells in the selected array(s) or plate(s);
  • Well Diameter This field is read-only and specifies the diameter or width of each well in the selected array(s) or plate(s);
  • Well Shape This field is read-only and specifies the shape of all elements in the selected array(s) or plate(s);
  • Grid Angle Tins field specifies the angle of rotation (in degrees) of the grid(s) associated with the array. Positive angles correspond to clockwise rotations;
  • Shape Constraint This field specifies the spot shape used to represent the spot for intensity calculations. The shape choices are: Fixed, Fitted Min, Fitted Max, and Contour;
  • Max Grid Rotation (deg): Specifies the maximum angle in degrees that the grid associated with an array can rotate;
  • Max Set Deviation Specifies the maximum deviation allowed for a set of Grid Spots (spot boundaries), starting from its nominal (template defined) position;
  • Max Spot Deviation Specifies the maximum deviation allowed for a spot
  • Min Spot Diameter Specifies the minimum tolerated spot diameter value
  • Max Spot Diameter Specifies the maximum tolerated spot diameter value
  • Treatment Units This field displays the Treatment Units (i.e. time, concentration, etc.) of the selected array(s) or plate(s);
  • Cy3 ID This field contains linked annotations and specifies the identification code of the dye
  • Cy3 eye dye This field is linked to Cy3 ID and specifies the name of the dye
  • Cy3 enzyme This field is linked to Cy3 ID and specifies the enzyme of the dye
  • Cy5 ID This field contains linked annotations and specifies the identification code of the dye
  • Cy5 eye dye This field is linked to Cy5 ID and specifies the name of the dye
  • Cy5 enzyme This field is linked to Cy5 ID and specifies the enzyme of the dye
  • Rows of Multi-Sets This field is read-only and specifies the number of rows of secondary array combinations for the selected array(s) or plate(s); (46) Columns of Sets: This field is read-only and specifies the number of columns of secondary arrays for the selected image(s);
  • Rows of Sets This field is read-only and specifies the number of rows of secondary arrays for the selected image(s);
  • Control Spots This field indicates whether certain spots on the image represent experimental controls, as opposed to experimental unknowns. If an image has control spots, then a well on the image's array or plate that is annotated Y'may be a control spotV is in fact a control spot, for that particular image; (55) Spot Confidence Threshold: Minimum confidence level, below which a spot is considered abnormal or of bad quality;
  • Max Background SD Specifies the maximum permitted background standard deviation, below which the confidence decreases. Note that in order for there to be spot background standard deviation values for this metric to work with, a background calculation method has to be turned on in the Project Settings before the image of interest is analyzed or in the Image Annotations after image analysis;
  • Max Signal CV Specifies the maximum acceptable percentage of the signal coefficient of variance, below which the confidence decreases; (64) Signal CV Window: Specifies the window of values within which the confidence decreases with respect to this metric;
  • Max Saturated Pixels Specifies the maximum allowed percentage of saturated pixels in a spot, below which the confidence decreases;
  • Min SNR Specifies the minimum tolerated percentage of the signal to noise ratio, above which the confidence decreases. By definition, this ratio is equal to the mean signal over the signal standard deviation;
  • SNR Window Specifies the window of values within which the confidence decreases with respect to this metric
  • Scanner ID This field contains linked annotations, and specifies the identification code of the scanner used to capture the microarray images;
  • Scanner operator This field specifies the scanner operator;
  • Scanner date This field specifies the date when the image was captured
  • Multi-Set This field is read-only and displays the number or alphabetic character of the selected well's multi-set
  • Multi-Set Row This field is read-only and displays the number or alphabetic character of the selected well's multi-set row
  • Multi-Set Column This field is read-only and displays the number or alphabetic character of the selected well's multi-set column
  • Clone ID This field contains linked annotations and specifies the identification code of the clone or sample in the well or spot. All the other annotations in this tab are linked to the Clone ID;
  • UniGene cluster ID This field specifies the UniGene cluster identification code of the gene in the well or spot;
  • Locus ID This field specifies the identification code of the locus that corresponds to the gene in the well or spot;
  • Chromosome ID This field specifies the chromosome in which the gene in the sample is located
  • GenBank accession number This field specifies the GenBank identification code of the gene in the well or spot;
  • Tissue This field specifies the tissue from which the sample was extracted;
  • Gene map This field specifies the genetic map of the gene in the well or spot
  • Organism This field specifies the organism from which the sample was extracted;
  • Sequence This field specifies the nucleotide sequence of the gene in the well or spot;
  • Control Spot This field indicates whether a well may be a control spot. The well will only be a control spot for those particular images that are attached to the well's array/plate and listed in their annotations as having control spots. Control spots can be set as either negative or positive controls;
  • Spot Normalized Intensity This field is read-only and contains the normalized intensity of the spot as defined by the project settings;
  • Spot Ratio This field is read-only and contains the ratio of the normalized spot intensities, and it uses the selected image (e.g. cy3 or cy5) as the numerator;
  • the array/plate name attribute, array/plate ID attribute, .parent ID attribute, images attribute, and comments are grouped together as general information attributes;
  • the X offset attribute, Y offset attribute, columns of multi-sets attribute, rows of multi-sets attribute, multi-set column spacing attribute, columns of sets attribute, rows of sets attribute, set column spacing attribute, set row spacing attribute, columns per set attribute, rows per set attribute, X spacing between Wells attribute, Y spacing between Wells attribute, well diameter attribute, well shape attribute, grid angle attribute, and shape constraint attribute are grouped together as geometry tab attributes;
  • max grid rotation attribute, max set deviation attribute, max spot deviation attribute, max spot deviation attribute, min spot diameter attribute, max spot diameter attribute, and max spot compactness attribute are grouped together as detection tab attributes;
  • treatment attribute, treatment units attribute, time attribute, temperature attribute, and buffer attribute are grouped together as hybridization tab attributes;
  • Cy3 ID attribute, Cy3 eye dye attribute, and Cy3 enzyme attribute are grouped together as Probe_cy3 attributes; Cy5 ID attribute, Cy5 eye
  • Fig. 3 depicts a screen shot of an embodiment of the invention showing two microplates: microplateOl 302 and microplate02 304.
  • One well 302 (1) of microplateOl 302 is selected and the screen display indicates that well 302 (1) maps to four spots per microarray 310, 312.
  • a well is mapped to a spot it is displayed in a color other than gray. If a well has not been mapped to a spot, then it is displayed as a gray color (as shown in connection with microplate 02 304). Also shown in Fig.
  • FIG. 3 is an annotation window 320 which displays to the researcher the attributes associated with selected well 302 (1). As shown in the embodiment of Fig. 3, the geometrical information associated with well 302 (1) is shown.
  • Fig. 4 depicts a closer view of microarray 312 showing that spots 312 (1), 312 (2), 312 (3), 312 (4), are mapped from a particular well.
  • Annotations window 320 displays attribute values associated with the particular spots.
  • Fig. 5 is a screen shot displaying to a researcher the results of a search performed on the samples represented by microarrays 310, 312 a search dialog box 510 provides a number of pulldown menus and input fields which allows a researcher to tailor a search.
  • search dialog box 510 the researcher is able to construct a search that finds all spots or wells throughout a project or only for the currently displayed microarrays or microplates.
  • the researcher has elected to find both spots and wells throughout the entire project matching the search criteria displayed.
  • the researcher has constructed a search looking for all spots having cDNA from chromosome 3, tissue sample from the brain, and a spot confidence exceeding fifty percent. Spots matching the search criteria are displayed to the user in a darker gray than other spots displayed to the user, although any color may be used to display qualifying spots to the researcher.
  • Fig. 6 shows a virtual array 620 created from the spot matching the search criteria of Fig. 5.
  • spot 620 (1) has been selected and various attribute values are displayed in the spot/well annotations window 640.
  • Fig. 7 depicts a further query performed on the spots contained in virtual array 620 a search dialog box 720 is depicted in which a researcher is searching for those spots having a spot ratio greater than or equal to 2 (signifying a two-fold change in gene expression).
  • the spots matching the search criteria depicted in search dialog box 720 may be highlighted by various colors or may themselves form a new virtual microarray for further searching or display to the researcher.
  • the spot well annotations window 640 depicts clone information corresponding to the selected spot 620 (1).
  • the spot well annotations page includes a number of tabs (info, clone, intensities) which allows the researcher to select between various collections of data corresponding to the selected spot.
  • the classes of the present invention may take many different forms.
  • the classes are compiled binary objects in accord with CORBA, ActiveX, OpenDoc, or other object-oriented frameworks.
  • the classes are binary executables compiled from files written in one or more programming languages including but not limited to C, C++, C#, Lisp, or Pascal.

Abstract

A virtual microplate is provided. The microplate corresponds to a plurality of physical microplates, the virtual microplate comprising a first virtual well associated with a first physical well of a first physical microplate and a second virtual well associated with a second physical well of a second physical microplate.

Description

METHOD AND APPARATUS FOR VISUALIZING COMPLEX DATA SETS
Cross-Reference to Related Applications
[0001] The present application claims benefit of United States Patent Application Serial Nos. 60/216,902 and 60/217,873, filed July 7, 2000 and July 12, 2000, respectively.
Field of the Invention [0002] The invention relates to the field of data management and, more specifically, to the field of visualizing complex, voluminous data sets.
Background of the Invention [0003] Laboratory researchers must design and plan their experiments, perform the experiments, store and analyze experimental data, and generate reports based on raw and processed data. These experimental tasks are particularly challenging in fields that entail high volumes of samples or data. For example, biologists working in the fields of genetics and drug-discovery often employ high-throughput screening ("HTS") where microassay samples have volumes in a range of picoliters to milliliters. In analogy to biological experiments done in test tubes, experiments done in microassays require liquid-handling steps to mix reagents in strict quantities and specified orders. Commonly, these HTS liquid-handling robots are employed to automate the preparation of thousands of microassay samples, often using microarrays, microplates, or DNA Chips. Data is often collected in the form of images for the thousands of samples, further increasing the difficult of managing experiments.
[0004] Sample tracking is the first challenge in data management, followed by data collection, analysis and reporting. The quantity and complexity of experiments achievable by these HTS research techniques far exceed the laboratory worker's ability to efficiently manually manage experimental information. This presents significant challenges in experimental design, sample tracking, and experimental data collecting, tracking and analysis. These challenges are particularly acute in microassy research which can require the analysis of hundred or thousands of individual results. There exists a need to organize, arrange, and display for evaluation voluminous, complex data sets from microassay research in an efficient and lucid manner. Summary of the Invention [0005] In one embodiment the invention relates to a virtual microplate corresponding to a plurality of physical microplate. The virtual microplate includes a first virtual well and a second virtual well. The first virtual well is associated with a first physical well of a first physical microplate. The second virtual well is associated with a second physical well of a second physical microplate. The physical microplates to which the virtual wells correspond may be geographically remote from one another. In other embodiments, each virtual well has a number of attributes associated with it that indicates information corresponding to the physical well to which the virtual well corresponds. [0006] In another embodiment, the present invention relates to a method for visualizing data. A first data array and a second data array are provided, each of the data arrays have a plurality of locations which includes a plurality of attributes associated with each location. A virtual data array is generated which includes a plurality of locations, each location associated with a location from the first or second data and each location including a plurality of attributes associated with the location in the first or second data and each location including a plurality of attributes associated with the location in the first or second data array.
Brief Description of the Drawings [0007] The advantages of the invention described above, together with further advantages, may be better understood by referring to the following description taken in conjunction with the accompanying drawings. In the drawings, like reference characters generally refer to the same parts throughout the different views. Also, the drawings are not necessarily to scale, the emphasis instead is placed on conveying the concepts of the invention.
Fig. 1 is a highly diagrammatic block diagram of a research environment in which the present invention may be used. Fig. 2 is a block diagram depicting one embodiment of a virtual microarray.
Fig. 3 is a screen shot depicting an embodiment of the invention. Fig. 4 is a screen shot depicting an embodiment of the invention. Fig. 5 is a screen shot depicting an embodiment of the invention. Fig. 6 is a screen shot depicting an embodiment of the invention. Fig. 7 is a screen shot depicting an embodiment of the invention. Detailed Description of the Invention r0008] Fig. 1 depicts a research environment in which the present invention may be used. A microarray robot 102 transfers biological samples from microplates 104, 1041, 1042, ..., 104" (generally 104) to microarrays 106, 1061, 1062, ..., 106" (generally 106). Although the example shown in Fig. 1 depicts an equal number of microplates and microarrays, the number of microplates does not need to equal the number of microarrays. Each individual sample in a microplate is referred to as a "well" and each microplate may have 96, 384, or 1536 wells. Individual areas on a microarray 106 which receive biological sample material are referred to as "spots." The number of spots on each microarray 106 is limited solely by the size of the physical media comprising the microarray 106 and the proximity with which biological samples may be placed to one another by the microarray robot 102. Exemplary microarray robots 102 that may be used include: the Biochip Arrayer I and the Biochip Arrayer II, manufactured by Packard BioScience of Meriden, Connecticut; the GTM-STP-1218, manufactured by Nippon Laser and Electronics Lab of Nagoya, Japan; the OmniGrid manufactured by GeneMachines of San Carols, California; the Pixsys 5500 manufactured by Cartesian Technologies of Irvine, California; the 417 manufactured by Affymetrix of Santa Clara, California, and the MicroGrid II manufactured by BioRobotics of Cambridge, United Kingdom.
[0009] In some embodiments, software implementing the present invention may run directly on a microarray robot 102. In other embodiments, software implementing the present invention may run on a computing node (not shown) that is in communication with the microarray robot 102. In these embodiments, the computing node may be any personal computer (e.g., 286, 386, 486, Pentium, Pentium II, Macintosh computer), Windows-based terminal, Network Computer, wireless device, information appliance, RISC Power PC, X-device, workstation, mini computer, main frame computer or other computing device. Windows-oriented platforms supported by the computing node can include Windows 3.x, Windows 95, Windows 98, Windows NT 3.51, Windows NT 4.0, Windows CE, Macintosh, Java, Unix and Windows 2000. The computing node can include a display screen, a keyboard, memory for storing downloaded application programs, a processor, and a mouse. The memory can provide persistent or volatile storage. In other embodiments, the computing node may be provided as a personal digital assistant (PDA), such as the Palm series of PDAs, manufactured by Palm, Inc. of Santa Clara, California. In these embodiments, the computing node may communicate with the microarray robot 102 using infrared links. In one particularly preferred embodiment, the computing personal computer node is a personal computer including a PENTIUM III processor (or better) using MICROSOFT WINDOWS 4.0 or MICROSOFT WINDOWS 2000 (or better).
[0010] The computing node includes, or has access to, a database for storing information relating to individual spots and wells. The database may be a relational database such as the Oracle 9i database, manufactured by Oracle Corporation of Breadwood Shores, California. Alternatively, the database may be a flat filed database such as the JET database manufactured by Microsoft Corporation of Redmond, Washington. In other embodiments the computing node may store information relating to wells and spots in a flat file or, delimited, -delimited format. In further embodiments the database may be any OTBC-compliant database. [0011] Fig. 2 depicts an embodiment of a virtual microarray 206. Each spot on virtual microarray 206 corresponds to a physical spot on a physical microarray 106. Although Fig. 2 depicts a virtual microarray 206, the virtual microplates may also be constructed from one or more wells from a physical microplate 104. The assignment of physical spots to virtual spots in virtual microarray 206 may be selected by the user, either by specifying a search criteria that operates on data representing the physical spots, or by manual selection. For embodiments in which the user specifies a search criteria, the search results may form virtual microarray 206, which allows a user to combine physical spots from multiple physical microarrays 106 that share a common data characteristic, regardless of the geographic or temporal location of the physical spot. As shown in Fig. 2, virtual microarray 206 does not need to have the same geometric characteristics as the physical microarrays 106 from which it is constructed. Also as shown in Fig. 2, virtual microarray 206 need not have a regular geometric shape. [0012] In order to facilitate the construction of virtual microarray 206 (and also virtual microplates or other virtual constructs which are hybrids of physical microplates and physical microarrays) a data object may be created which represents physical microarray 106. In some embodiments, a base class may be provided which generally represents physical microarrays 106. Each physical microarray 106 is modeled as an instantiation of that base class. In one particular embodiment, a base class representing a physical microarray 106 or microplate 104 provides: (1) an internal variable indicating whether the object represents a microplate 104 or a microarray 106; (2) one or more attributes of the physical microplate or microarray, which may be defined by the researcher; (3) information about the geometric layout of the physical microplate or microarray, such as the spacing between wells/spots, and whether wells/spots are grouped into sets;
(4) well/spot attributes for each spot/well, which may be defined by the researcher;
(5) sample tracking information about each spot/well indicating both source and destination spot/wells;
(6) geometrical information about each spot of each image associated with the rack, such as detection status information, location, shape or contour, extents, compactness and confidence; and
(7) intensities for each spot of each image associated with the rack. [0013] A companion class may be created which represents virtual versions of physical microarrays or microplates. In some embodiments, the class representing the virtual microarrays or microplates is an extension of the base class described above. In other embodiments, the class representing the virtual microarrays or microplates is a separate class. In one particular embodiment, a class representing a virtual microarray 106 or microplate 104 provides: (1) an internal variable indicating whether the object represents a virtual microarray or a virtual microplate; (2) information about the geometric layout of the virtual microplate or microarray; (3) well/spot attributes for each virtual well/virtual spot, which includes information about the physical well/physical spot and physical microplate/physical microarray it represents; (4) the search criteria used to identify each well/spot. [0014] A class representing a virtual microplate or microarray should support the following actions: creation; display of the virtual microplate or microarray; display of virtual spots or virtual wells; selection of virtual spots or virtual wells; search refinements; deletion of virtual wells and virtual spots; deletion of virtual microplates or microarrays; and saving and loading virtual microplates and microarrays to and from a database. [0015] A virtual microplate or microarray should be created as the result of a selection of wells or spots either manually or by search process. The dimensions of a virtual microplate or microarray (i.e., rows and columns) should be computed (or recomputed) as necessary. [0016] A virtual microplate or microarray should be displayed to the user in a fashion that is analogous to the display of a physical microarray or microplate or robot deck. [0017] Individual spots and wells should be drawn to reflect a user-indicated properties through shading or color coding. For example, a researcher may have searched for all spots having an intensity above a minimum value and below a maximum value. Spots matching that search criteria can be displayed with a color map reflecting the actual intensity value. [0018] Nirtual wells and virtual spots may be selected with standard mouse clicking and dragging. Attributes of those virtual wells or virtual spots should be displayed when a particular virtual well or virtual spot is selected. This allows a researcher to edit attributes or annotations of attributes by selecting a virtual well or virtual spot to display the attributes of that virtual well or virtual spot. In some embodiments, when virtual wells or virtual spots are selected they are highlighted and a graphical representation of the physical well or physical spot to which they correspond should also be displayed. [0019] The researcher may use all or a subset of the virtual wells or virtual spots to refine a search and generate additional virtual microplates or microarrays as the result of the searches. For each search iteration, the search criteria and search results should be saved. [0020] A researcher should be allowed to delete a virtual well or virtual spot by selecting a virtual well or virtual spot and indicating to the computing device that the virtual well or virtual spot should be deleted. When this occurs, the virtual well or virtual spot is removed from the virtual microplate or virtual microarray.
[0021] The researcher should also be allowed to delete and entire virtual microplate or microarray by selecting the virtual microplate or microarray and indicating that it should be deleted. [0022] Nirtual microplates and microarrays should be able to be stored. In some embodiments, virtual microplate and microarrays are stored in a database.
[0023] A virtual array may have 1, 10, 20, 40, 80, 160, 250, 500, 1000, 10000, 100000, 1000000, 10000000 or more spots or wells. A virtual array or plate contains a CArray (Microsoft Foundation Class construct) of spots or wells. C Arrays are indexed. by integers so in the 32-bit world the current maximum number of virtual spots or wells that we could have associated with a single virtual array or plate is IΝTJViAX or 2147483647. In a 64-bit world, LΝT_MAX for a virtual array is 4611686014132420609.
[0024] A virtual array may have 1, 10, 20, 40, 80, 160, 250, 500, 1000, 10000, 100000, 1000000, 10000000 or more attribute field linked to each spot or well of a virtual array. By changing one line of code we can increase the number of attribute fields linked to a spot or well to practically any multiple of 32. We have a # define called ROLLCALL_SIZE in AttributeList.h which determines the number of multiples of 32. Therefore the total number of attributes per object is the sum of the number of built-in attributes.
[0025] A virtual array may contain links back to spots or wells from 1, 10, 20, 40, 80, 160, 250, 500, 1000, 10000, 100000, 1000000, 10000000 or more different virtual arrays, microplates or microarrays. Each spot or well in a virtual array may come from one well or spot of a linked virtual array, microarray or microplate. A virtual array or plate contains a CArray (Microsoft Foundation Class construct) of spots or wells. CArrays are indexed by integers so in the 32-bit world the current maximum number of different microplates, microarrays or virtual arrays that we could have associated with a single virtual array or plate is iNT VIAX or 2147483647. In a 64-bit world, INT_MAX for a virtual array is 4611686014132420609.
[0026] A researcher should be allowed to order the numbering of the virtual wells or spots in a virtual array based on their attributes. In one embodiment, the ordering of the virtual spots or wells in a virtual array is based on a numerical value described in a common attribute field for each of the virtual spots or wells. For example, the researcher can order the virtual spots or wells in a virtual array from left to right starting with the highest numbers, first. In other embodiments, the ordering of the virtual spots or wells in a virtual array is based on an alphabetical ordering of a word or alphanumeric descriptor found in a common attribute field for each of the virtual spots or wells. For example, the researcher may order the virtual spots or wells in a virtual array from left to right starting with A and ending with Z based on a word found in a common attribute field. In other embodiments, the ordering of the virtual spots or wells in a virtual array may be based on a set of user-defined rules that use a word, number, or alphanumeric descriptor found in a common attribute field for each of the virtual spots or wells. For example, the ordering of the virtual spots or wells in a virtual array may be determined by user-defined rules that break the virtual spots or wells in a virtual array into groups based on a common attribute and then orders these groups from left to right. In another embodiment, the ordering of the virtual spots or wells in a virtual array may be determined by a word, number, or alphanumeric descriptor found in two or more common attribute fields in combination. For example, a set of virtual spots or wells in a virtual array is first ordered alphabetically from A to Z using a word found in the first attribute field, and then sorted numerically starting with one using a number found in the second attribute field.
[0027] A researcher should be allowed to modify the shape or shading of a virtual spot or well based on the attributes of a virtual well or spot included in the virtual array. In one embodiment, a virtual spot or well in a virtual array may be shaded based on the numerical value contained in an attribute field. In another embodiment, each virtual spot or well in a virtual array may have the appearance of a three dimensional rectangle with the same base dimensions but a height determined by the numerical value in one of the common attribute fields. In another embodiment, the two dimensional shape of each virtual spot or well of a virtual array may be defined by the value contained in a common attribute field. For example, a common attribute field may contain one of four values, and each attribute value is associated with a unique shape for the virtual spot or well, e.g. circle, square, triangle or hexagon. [0028] Nirtual arrays may provide pop-up views of the data associated with a virtual well or spot when a mouse hovers over the virtual spot or well. In one embodiment, the pop-up view may display one or more attributes at the same time. In other embodiments, one or more attribute values may be displayed for a defined period of time, where the first attribute value is displayed followed by the next attribute value until all the attribute values have been displayed. In other embodiments, attribute values for a group of virtual spots or wells in a virtual array may be displayed in a pop-up view together when a mouse hovers over one virtual spot or well of the group. For example, the virtual spots or wells of a virtual array may be grouped based on a common attribute value, e.g. attribute Gene Name contains value of Spl, and all the virtual spots or wells with the specified attribute value display data from one or more attribute fields in a popup view when the mouse hovers over any virtual spot or well containing the specified attribute value.
[0029] The researcher may generate various tables, graphs and reports based on the data contained in one or more the attributes of a virtual array. In one embodiment, the attributes may be displayed in a table that contains one or more attributes from the virtual array in the rows or columns of the table. In another embodiment, the attributes may be displayed in a graph that displays the data graphically based on one or more attributes. For example, the data may be displayed in a scatter plot based on the data in one or more common attributes. In another embodiment, the researcher may generate a report that describes one or more attributes for a selected group of virtual spots or wells in a virtual array. The report may be a template that contains fields that are filled out based on information contained in the attributes of a virtual array. In another embodiment, the researcher can modify values in the virtual spots or wells of a virtual array through the fields displaying data in the tables, graphs, and reports generated from the virtual array. [0030] Nirtual arrays may provide a means to select attribute data in spots or wells of virtual arrays, microplates, and microarrays for automatic updating over the Web or Internet. In one embodiment, the attribute data of one or more virtual spots or wells of a virtual array are selected and the researcher then defines the source of the updates and the frequency of updating. The updates may arrive as files for import into the application or the updates may be written directly to the database. In another embodiment, the researcher may create links from one or more attribute fields of a spot or well in a virtual array, microplate, or microarray to a Web site that contains data. The data on the Web site may be updated and viewed from the virtual array using software (e.g. an embedded Web browser such as Explorer™ from Microsoft). [0031] Nirtual arrays may provide for data export from or import to. spot and/or well attributes associated with virtual arrays, plates, and microarrays. Export and import may be performed to and from a web page, a Web-based search engine, a report, another software application, a robotic or other hardware interface, or another content management system. In one embodiment, the import and export of data is mediated by a standard data transfer format, e.g. Extensible Markup Language (XML), hi another embodiment, data from one or more attributes of a virtual array may be exported into and imported from an application for visualization of microarray data, e.g. Array Explorer™ software from Spotfire of Cambridge, Massachusetts and GeneSpring™ software from Silicon Genetics of Redwood City, California. For example, a researcher may select data from one of more attributes in a virtual array for export to a visualization software package (e.g. GeneSpring™ from Silicon Genetics of Redwood City, California), the data from the virtual array will then be viewed in this visualization software by one or more researchers where the data from the virtual array may be modified, and then the modified data from the virtual array in the visualization software may be imported from the visualization software back into the original virtual array or used to create a new virtual array by a researcher. In another embodiment, data from one or more attributes of a virtual array may be exported to and imported from robotic control software to specify the aspiration and dispensing procedures for a robot, using a flat file format readable by both software packages (e.g. XML or text file formats). For example, a researcher may create a virtual array from spots or wells from microarrays, microplates and virtual arrays where data from one of more attributes met certain criteria, e.g. ratio data from two images generated from the microarray (e.g. Cy3 and Cy5) exceeded a threshold value, or the quality of an assay component in a microplate or microarray met or exceeded a certain value (e.g. including but not limited to components such as DΝA, protein, chemicals, carbohydrates, buffers, dyes, isotopes). The data from this virtual array may be used by the researcher to specify the manufacture of a new microarray using a robot, e.g. Biochip Arrayer I and the Biochip Arrayer II, manufactured by Packard BioScience of Meriden, Connecticut; the GTM-STP-1218, manufactured by Nippon Laser and Electronics Lab of Nagoya, Japan; the OmniGrid manufactured by GeneMachines of San Carols, California; the Pixsys 5500 manufactured by Cartesian Technologies of Irvine, California; the 417 manufactured by Affymetrix of Santa Clara, California, and the MicroGrid II manufactured by BioRobotics of Cambridge, United Kingdom. In another embodiment, data from one or more attributes in a virtual array may be exported to or imported from a report template generated in a Web page format (e.g. GeneNet™ from Silicon Genetics of Redwood City, California) using a flat file format readable by both software packages (e.g. XML or text file foπnats). For example, a researcher may select data from one of more attributes in a virtual array for export to a Web page report format, the data from the virtual array will then be viewed in this report by one or more researchers where the data from the virtual array may be modified, and then the modified data from the virtual array in the Web page report format may be imported back into the original virtual array or used to create a new virtual array by a researcher. In another embodiment, the researcher may import to or export from a virtual array to a content management system (e.g. including but not limited to DigitalGENOME software from Molecularware of Cambridge, Massachusetts; Genomax software from Informax of California; Resolver™ from Rosetta Inpharmatics of Kirkland, Washington; DoubleTwist.com Web site from DoubleTwist.com of Oakland, California), using a flat file format readable by both software packages (e.g. XML or text file formats) or a software script that writes to and from the databases incorporated with the content management software. [0032] Nirtual arrays may provide a data set for the application of mathematical and/or other statistical based analysis on information contained in attributes of spots and/or wells of virtual arrays, microplates and microarrays. The mathematical and/or statistical analysis that may be applied to spots and/or wells of virtual arrays, microplates or microarrays includes but is not limited to data mining, data normalization, data reduction or data quality analysis. In one embodiment, the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays. The information contained in the attributes of a virtual array (e.g. data points, confidence analysis, information on controls and/or experimental replicates) may be used as a dataset to perform statistical normalization techniques (e.g. removal of artifacts, subtraction of background, measures of the middle (central tendency), measures of spread (dispersion), measures to describe relationships between two characteristics, measurements of variation in the data, measurements of reliability and validity, measurements of error propagation). Examples of data normalization statistical outcomes include but are not limited to normalized weighted average log data sets and normalized weighted average ratio data sets. These data sets can be used to derive statistical measures of data quality, including but not limited to outlier analysis and spot to spot variation, that may be used by the researcher to remove poor data points from further analysis. The statistical normalization applied to a virtual array may allow the researcher to directly compare data from the spots or wells of the different microplates, microarrays and virtual arrays that comprise a virtual array and may enable the researcher to create a dataset or new virtual array for use in other mathematical and/or statistical techniques (e.g. data mining, data normalization, data reduction or data quality analysis). In another embodiment, the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays. The information contained in spot or well attributes of virtual arrays, microplates or microarrays linked to the spots or wells of a primary virtual array (e.g. information on controls or experimental replicates) may be used as a data set to perform statistical normalization techniques (e.g. removal of artifacts, subtraction of background, measures of the middle (central tendency), measures of spread (dispersion), measures to describe relationships between two characteristics, measurements of variation in the data, measurements of reliability and validity, measurements of error propagation). The links between the virtual arrays, microplates and microarrays and the primary virtual array may be internal to the software and/or database, or mediated through a network (e.g. LAN, WAN, Internet). Examples of data normalization statistical outcomes include but are not limited to normalized weighted average log data sets and normalized weighted average ratio data sets. These data sets can be used to derive statistical measures of data quality, including but not limited to outlier analysis and spot to spot variation, that may be used by the researcher to remove poor data points from further analysis. The statistical normalization applied to a primary virtual array may allow the researcher to directly compare data from the spots or wells of the different microplates, microarrays and virtual arrays that comprise the primary virtual array and may enable the researcher to create a data set by filling attribute fields of the virtual array or by generating a new virtual array for use in other mathematical and/or statistical techniques (e.g. data mining, data normalization, data reduction or data quality analysis). In other embodiments, the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays. The information contained in the attributes of a virtual array (e.g. data points, confidence analysis, information on controls and/or experimental replicates) may be used as a data set to perform statistical normalization with third party software applications. Data from one or more attributes linked to the spots or wells of a virtual array may be exported in a form readable by to a software package (e.g. XML or text file formats), including but not limited to Statistical Informatics™ from Imaging Research of Ontario, Canada, and the results of the normalization process may be imported back into the attribute fields of the original virtual array or into the attribute fields of a new virtual array for use with other statistical analysis and data mining procedures. In other embodiments, the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays. The information contained in the attributes of a virtual array (e.g. data points, confidence analysis, information on controls and/or experimental replicates) may be used to generate a data set to perform statistical data mining techniques. Examples of statistical data mining techniques that may be applied to data sets generated from virtual arrays includes but is not limited to support vector machines, K-means analysis, hierarchical clustering analysis, principal component analysis, self-organizing maps, mutual relevancy networks, PLAID models, Bayesian networks, independent component analysis, etc. In other embodiments, the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays. The information contained in the attributes of a virtual array (e.g. data points, confidence analysis, information on controls and/or experimental replicates) may be used as a data set to perform data mining with third party software applications. Data from one or more attributes linked to the spots or wells of a virtual array may be exported in a format (e.g. XML or text file formats) readable by data mining software packages including but not limited to Statistical Informatics™ from Imaging Research of Ontario, Canada; GenExplore™ from Applied Maths of Sint-Martens-Latem, Belgium, GeneSight™ from Biodiscovery of Los Angeles, California; Cluster/TreeView from Eisen-Stanford University of Stanford, California; Expressionist from GeneData of Basel, Switzerland; Resolver™ from Rosetta Inpharmatics of Kirkland, Washington; Genespring™ from Silicon Genetics of Redwood City, California; Spotfire Array Explorer from Spotfire of Cambridge, Masachusetts; etc. The results of the data mining process may be imported back into the attribute fields of the original virtual array or into the attribute fields of a new virtual array for use with other statistical analysis and data mining procedures. [0033] The researcher may generate a purchase or product ordering request (possibly through integration with an ERP system) based on the content contained in the virtual arrays, plates, spots and/or wells. In one embodiment, the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays. The information contained in the attributes of a virtual array may be used to create a requisition request for one or more items including but not limited to reagents, chemicals, microarrays (e.g. bioassay, reagent, DNA, protein, chemical, carbohydrate, etc.), microplates (e.g. bioassay, reagent, DNA, protein, chemical, carbohydrate, etc.), hardware, disposables. The requisition request may be generated in a format (e.g. XML or text file formats) readable by Enterprise Resource Planning (ERP) software used for making and tracking purchase requests and managing inventories. Examples of ERP software include but are not limited to EMAX Substance Manager and EMAX Reagent Manager from EMAX Solutions a subsidiary of SciQuest of Morrisville, NC. In other embodiments, the researcher may create a virtual array from one or more spots or wells from one or more microplates, microarrays and virtual arrays. The information contained in the attributes of a virtual array may be used to create a requisition request for one or more items including but not limited to reagents, chemicals, microarrays (e.g. bioassay, reagent, DNA, protein, chemical, carbohydrate, etc.), microplates (e.g. bioassay, reagent, DNA, protein, chemical, carbohydrate, etc.), hardware, disposables. The requisition request may be generated in a format (e.g. XML or text file formats) readable by e-commerce software that runs over the Internet. Examples of e- commerce software include but are not limited to SciQuest.com from SciQuest of Morrisville, NC.
[0034] As described above, both classes support inclusion of one or more attributes that may be specified by the researcher. Exemplary attributes include:
(1) an Array/Plate Name: This field is read-only and displays the name of the selected array(s) or plates(s);
(2) Array/Plate ID: This field displays the unique ID (barcode) of the selected array(s) or plate(s);
(3) Parent ID: This attribute is used to keep track of which array or plate is parent and which is daughter; it will be set to "None" if there is no parent, and to the value of the parent's "Array/Plate ID" if there is a parent;
(4) Images: This field is read-only and displays the labels of all images associated with the selected array(s); Comments: This field stores comments; (5) X Offset (μm): This field is read-only and specifies the horizontal offset of the top left well center from the Deck View's left edge;
(6) Y Offset (μm): This field is read-only and specifies the vertical offset of the top left well center from the Deck View's top edge; (7) Columns of Multi-Sets: This field is read-only and specifies the number of columns of secondary array combinations for the selected array(s) or plate(s);
(8) Rows of Multi-Sets: This field is read-only and specifies the number of rows of secondary array combinations for the selected array(s) or plate(s);
(9) Multi-Set Column Spacing (μm): This field is read-only and specifies the horizontal center-to-center spacing between elements of all multi-sets within the selected array(s) or plate(s);
(10) Multi-Set Row Spacing (μm): This field is read-only and specifies the vertical center-to-center spacing between elements of all multi-sets within the selected array(s) or plate(s); (11) Columns of Sets: This field is read-only and specifies the number of columns of secondary arrays for the selected array(s) or plate(s);
(12) Rows of Sets: This field is read-only and specifies the number of rows of secondary arrays for the selected array(s) or plate(s);
(13) Set Column Spacing (μm): This field is read-only and specifies the horizontal center-to-center spacing between elements of all secondary arrays within the selected array (s) or plate(s);
(14) Set Row Spacing (μm): This field is read-only and specifies the vertical center-to- center spacing between elements of all secondary arrays within the selected array(s) or plate(s); (15) Columns per Set: This field is read-only and displays the number of columns per set in the selected array(s) or plate(s);
(16) Rows per Set: This field is read-only and displays the number of rows per set in the selected array(s) or plate(s);
(17) X Spacing between Wells (μm): This field is read-only and specifies the horizontal spacing between wells in the selected array(s) or plate(s);
(18) Y Spacing between Wells (μm): This field is read-only and specifies the vertical spacing between wells in the selected array(s) or plate(s); (19) Well Diameter: This field is read-only and specifies the diameter or width of each well in the selected array(s) or plate(s);
(20) Well Shape: This field is read-only and specifies the shape of all elements in the selected array(s) or plate(s); (21) Grid Angle: Tins field specifies the angle of rotation (in degrees) of the grid(s) associated with the array. Positive angles correspond to clockwise rotations; (22) Shape Constraint: This field specifies the spot shape used to represent the spot for intensity calculations. The shape choices are: Fixed, Fitted Min, Fitted Max, and Contour; (23) Max Grid Rotation (deg): Specifies the maximum angle in degrees that the grid associated with an array can rotate;
(24) Max Set Deviation: Specifies the maximum deviation allowed for a set of Grid Spots (spot boundaries), starting from its nominal (template defined) position;
(25) Max Spot Deviation: Specifies the maximum deviation allowed for a spot; (26) Min Spot Diameter: Specifies the minimum tolerated spot diameter value;
(27) Max Spot Diameter: Specifies the maximum tolerated spot diameter value;
(28) Max Spot Compactness: Specifies the maximum tolerated spot compactness. Compactness is a measure of the circularity of a spot. Note that a circle, which is the most compact shape, has a compactness of zero (0); (29) Treatment: This field displays the Treatment variable;
(30) Treatment Units: This field displays the Treatment Units (i.e. time, concentration, etc.) of the selected array(s) or plate(s);
(31) Time: This field displays the time elapsed during hybridization, in minutes;
(32) Temperature: This field displays the temperature during hybridization, in degrees Celsius;
(33) Buffer: This field specifies the buffer used during hybridization;
(34) Cy3 ID: This field contains linked annotations and specifies the identification code of the dye;
(35) Cy3 eye dye: This field is linked to Cy3 ID and specifies the name of the dye; (36) Cy3 enzyme: This field is linked to Cy3 ID and specifies the enzyme of the dye;
(37) Cy5 ID: This field contains linked annotations and specifies the identification code of the dye; (38) Cy5 eye dye: This field is linked to Cy5 ID and specifies the name of the dye;
(39) Cy5 enzyme: This field is linked to Cy5 ID and specifies the enzyme of the dye;
(40) Filename: This field is read-only and stores the local filename of the selected image; (41) Label: This field is read-only and displays a unique descriptor for the selected image. The scope of the unique descriptor is local to each array/plate;
(42) Width (μm): This field specifies the nominal width of the primary array of the selected image(s);
(43) Height (μm): This field is read-only and specifies the nominal height of the primary array of the selected image(s);
(44) Columns of Multi-Sets: This field is read-only and specifies the number of columns of secondary array combinations for the selected array(s) or plate(s);
(45) Rows of Multi-Sets: This field is read-only and specifies the number of rows of secondary array combinations for the selected array(s) or plate(s); (46) Columns of Sets: This field is read-only and specifies the number of columns of secondary arrays for the selected image(s);
(47) Rows of Sets: This field is read-only and specifies the number of rows of secondary arrays for the selected image(s);
(48) Spot Diameter (μm): This field is read-only and specifies the spot diameter of all spots in the selected image(s);
(49) X Offset (pixels): This field specifies the horizontal offset of this image relative to other images in the same array;
(50) Y Offset (pixels): This field specifies the vertical offset of this image relative to other images in the same array; (51) Comments: This field stores comments;
(52) Background type: This field specifies the way that background intensity values are calculated;
(53) Filter: This field specifies the type of prefilter applied to the image;
(54) Has Control Spots: This field indicates whether certain spots on the image represent experimental controls, as opposed to experimental unknowns. If an image has control spots, then a well on the image's array or plate that is annotated Y'may be a control spotV is in fact a control spot, for that particular image; (55) Spot Confidence Threshold: Minimum confidence level, below which a spot is considered abnormal or of bad quality;
(56) Background Mean On: Specifies whether the Background Mean confidence metric is turned on or off; (57) Max Background Mean: Specifies the maximum allowed background mean, below which the confidence decreases. Note that in order for there to be spot background mean values for this metric to work with, a background calculation method has to be turned on in the Project Settings before the image of interest is analyzed or in the Image Annotations after image analysis; (58) Background Mean Window: Specifies the window of values within which the confidence decreases with respect to this metric;
(59) Background SD On: Specifies whether the Background Standard Deviation confidence metric is turned on or off;
(60) Max Background SD: Specifies the maximum permitted background standard deviation, below which the confidence decreases. Note that in order for there to be spot background standard deviation values for this metric to work with, a background calculation method has to be turned on in the Project Settings before the image of interest is analyzed or in the Image Annotations after image analysis;
(61) Background SD Window: Specifies the window of values within which the confidence decreases with respect to this metric;
(62) Signal CV On: Specifies whether the Signal Coefficient of Variance confidence metric is turned on or off;
(63) Max Signal CV: Specifies the maximum acceptable percentage of the signal coefficient of variance, below which the confidence decreases; (64) Signal CV Window: Specifies the window of values within which the confidence decreases with respect to this metric;
(65) Saturated Pixels On: Specifies whether the Saturated Pixels confidence metric is turned on or off;
(66) Max Saturated Pixels: Specifies the maximum allowed percentage of saturated pixels in a spot, below which the confidence decreases;
(67) Saturated Pixels Window: Specifies the window of values within which the confidence decreases with respect to this metric; (68) SNR On: Specifies whether the Signal To Noise Ratio confidence metric is turned on or off;
(69) Min SNR: Specifies the minimum tolerated percentage of the signal to noise ratio, above which the confidence decreases. By definition, this ratio is equal to the mean signal over the signal standard deviation;
(70) SNR Window: Specifies the window of values within which the confidence decreases with respect to this metric;
(71) Scanner ID: This field contains linked annotations, and specifies the identification code of the scanner used to capture the microarray images; (72) Scanner operator: This field specifies the scanner operator;
(73) Scanner date: This field specifies the date when the image was captured;
(74) Number: This field is read-only and displays the absolute number of the selected well(s);
(75) Name: This field stores a meaningful name for the selected well(s) or spot(s); (76) Array/Plate: This field is read-only and displays the array or plate name of the selected well(s);
(77) Multi-Set: This field is read-only and displays the number or alphabetic character of the selected well's multi-set;
(78) Multi-Set Row: This field is read-only and displays the number or alphabetic character of the selected well's multi-set row;
(79) Multi-Set Column: This field is read-only and displays the number or alphabetic character of the selected well's multi-set column;
(80) Set: This field is read-only and displays the number or alphabetic character of the selected well's set; (81) Set Row: This field is read-only and displays the number or alphabetic character of the selected well's set row;
(82) Set Column: This field is read-only and displays the number or alphabetic character of the selected well's set column;
(83) Row: This field is read-only and displays the number or alphabetic character of the selected well's row;
(84) Column: This field is read-only and displays the number or alphabetic character of the selected well's column; (85) Comments: This field stores comments;
(86) Clone ID: This field contains linked annotations and specifies the identification code of the clone or sample in the well or spot. All the other annotations in this tab are linked to the Clone ID; (87) UniGene cluster ID: This field specifies the UniGene cluster identification code of the gene in the well or spot;
(88) Gene symbol: This field specifies the abbreviated name of the gene in the well or spot;
(89) Locus ID: This field specifies the identification code of the locus that corresponds to the gene in the well or spot;
(90) Chromosome ID: This field specifies the chromosome in which the gene in the sample is located;
(91) GenBank accession number: This field specifies the GenBank identification code of the gene in the well or spot; (92) Tissue: This field specifies the tissue from which the sample was extracted;
(93) Gene map: This field specifies the genetic map of the gene in the well or spot;
(94) Gene name: This field specifies the unabbreviated name of the gene in the well or spot;
(95) Organism: This field specifies the organism from which the sample was extracted; (96) Sequence: This field specifies the nucleotide sequence of the gene in the well or spot;
(97) Flagged As: This field indicates whether the user has specifically marked the selected well(s) as being abnormal in some way;
(98) Control Spot: This field indicates whether a well may be a control spot. The well will only be a control spot for those particular images that are attached to the well's array/plate and listed in their annotations as having control spots. Control spots can be set as either negative or positive controls;
(99) Spot Confidence: This field is read-only and indicates the confidence of the spot, as defined by the confidence metrics; (100) Spot Normalized Intensity: This field is read-only and contains the normalized intensity of the spot as defined by the project settings; (101) Spot Ratio: This field is read-only and contains the ratio of the normalized spot intensities, and it uses the selected image (e.g. cy3 or cy5) as the numerator;
(102) Spot Mean Intensity: This field is read-only and stores the mean signal intensity value(s) of the selected spot(s). This is a raw, uncorrected value in which the background has not been subtracted out;
(103) Spot Median Intensity: This field is read-only and stores the median signal intensity value(s) of the selected spot(s). Tins is a raw, uncorrected value in which the background has not been subtracted, out;
(104) Spot Standard Deviation: This field is read-only and stores the standard deviation of the signal pixels for the selected spot(s);
( 105) Spot Number of Pixels: This field is read-only and stores the number of pixels used to calculate the signal for the selected spot(s);
(106) Spot Total Intensity: This field is read-only and stores the total integrated signal intensity value(s) of the selected spot(s). This is a raw, uncorrected value in which the background has not been subtracted out;
(107) Spot Peak Intensity: This field is read-only and stores the highest signal pixel intensity value for the selected spot(s). This is a raw, uncorrected value in which the background has not been subtracted out;
(108) Background Mean Intensity: This field is read-only and stores the mean background intensity value(s) of the selected spot(s);
(109) Background Median Intensity: This field is read-only and stores the median background intensity value(s) of the selected spot(s);
(110) Background Standard Deviation: This field is read-only and stores the standard deviation of the background pixels for the selected spot(s); (111) Background Number of Pixels: This field is read-only and stores the number of pixels used to calculate the background for the selected spot(s). [0035] The attributes identified above may be grouped together for convenient display to the researcher as individual dialog boxes, pull-down menus, or tabbed displays. In one embodiment, the array/plate name attribute, array/plate ID attribute, .parent ID attribute, images attribute, and comments are grouped together as general information attributes; the X offset attribute, Y offset attribute, columns of multi-sets attribute, rows of multi-sets attribute, multi-set column spacing attribute, columns of sets attribute, rows of sets attribute, set column spacing attribute, set row spacing attribute, columns per set attribute, rows per set attribute, X spacing between Wells attribute, Y spacing between Wells attribute, well diameter attribute, well shape attribute, grid angle attribute, and shape constraint attribute are grouped together as geometry tab attributes; max grid rotation attribute, max set deviation attribute, max spot deviation attribute, max spot deviation attribute, min spot diameter attribute, max spot diameter attribute, and max spot compactness attribute are grouped together as detection tab attributes; treatment attribute, treatment units attribute, time attribute, temperature attribute, and buffer attribute are grouped together as hybridization tab attributes; Cy3 ID attribute, Cy3 eye dye attribute, and Cy3 enzyme attribute are grouped together as Probe_cy3 attributes; Cy5 ID attribute, Cy5 eye dye attribute, and Cy5 enzyme attribute are grouped together as Probe_cy5 attributes.
Example [0036] The following example is intended to be exemplary only and is not intended to limit the scope of the claims. [0037] Fig. 3 depicts a screen shot of an embodiment of the invention showing two microplates: microplateOl 302 and microplate02 304. One well 302 (1) of microplateOl 302 is selected and the screen display indicates that well 302 (1) maps to four spots per microarray 310, 312. As shown in Fig. 3, if a well is mapped to a spot it is displayed in a color other than gray. If a well has not been mapped to a spot, then it is displayed as a gray color (as shown in connection with microplate 02 304). Also shown in Fig. 3 is an annotation window 320 which displays to the researcher the attributes associated with selected well 302 (1). As shown in the embodiment of Fig. 3, the geometrical information associated with well 302 (1) is shown. [0038] Fig. 4 depicts a closer view of microarray 312 showing that spots 312 (1), 312 (2), 312 (3), 312 (4), are mapped from a particular well. Annotations window 320 displays attribute values associated with the particular spots. [0039] Fig. 5 is a screen shot displaying to a researcher the results of a search performed on the samples represented by microarrays 310, 312 a search dialog box 510 provides a number of pulldown menus and input fields which allows a researcher to tailor a search. As shown in search dialog box 510, the researcher is able to construct a search that finds all spots or wells throughout a project or only for the currently displayed microarrays or microplates. As shown in the embodiment of Fig. 5, the researcher has elected to find both spots and wells throughout the entire project matching the search criteria displayed. As shown in Fig. 5, the researcher has constructed a search looking for all spots having cDNA from chromosome 3, tissue sample from the brain, and a spot confidence exceeding fifty percent. Spots matching the search criteria are displayed to the user in a darker gray than other spots displayed to the user, although any color may be used to display qualifying spots to the researcher.
[0040] Fig. 6 shows a virtual array 620 created from the spot matching the search criteria of Fig. 5. As shown in the embodiment of Fig. 6, spot 620 (1) has been selected and various attribute values are displayed in the spot/well annotations window 640. Fig. 7 depicts a further query performed on the spots contained in virtual array 620 a search dialog box 720 is depicted in which a researcher is searching for those spots having a spot ratio greater than or equal to 2 (signifying a two-fold change in gene expression). The spots matching the search criteria depicted in search dialog box 720 may be highlighted by various colors or may themselves form a new virtual microarray for further searching or display to the researcher. As shown in Fig. 7, the spot well annotations window 640 depicts clone information corresponding to the selected spot 620 (1). In the embodiment shown in Fig. 7, the spot well annotations page includes a number of tabs (info, clone, intensities) which allows the researcher to select between various collections of data corresponding to the selected spot.
[0041] Because of the diversity and facility of modern programming practices, the classes of the present invention may take many different forms. In some embodiments, the classes are compiled binary objects in accord with CORBA, ActiveX, OpenDoc, or other object-oriented frameworks. In still other embodiments, the classes are binary executables compiled from files written in one or more programming languages including but not limited to C, C++, C#, Lisp, or Pascal.
[0042] Many alterations and modifications may be made by those having ordinary skill in the art without departing from the spirit and scope of the invention. Therefore, it must be expressly understood that the illustrated embodiment has been shown only for the purposes of example and should not be taken as limiting the invention, which is defined by the following claims. The following claims are thus to be read as not only literally including what is set forth by the claims but also to include all equivalent elements for performing substantially the same function in substantially the same way to obtain substantially the same result, even though not identical in other respects to what is shown and described in the-above illustrations.

Claims

CLAIMS What is claimed is: 1. A virtual microplate corresponding to a plurality of physical microplates, the virtual microplate comprising: a first virtual well associated with a first physical well of a first physical microplate; and a second virtual well associated with a second physical well of a second physical microplate. 2. A method for visualizing data comprising the steps of: providing a first data array; providing a second data array; each of said first and second data arrays comprising a plurality of locations, each of said plurality of locations comprising a plurality of attributes associated with said location; and generating a virtual data array comprising a plurality of locations, each location associated with a location in said first data array or said second data array, each location comprising plurality of attributes associated with said location on said first and data array.
PCT/US2001/021807 2000-07-12 2001-07-09 Method and apparatus for visualizing complex data sets WO2002005209A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001273343A AU2001273343A1 (en) 2000-07-12 2001-07-09 Method and apparatus for visualizing complex data sets

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US21787300P 2000-07-12 2000-07-12
US60/217,873 2000-07-12
US90046401A 2001-07-06 2001-07-06
US09/900,464 2001-07-06

Publications (3)

Publication Number Publication Date
WO2002005209A2 WO2002005209A2 (en) 2002-01-17
WO2002005209A3 WO2002005209A3 (en) 2002-04-04
WO2002005209A9 true WO2002005209A9 (en) 2003-04-03

Family

ID=26912332

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/021807 WO2002005209A2 (en) 2000-07-12 2001-07-09 Method and apparatus for visualizing complex data sets

Country Status (2)

Country Link
AU (1) AU2001273343A1 (en)
WO (1) WO2002005209A2 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003088125A2 (en) * 2002-04-10 2003-10-23 Transtech Pharma, Inc. System and method for integrated computer-aided molecular discovery
US20070011201A1 (en) * 2005-06-30 2007-01-11 Yerramalli Subramaniam Interface method and system for genetic analysis data
US20160012115A1 (en) 2013-02-28 2016-01-14 Celal Korkut Vata Combinational data mining

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1985995A3 (en) * 1996-08-16 2009-09-16 GE Healthcare Niagara Inc. A digital imaging system for assays in well plates, gels and blots

Also Published As

Publication number Publication date
WO2002005209A2 (en) 2002-01-17
AU2001273343A8 (en) 2002-04-18
AU2001273343A1 (en) 2002-01-21
WO2002005209A3 (en) 2002-04-04

Similar Documents

Publication Publication Date Title
US20020052882A1 (en) Method and apparatus for visualizing complex data sets
US6263287B1 (en) Systems for the analysis of gene expression data
US10872681B2 (en) Differential filtering of genetic data
US20030171876A1 (en) System and method for managing gene expression data
US20040002818A1 (en) Method, system and computer software for providing microarray probe data
US20030009295A1 (en) System and method for retrieving and using gene expression data from multiple sources
US20040126840A1 (en) Method, system and computer software for providing genomic ontological data
US20030100999A1 (en) System and method for managing gene expression data
US20060020398A1 (en) Integration of gene expression data and non-gene data
US20030097222A1 (en) Method, system, and computer software for providing a genomic web portal
US20030100995A1 (en) Method, system and computer software for variant information via a web portal
US20030120432A1 (en) Method, system and computer software for online ordering of custom probe arrays
JP2003521057A (en) Methods, systems and computer software for providing a genomic web portal
US20070087368A1 (en) Method, System and Computer Software Providing a Genomic Web Portal for Functional Analysis of Alternative Splice Variants
JP2001511529A (en) Method and apparatus for providing a bioinformatics database
NZ518022A (en) Analysis and diagnosis by intelligent processing of remotely generated genetic hybridization profiles
AU781841B2 (en) Graphical user interface for display and analysis of biological sequence data
WO2002071059A1 (en) A system and method for managing gene expression data
Dresen et al. Software packages for quantitative microarray-based gene expression analysis
US20070148658A1 (en) Systems and methods for biopolymeric probe design using graphical representation of a biopolymeric sequence
WO2002005209A9 (en) Method and apparatus for visualizing complex data sets
Dahlquist Using Gen MAPP and MAPPFinder to View Microarray Data on Biological Pathways and Identify Global Trends in the Data
US8271427B1 (en) Computer database system for single molecule data management and analysis
WO2000016220A9 (en) Method and apparatus for providing an expression data mining database and laboratory information management
EP1396800A2 (en) Method and apparatus for providing a bioinformatics database

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

COP Corrected version of pamphlet

Free format text: PAGES 1/7-7/7, DRAWINGS, REPLACED BY NEW PAGES 1/7-7/7; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP