Computer systems and methods for automatically viewing multidimensional databases
A method for automatically forming the clearest and most useful visual plot for a given dataset of tuples. A best view type is selected for a view that includes a subsequently added new field. The visual plot is populated with the data in the view and then automatically rendered for the user. A dataset that is retrieved from a storage is analyzed to identify all the data types found in the dataset, and to determine the best view type to assign to the dataset's views. The visual plot is then populated with the data according to this best view type, and is automatically rendered for the user.
Latest Tableau Software LLC Patents:
- Intent driven dashboard recommendations
- Metadata inheritance for data assets
- Optimizing natural language analytical conversations using platform-specific input and output interface functionality
- Using an object model to view data associated with data marks in a data visualization
- Systems and methods for visualizing object models of database tables
This invention relates generally to computer systems and methods for displaying data such as database information. The invention relates specifically to a computer system and method for displaying data clearly and effectively based upon the types of data found in a dataset.
BACKGROUND OF THE INVENTIONData is more than the numbers, values, or predicates of which it is comprised. Data resides in multi-dimensional spaces which harbor rich and variegated landscapes that are not only strange and convoluted, but are not readily comprehendible by the human brain. The most complicated data arises from measurements or calculations that depend on many apparently independent variables. Data sets with hundreds of variables arise today in many contexts, including, for example: gene expression data for uncovering the link between the genome and the various proteins for which it codes; demographic and consumer profiling data for capturing underlying sociological and economic trends; sales and marketing data for huge numbers of products in vast and ever-changing marketplaces; and environmental measurements for understanding phenomena such as pollution, meteorological changes and resource impact issues. International research projects such as the Human Genome Project and the Sloan Digital Sky Survey are also forming massive scientific databases. Furthermore, corporations are creating large data warehouses of historical data on key aspects of their operations. Corporations are also using desktop applications to create many small databases for examining specific aspects of their business.
One challenge with any of these databases is the extraction of meaning from the data they contain: to discover structure, find patterns, and derive causal relationships. Often, the sheer size of these data sets complicates this task and means that interactive calculations that require visiting each record are not plausible. It may also be infeasible for an analyst to reason about or view the entire data set at its finest level of detail. Even when the data sets are small, however, their complexity often makes it difficult to glean meaning without aggregating the data or creating simplifying summaries.
Among the principal operations that may be carried out on data, such as regression, clustering, summarization, dependency modeling, and classification, the ability to see patterns rapidly is of paramount importance. Data comes in many forms, and the most appropriate way to display data in one form may not be the best for another. In the past, where it has been recognized that many methods of display are possible, it has been a painstaking exercise to select the most appropriate one. However, identifying the most telling methods of display can be intimately connected to identifying the underlying structure of the data itself.
Business intelligence is one rapidly growing area that benefits considerably from tools for interactive visualization of multi-dimensional databases. A number of approaches to visualizing such information are known in the art. However, although software programs that implement such approaches are useful, they are often unsatisfactory. Such programs have interfaces that require the user to select the most appropriate way to display the information.
Visualization is a powerful tool for exploring large data, both by itself and coupled with data mining algorithms. However, the task of effectively visualizing large databases imposes significant demands on the human-computer interface to the visualization system. The exploratory process is one of hypothesis, experiment, and discovery. The path of exploration is unpredictable, and analysts need to be able to easily change both the data being displayed and its visual representation. Furthermore, the analyst should be able to first reason about the data at a high level of abstraction, and then rapidly drill down to explore data of interest at a greater level of detail. Thus, a good interface both exposes the underlying hierarchical structure of the data and supports rapid refinement of the visualization.
Currently, Tableau's software and Microsoft's Excel are examples of visualization software that create views of datasets. Specifically, Tableau Table Drop allows users to drag data fields onto a Tableau view to create a graphical views. When the view was a text table, the behavior was similar to the drags supported by Excel Pivot Tables. For example, dragging a quantitative data type (Q) onto a text table (X=O Y=O T=Q, where “O” stands for ordinal data), would extend the table to put the two measures next to each other (X=O Y=O,Om T=Qm, where “Om” stands for measure ordinal data and “Qm” stands for measure quantitative data). However, Tableu's Table Drop has functionality not found in Excel's Pivot Tables in that it may change the view type of a view when fields are dragged onto the view. For example, dragging a Q onto a bar chart (X=O Y=Q) created a stacked bar chart (X=O Y=Qm C=Om). Or, if there was already a field with a color encoding (X=O Y=Q C=F) in the view, then the software would transform the Q data into Qm data, and would place the measure names on the Level of Detail encoding (X=O Y=Qm C=F L=Om). With scatter plots, the logic was similar, except the transformation of Q to Qm and placement of measure names on the Level of Detail encoding would be triggered if an existing field already had a shape encoding.
In addition to various software programs, the known art further provides formal graphical presentations. Bertin's Semiology of Graphics, University of Wisconsin Press, Madison Wis. (1983), is an early attempt at formalizing graphic techniques. Bertin developed a vocabulary for describing data and techniques for encoding the data into a graphic. Bertin identified retinal variables (position, color, size, etc.) in which data can be encoded. Cleveland (The Elements of Graphing Data, Wadsworth Advanced Books and Software, (1985), Pacific Grove, Calif. and Visualizing Data, (1993), Hobart Press) used theoretical and experimental results to determine how well people can use these different retinal properties to compare quantitative variations.
Mackinlay's APT system (ACM Trans. Graphics, 5, 110-141, (1986)) was one of the first applications of formal graphical specifications to computer generated displays. APT uses a graphical language and a hierarchy of composition rules that are searched through in order to generate two-dimensional displays of relational data. The Sage system (Roth, et al., (1994), Proc. SIGCHI '94, 112-117) extends the concepts of APT, providing a richer set of data characterizations and forming a wider range of displays. The existing art also provides for the assignment of a mark based upon the innermost data column and row of a dataset (Hanrahan, et al., U.S. Pat. application Ser. No. 11/005,652, “Computer System and Methods for Visualizing Data with Generation of Marks”). Heuristically guided searches have also been used to generate visualizations of data (Agrawala, et al., U.S. Pat. No. 6,424,933, “System and Method for Non-Uniform Scaled Mapping”).
A drawback with the formal graphical specifications of the art is that they do not provide any guidance to a user as to useful and clear visual formats in which a set of data could be rendered. The rendering of the data is such that there is no analysis to examine the resulting visualization for clarity or usefulness. Further, in the use of heuristic searches (trial-and-error method), the searches fail, leaving the user with the problem of finding clear or useful views. Heuristic algorithms can have complex behavior that creates a poor user experience. When a user does not understand why a heuristic algorithm generates certain views, the algorithm becomes unpredictable to the user and the user will not be inclined to use the algorithm.
Based on the background state of the art, as described herein, what is needed are improved methods and graphical interfaces wherein the initial visualization of data has been determined to be a clear and useful visualization, and this visualization is then automatically presented to the user.
SUMMARY OF THE INVENTIONThe present invention provides improved methods for visualizing data.
A first aspect of the invention provides a computer implemented method for automatically and visually displaying a graphical representation of a dataset, comprising: receiving a user selected and ordered plurality of fields; selecting a resulting view for displaying the dataset based on the order of the user selected fields; and displaying the dataset or a transformation of the dataset according to the resulting view. In one embodiment, the dataset is retrieved from a remote database. In another embodiment, rules are used to select the resulting view. In yet another embodiment, the rules are predetermined. In other embodiments, the rules are determined by the user's preferences or usage. In a further embodiment, heuristics are used to select the resulting view.
A second aspect of the invention provides a computer implemented method for automatically and visually displaying a graphical representation of a dataset with a plurality of tuples, comprising: forming a plurality of rated alternative views, each alternative view showing all tuples, or a transformation of all tuples, in the dataset; selecting a resulting view from the plurality of alternative views, based upon a user selected option; and displaying the dataset according to the resulting view. In one embodiment, the dataset is retrieved from a remote database. In another embodiment, rules are used to select the resulting view. In yet another embodiment, the rules are predetermined. In other embodiments, the rules are determined by the user's preferences or usage. In a further embodiment, heuristics are used to select the resulting view. In yet a further embodiment, when the user selected option is a first option, the selecting step further comprises: ranking the plurality of alternative views according to a rating system; and assigning the resulting view as the highest ranked alternative view. In still another embodiment, when the user selected option is a second option, the selecting step further comprises: displaying a list of the alternative views; receiving the user's selection of an alternative view; and assigning the resulting view as the alternative view selected by the user.
A third aspect of the invention provides a computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage medium and a computer program mechanism embedded therein, the computer program mechanism for automatically and visually displaying a graphical representation of a dataset, the computer program mechanism comprising: a field receiver for receiving a user selected and ordered plurality of fields; a resulting view selector for selecting a resulting view for displaying the dataset based on the order of the user selected fields; and a dataset displayer for displaying the dataset or a transformation of the dataset according to the resulting view. In one embodiment, the dataset is retrieved from a remote database. In another embodiment, rules are used to select the resulting view. In yet another embodiment, the rules are predetermined. In other embodiments, the rules are determined by the user's preferences or usage. In a further embodiment, heuristics are used to select the resulting view.
A fourth aspect of the invention provides a computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage medium and a computer program mechanism embedded therein, the computer program mechanism for automatically and visually displaying a graphical representation of a dataset with a plurality of tuples, the computer program mechanism comprising: an alternative view former for forming a plurality of rated alternative views, each alternative view showing all tuples, or a transformation of all tuples, in the dataset; a resulting view selector for selecting a resulting view from the plurality of alternative views, based upon a user selected option; and a dataset displayer for displaying the dataset according to the resulting view. In one embodiment, the dataset is retrieved from a remote database. In another embodiment, rules are used to select the resulting view. In yet another embodiment, the rules are predetermined. In other embodiments, the rules are determined by the user's preferences or usage. In a further embodiment, heuristics are used to select the resulting view. In yet a further embodiment, when the user selected option is a first option, the resulting view selector further comprises: an alternative view ranker for ranking the plurality of alternative views according to a rating system; and a view assignor for assigning the resulting view as the highest ranked alternative view. In still another embodiment, when the user selected option is a second option, the resulting view selector further comprises: a list displayer for displaying a list of the alternative views; a selection receiver for receiving the user's selection of an alternative view; and a view assignor for assigning the resulting view as the alternative view selected by the user.
A fifth aspect of the invention provides a computer system for automatically and visually displaying a graphical representation of a dataset, the computer system comprising: a central processing unit; a memory, coupled to the central processing unit, the memory storing: the dataset; a programming module comprising, comprising: instructions for receiving a user selected and ordered plurality of fields; instructions for selecting a resulting view for displaying the dataset based on the order of the user selected fields; and instructions for displaying the dataset or a transformation of the dataset according to the resulting view. In one embodiment, the dataset is retrieved from a remote database. In another embodiment, rules are used to select the resulting view. In yet another embodiment, the rules are predetermined. In other embodiments, the rules are determined by the user's preferences or usage. In a further embodiment, heuristics are used to select the resulting view.
A sixth aspect of the invention provides a computer system for automatically and visually displaying a graphical representation of a dataset with a plurality of tuples, the computer system comprising: a central processing unit; a memory, coupled to the central processing unit, the memory storing: the dataset; a programming module comprising: instructions for forming a plurality of rated alternative views, each alternative view showing all tuples, or a transformation of all tuples, in the dataset; instructions for selecting a resulting view from the plurality of alternative views, based upon a user selected option; and instructions for displaying the dataset according to the resulting view. In one embodiment, the dataset is retrieved from a remote database. In another embodiment, rules are used to select the resulting view. In yet another embodiment, the rules are predetermined. In other embodiments, the rules are determined by the user's preferences or usage. In a further embodiment, heuristics are used to select the resulting view. In yet a further embodiment, when the user selected option is a first option, the instructions for selecting further comprises: instructions for ranking the plurality of alternative views according to a rating system; and instructions for assigning the resulting view as the highest ranked alternative view. In still another embodiment, when the user selected option is a second option, the instructions for selecting further comprises: instructions for displaying a list of the alternative views; instructions for receiving the user's selection of an alternative view; and instructions for assigning the resulting view as the alternative view selected by the user.
Like reference numerals refer to corresponding parts throughout the several views of the drawings.
DETAILED DESCRIPTION OF THE INVENTIONThe present invention provides methods, computer program products, and computer systems for automatically providing a user with a clear and useful view of a dataset. In a typical embodiment, the present invention builds and displays a view of a dataset as a user adds fields to the dataset or as a dataset is accessed, such that the view is clear and useful, and is automatically presented to the user. An advantage of the present invention is that data is presented in a clear and useful form automatically.
The present invention operates on a set of data, called a dataset, that are made up of tuples. As one skilled in the art will realize, the dataset can be a relational database, a multidimensional database, a semantic abstraction of a relational database, or an aggregated or unaggregated subset of a relational database, multidimensional database, or semantic abstraction. Fields are categorizations of data in a dataset. A tuple is an item of data (such as a record) from a dataset, specified by attributes from fields in the dataset. A search query across the dataset will return one or more tuples. Fields contain data that are of particular types, and each field is of a particular type. These types include:
Measure names may include an ordinal field whose domain is the name of one or more Qd fields. Measure values may include a dependent quantitative field whose domain and values are the blending of the Qd fields whose names appear in the domain of measure names.
A view is a visual representation of a dataset or a transformation of that dataset. Text table, bar chart, and scatter plots are all examples of types of views. Views contain marks that represent one or more tuples in a dataset. In other words, marks are visual representations of tuples in a view. A mark is typically associated with a type of graphical display. Some examples of views and their associated marks are as follows:
In
The computer system modules used to perform this embodiment of the invention are shown in
According to one embodiment of the invention, resulting view selector 110 selects the resulting view by choosing rule(s) for adding the user selected ordered fields (step 208). This is accomplished by rule chooser 114. Rule applier 116 then applies the rule(s) to determine the resulting view's view type (step 210). In another embodiment of the invention, before rule chooser 114 chooses rule(s), view determiner 118 determines whether a first view exists (step 212). In yet another embodiment of the invention, the dataset is displayed in step 206 when mark chooser 126 chooses a mark for the resulting view (step 218), and dataset renderer 128 renders the dataset according to the mark (step 220).
The sets of rules are organized first by the type of the field that is dropped (e.g. O or Qd), and then by the type of the view that the field is being dropped onto. The rules are further broken down by the type of the view. The type of a view depends on their innermost row and column. For example, OO is a view with ordinal fields in the row and column; OQ is a view with an ordinal field in the row and a quantitative field in the column; and φ is an empty view with no fields. For each type of field being dropped, a rule table is shown containing the rules for each type of view into which the field is being dropped. The columns of the rule tables represent the contents of the innermost field on the column (X), and the rows of the rules table the innermost field on the row (Y).
In step 208, rule(s) for adding the user selected field's data type are chosen. For example, if a user selected field is an ordinal, then the set of rules in
The order in which fields are added affect the view type of the resulting view. For example, if a measure data type field is added to an empty view, and is subsequently followed by a dimension data type field, the resulting view will be a bar chart. However, if a measure data type field is added to an empty view subsequent to a dimension data type field, then the resulting view will be a text table. The resulting view's view type is thusly selected based upon a set of rules. The view type is then assigned to the resulting view and the view is then populated with data from the dataset. In one embodiment, the set of rules are predetermined. In another embodiment, the set of rules are based upon a user's preferences or actual usage. For example, a user may be given the opportunity to designate the best view type for various sequences of the addition of fields to views. Or, after the visual plot is populated and rendered for the user, the user is allowed to choose a different rendering. The user's choice as to the ultimate resulting view, if recorded, may indicate the user's preference for what view type the user considers the clear and/or useful. In yet another embodiment, heuristics may be used instead of a set of rules for selecting a resulting view.
In one embodiment, the cardinalities of the fields in the resulting view are computed and are considered in determining how the user selected fields are added. In set theory, cardinality is the size of a set. In the present invention, cardinality refers to the number of distinct instances that are associated with a field's type. For example, if a field type is “States of America”, then the cardinality of such a field would be 50.
In another embodiment, the functional dependency of the fields in the resulting view are computed and are considered in determining how the user selected fields are added. Functional dependency refers to the determination of one field by another field. For example, if one field is of the type “States of America,” and a second field is “Inches of Rainfall of the States of America,” then the second field depends upon the first. Another example is shown in
In yet another embodiment, in the application of the selected rule to populate the resulting view with data from the dataset, a mark is chosen for the resulting view's view type and the data from the dataset is rendered according to the mark. This is shown in
Now, referring to
In another embodiment, alternative views are formed based upon a set of criteria.
In one embodiment, if the user selected a first option, then the alternative views are ranked according to a rating system by alternative view ranker 134 in step 608. View assignor 120 then assigns the resulting view as the highest ranked alternative view at step 610. Dataset displayer 112 then displays the dataset according to the resulting view in step 606. For example, if all the data in a dataset is aggregated and does not contain any independently quantitative data, then alternative views of all the view types listed in
In another embodiment, if the user selected a second option, then a list of alternative views would be displayed by list displayer 136 at step 612 for the user's selection. After the user's selection is received at step 614 by selection receiver 138, the resulting view is assigned as the alternative view that the user selected by view assignor 120 at step 616, and dataset displayer 112 then displays the dataset according to the resulting view in step 606.
In yet another embodiment of the invention, cardinality computer 122 computes the cardinality of the fields in the plurality of tuples when forming the alternative views. In a further embodiment, functional dependency computer 124 computes the functional dependency of the fields in the plurality of tuples when forming the alternative views.
The present invention not only accepts datasets and databases as inputs, it also accepts views as inputs. A view can be used to represent a set of fields. Resulting views can also depend on the existing view. For example, rules or operators can take into account the current view to generate a new view that is related to the current view. Also, as one skilled in the art will realize, many other rules are possible, include ones to generate statistical, maps, pie charts, and three dimensional views of data.
The present invention can be implemented as a computer program product that comprises a computer program mechanism embedded in a computer readable storage medium. For instance, the computer program product could contain the program modules shown in
Many modifications and variations of this invention can be made without departing from its spirit and scope, as will be apparent to those skilled in the art. The specific embodiments described herein are offered by way of example only, and the invention is to be limited only by the terms of the appended claims, along with the full scope of equivalents to which such claims are entitled.
All references cited herein are incorporated herein by reference in their entirety and for all purposes to the same extent as if each individual publication or patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety for all purposes.
Claims
1. A computer implemented method for displaying a graphical representation of a dataset, comprising:
- at a computer having one or more processors and memory storing programs executed by the one or more processors:
- displaying a graphical user interface window including a schema display region and a data visualization region, wherein the schema display region includes multiple terms corresponding to a plurality of fields of the dataset, wherein each field defines a respective category of the dataset;
- receiving a user selection of at least a subset of the multiple terms in the schema display region and a user ordering of the user-selected terms in the data visualization region, each of the user-selected terms having an associated data type; and
- in response to the user selection and the user ordering, forming a plurality of alternative views using the dataset based on the user ordering of the user-selected terms; determining a rank for each of the plurality of alternative views based on the user-specified order and the associated data types of the user-selected terms and a rating system; selecting from among the plurality of alternative views a resulting view for displaying the dataset based on comparing the rank of the resulting view with the rank of at least another view of the plurality of alternative views; and displaying the dataset or a transformation of the dataset in the data visualization region according to the resulting view.
2. The computer implemented method of claim 1, wherein the selecting step further comprises:
- with no further user instruction: choosing one or more rules that are associated with adding the user selected fields; and applying the one or more rules to determine the resulting view's view type.
3. The computer implemented method of claim 2, further comprising, before the choosing step:
- determining whether a first view exists;
- wherein the one or more rules selected is also based upon the first view, if the first view is determined to exist.
4. The computer implemented method of claim 1, wherein the dataset is a database.
5. The computer implemented method of claim 4, wherein the database is remote.
6. The computer implemented method of claim 4, wherein the database is a relational database.
7. The computer implemented method of claim 4, wherein the database is a multidimensional database.
8. The computer implemented method of claim 4, wherein the database contains semantic extractions of another database.
9. The computer implemented method of claim 1, wherein the dataset is retrieved from a remote database.
10. The computer implemented method of claim 2, wherein the rules are predetermined.
11. The computer implemented method of claim 2, wherein the rules relate to user preference.
12. The computer implemented method of claim 2, wherein the rules relate to user usage.
13. The computer implemented method of claim 1, the selecting step further comprising:
- computing the cardinalities of all the fields in the resulting view.
14. The computer implemented method of claim 2, wherein the rules relate to the cardinalities of all the fields in the resulting view.
15. The computer implemented method of claim 1, the selecting step further comprising:
- computing the functional dependencies of all the fields in the resulting view, using an algorithm.
16. The computer implemented method of claim 2, wherein the rules relate to the functional dependency of all the fields in the resulting view.
17. The computer implemented method of claim 1, the displaying of the dataset step further comprising:
- choosing a mark for the resulting view; and
- rendering the dataset according to the mark.
18. The computer implemented method of claim 1, wherein the receiving step is accomplished when the user drags and drops a representation of a field.
19. The computer implemented method of claim 1, wherein the receiving step is accomplished when the user double clicks a representation of a field.
20. The computer implemented method of claim 1, wherein the receiving step is accomplished when the user right clicks a representation of a field and selects an option to drop or add the field.
21. The computer implemented method of claim 1, wherein the receiving step is accomplished when the user inputs text to designate the plurality of user selected fields and their order.
22. The computer implemented method of claim 2, wherein the one or more rules state that the resulting view is a text table when the resulting view's first field is of the dimension data type.
23. The computer implemented method of claim 2, wherein the one or more rules state that the resulting view is a bar chart when the resulting view's first field is of the measure data type, the resulting view's second field is of the dimension data type, and the resulting view has no field of the date dimension data type.
24. The computer implemented method of claim 2, wherein the one or more rules state that the resulting view is a line when the resulting view's first field is of the measure data type, the resulting view's second field is of the dimension data type, and the resulting view has at least one field of the date dimension data type.
25. The computer implemented method of claim 2, wherein the one or more rules state that the resulting view is a discrete line when the resulting view's first field is of the date dimension data type, and the resulting view's second field is of the measure data type.
26. The computer implemented method of claim 2, wherein the one or more rules state that the resulting view is a continuous line when the resulting view's first field is of the continuous dimension data type, and the resulting view's second field is of the measure data type.
27. The computer implemented method of claim 2, wherein the one or more rules state that the resulting view is a scatter plot when the resulting view's first and second fields are of the measure data type, and the resulting view's subsequent fields are of the dimension data type.
28. The computer implemented method of claim 2, wherein the one or more rules state that the resulting view is a scatter matrix when the resulting view's first, second, and a subsequent field is of the measure data type.
29. The computer implemented method of claim 17, wherein when the resulting view is a text table, the mark is text.
30. The computer implemented method of claim 17, wherein when the resulting view is a heat map, the mark is a square.
31. The computer implemented method of claim 17, wherein when the resulting view is a bar chart, the mark is a bar.
32. The computer implemented method of claim 17, wherein when the resulting view is a line graph, the mark is a line segment.
33. The computer implemented method of claim 17, wherein when the resulting view is a circle chart, the mark is a circle.
34. The computer implemented method of claim 17, wherein when the resulting view is a Gantt plot, the mark is a bar.
35. The computer implemented method of claim 17, wherein when the resulting view is a scatter plot, the mark is a point or shape.
36. The computer implemented method of claim 1, wherein the selecting step uses heuristics.
37. A computer implemented method for displaying a graphical representation of a dataset with a plurality of tuples, comprising:
- at a computer having one or more processors and memory storing programs executed by the one or more processors:
- displaying a graphical user interface window including a schema display region and a data visualization region, wherein the schema display region includes multiple terms corresponding to a plurality of fields of the dataset, wherein each field defines a respective category of the dataset;
- receiving a user instruction for generating a graphical representation of the dataset, wherein the user instruction includes a user selection of at least a subset of multiple terms in the schema display region and a user ordering of the user-selected terms in the data visualization region, each of the user-selected terms having an associated data type; and
- in response to the user selection and the user ordering, forming a plurality of alternative views based on the user ordering of the user-selected terms, each alternative view showing all tuples, or a transformation of all tuples, in the dataset; determining a rank for each of the plurality of alternative views based on the user-specified order and the associated data types of the user-selected terms and a rating system; selecting a resulting view from the plurality of rated alternative views, based upon a user selected option and comparing the rank of the resulting view with the rank of at least another view of the plurality of alternative views; and displaying the dataset or a transformation of the dataset in the data visualization region according to the resulting view.
38. The computer implemented method of claim 37, wherein the dataset is a database.
39. The computer implemented method of claim 38, wherein the database is remote.
40. The computer implemented method of claim 38, wherein the database is a relational database.
41. The computer implemented method of claim 38, wherein the database is a multidimensional database.
42. The computer implemented method of claim 38, wherein the database contains semantic extractions of another database.
43. The computer implemented method of claim 37, wherein the dataset is retrieved from a remote database.
44. The computer implemented method of claim 37, wherein the forming step further comprises:
- computing the cardinality of the fields in the plurality of tuples.
45. The computer implemented method of claim 37, wherein the forming step further comprises:
- computing the functional dependency on the plurality of tuples, using an algorithm.
46. The computer implemented method of claim 37, wherein, when the user selected option is a first option, the selecting step further comprises:
- ranking the plurality of alternative views according to a rating system; and
- assigning the resulting view as the highest ranked alternative view.
47. The computer implemented method of claim 46, wherein the rating system is predetermined.
48. The computer implemented method of claim 46, wherein the rating system relates to user preference.
49. The computer implemented method of claim 46, wherein the rating system relates to user usage.
50. The computer implemented method of claim 37, wherein, when the user selected option is a second option, the selecting step further comprises:
- displaying a list of the alternative views;
- receiving the user's selection of an alternative view; and
- assigning the resulting view as the alternative view selected by the user.
51. The computer implemented method of claim 37, wherein the forming step further comprises:
- determining the applicable view types according to the dataset's data types;
- identifying the associated mark for each applicable view type; and
- forming an alternative view for each applicable view type according to the associated mark.
52. The computer implemented method of claim 51, wherein the determining step is accomplished using a set of criteria.
53. The computer implemented method of claim 52, wherein the set of criteria is predetermined.
54. The computer implemented method of claim 52, wherein the set of criteria relates to user preference.
55. The computer implemented method of claim 52, wherein the set of criteria relates to user usage.
56. The computer implemented method of claim 52, wherein an applicable view type is a text table and the associated mark for such an applicable view type is text, if all the data contained in the dataset are aggregated according to their fields, and no data type is independently quantitative.
57. The computer implemented method of claim 52, wherein an applicable view type is a heat map and the associated mark for such an applicable view type is a colored shape, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, no more than two data types are dependently quantitative, and no data type is independently quantitative.
58. The computer implemented method of claim 52, wherein an applicable view type is a Gantt plot and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, less than three data types are dependently quantitative, and at least one data type is either independently quantitative or a date.
59. The computer implemented method of claim 52, wherein an applicable view type is a side-by-side bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, at least one data type is dependently quantitative, and no data type is independently quantitative.
60. The computer implemented method of claim 52, wherein an applicable view type is a stacked bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, no more than two data types are ordinal, and no data types are independently quantitative.
61. The computer implemented method of claim 52, wherein an applicable view type is a measure bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in dataset are aggregated according to their fields, at least one data type is ordinal, no more than two data types are dependently quantitative, and no data types are independently quantitative.
62. The computer implemented method of claim 52, wherein an applicable view type is a dimension line graph and the associated mark for such an applicable view type is a line, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, at least one data type is a date, and no data types are independently quantitative.
63. The computer implemented method of claim 52, wherein an applicable view type is a measure line graph and the associated mark for such an applicable view type is a line, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, and at least one data type is either independently quantitative or a date.
64. The computer implemented method of claim 52, wherein an applicable view type is a circle chart and the associated mark for such an applicable view type is a circle, if at least one data type is ordinal, at least one data type is dependently quantitative, and no data type can be aggregated or is independently quantitative.
65. The computer implemented method of claim 52, wherein an applicable view type is a single scatter chart and the associated mark for such an applicable view type is a point, if the data contained in the dataset included between two to four dependently quantitative data types, and no independently quantitative data type.
66. The computer implemented method of claim 52, wherein an applicable view type is a matrix scatter chart and the associated mark for such an applicable view type is a point, if the dataset included at least three dependently quantitative data types, and no independently quantitative data type.
67. The computer implemented method of claim 52, wherein an applicable view type is a histogram and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, and no data types are independently quantitative.
68. The computer implemented method of claim 37, wherein the selecting step uses heuristics.
69. A computer program product including a computer readable storage medium and a computer program mechanism embedded therein, the computer program mechanism comprising instructions that are executed to display a graphical representation of a dataset, the instructions comprising:
- a module for displaying a graphical user interface window including a schema display region and a data visualization region, wherein the schema display region includes multiple terms corresponding to a plurality of fields of the dataset, wherein each field defines a respective category of the dataset;
- a field receiver for receiving a user selection of at least a subset of the multiple terms in the schema display region and a user ordering of the user-selected terms in the data visualization region, wherein each user-selected term has an associated data type;
- a resulting view selector for, in response to the user selection and the user ordering, forming a plurality of alternative views using the dataset based on the user ordering of the user-selected terms; determining a rank for each of the plurality of alternative views based on the user-specified order and the associated data types of the user-selected terms and a rating system; selecting a resulting view from the plurality of alternative views for displaying the dataset based on comparing the rank of the resulting view with the rank of at least another view of the plurality of alternative views; and
- a dataset displayer for displaying the dataset or a transformation of the dataset according to the resulting view in the data visualization region.
70. The computer program product of claim 69, wherein the resulting view selector further comprises:
- with no further user instruction: a rule chooser for choosing one or more rules that are associated with adding the user selected fields; and a rule applier for applying the one or more rules to determine the resulting view's view type.
71. The computer program product of claim 70, further comprising:
- a view determiner for determining whether a first view exists;
- wherein the one or more rules selected is also based upon the first view, if the first view is determined to exist.
72. The computer program product of claim 69, wherein the dataset is a database.
73. The computer program product of claim 72, wherein the database is remote.
74. The computer program product of claim 72, wherein the database is a relational database.
75. The computer program product of claim 72, wherein the database is a multidimensional database.
76. The computer program product of claim 72, wherein the database contains semantic extractions of another database.
77. The computer program product of claim 69, wherein the dataset is retrieved from a remote database.
78. The computer program product of claim 70, wherein the rules are predetermined.
79. The computer program product of claim 70, wherein the rules relate to user preference.
80. The computer program product of claim 70, wherein the rules relate to user usage.
81. The computer program product of claim 69, the resulting view selector further comprising:
- a cardinality computer computing the cardinalities of all the fields in the resulting view.
82. The computer program product of claim 70, wherein the rules relate to the cardinalities of all the fields in the resulting view.
83. The computer program product of claim 69, the resulting view selector further comprising:
- a functional dependency computer for computing the functional dependencies of all the fields in the resulting view, using an algorithm.
84. The computer program product of claim 70, wherein the rules relate to the functional dependency of all the fields in the resulting view.
85. The computer program product of claim 69, the dataset displayer further comprising:
- a mark chooser for choosing a mark for the resulting view; and
- a dataset renderer for rendering the dataset according to the mark.
86. The computer program product of claim 69, wherein the field receiver allows the user to drag and drop a representation of a field.
87. The computer program product of claim 69, wherein the field receiver allows the user to double click a representation of a field.
88. The computer program product of claim 69, wherein the field receiver allows the user to right click a representation of a field and selects an option to drop or add the field.
89. The computer program product of claim 69, wherein the field receiver allows the user to designate the plurality of user selected fields and their order.
90. The computer program product of claim 70, wherein the one or more rules state that the resulting view is a text table when the resulting view's first field is of the dimension data type.
91. The computer program product of claim 70, wherein the one or more rules state that the resulting view is a bar chart when the resulting view's first field is of the measure data type, the resulting view's second field is of the dimension data type, and the resulting view has no field of the date dimension data type.
92. The computer program product of claim 70, wherein the one or more rules state that the resulting view is a line when the resulting view's first field is of the measure data type, the resulting view's second field is of the dimension data type, and the resulting view has at least one field of the date dimension data type.
93. The computer program product of claim 70, wherein the one or more rules state that the resulting view is a discrete line when the resulting view's first field is of the date dimension data type, and the resulting view's second field is of the measure data type.
94. The computer program product of claim 70, wherein the one or more rules state that the resulting view is a continuous line when the resulting view's first field is of the continuous dimension data type, and the resulting view's second field is of the measure data type.
95. The computer program product of claim 70, wherein the one or more rules state that the resulting view is a scatter plot when the resulting view's first and second fields are of the measure data type, and the resulting view's subsequent fields are of the dimension data type.
96. The computer program product of claim 70, wherein the one or more rules state that the resulting view is a scatter matrix when the resulting view's first, second, and a subsequent field is of the measure data type.
97. The computer program product of claim 85, wherein when the resulting view is a text table, the mark is text.
98. The computer program product of claim 85, wherein when the resulting view is a heat map, the mark is a square.
99. The computer program product of claim 85, wherein when the resulting view is a bar chart, the mark is a bar.
100. The computer program product of claim 85, wherein when the resulting view is a line graph, the mark is a line segment.
101. The computer program product of claim 85, wherein when the resulting view is a circle chart, the mark is a circle.
102. The computer program product of claim 85, wherein when the resulting view is a Gantt plot, the mark is a bar.
103. The computer program product of claim 85, wherein when the resulting view is a scatter plot, the mark is a point or shape.
104. The computer program product of claim 69, wherein the selecting step uses heuristics.
105. A computer program product including a computer readable storage medium and a computer program mechanism embedded therein, the computer program mechanism comprising instructions that are executed to display a graphical representation of a dataset with a plurality of tuples, the instructions comprising:
- a module for displaying a graphical user interface window including a schema display region and a data visualization region, wherein the schema display region includes multiple terms corresponding to a plurality of fields of the dataset, wherein each field defines a respective category of the dataset;
- an alternative view former for forming a plurality of alternative views in response to a user instruction for generating a graphical representation of the dataset, each alternative view showing all tuples, or a transformation of all tuples, in the dataset, wherein the user instruction includes a user selection of at least a subset of multiple terms in the schema display region and a user ordering of the user-selected terms in the data visualization region, each of the user-selected terms having an associated data type, and determining a rank for each of the plurality of alternative views based on the user-specified order and the associated data types of the user-selected terms and a rating system, wherein the user instruction includes a user selection of at least a subset of multiple terms in the schema display region and a user ordering of the user-selected terms in the data visualization region, each of the user-selected terms having an associated data type;
- a resulting view selector for selecting a resulting view from the plurality of rated alternative views in response to the user instruction, based upon a user selected option and comparing the rank of the resulting view with the rank of at least another view of the plurality of alternative views; and
- a dataset displayer for displaying the dataset or a transformation of the dataset in the data visualization region according to the resulting view in response to the user instruction.
106. The computer program product of claim 105, wherein the dataset is a database.
107. The computer program product of claim 106, wherein the database is remote.
108. The computer program product of claim 106, wherein the database is a relational database.
109. The computer program product of claim 106, wherein the database is a multidimensional database.
110. The computer program product of claim 106, wherein the database contains semantic extractions of another database.
111. The computer program product of claim 105, wherein the dataset is retrieved from a remote database.
112. The computer program product of claim 105, wherein the alternative view former further comprises:
- a cardinality computer for computing the cardinality of the fields in the plurality of tuples.
113. The computer program product of claim 105, wherein the alternative view former further comprises:
- a functional dependency computer for computing the functional dependency on the plurality of tuples, using an algorithm.
114. The computer program product of claim 105, wherein, when the user selected option is a first option, the resulting view selector further comprises:
- an alternative view ranker for ranking the plurality of alternative views according to a rating system; and
- a view assignor for assigning the resulting view as the highest ranked alternative view.
115. The computer program product of claim 114, wherein the rating system is predetermined.
116. The computer program product of claim 114, wherein the rating system relates to user preference.
117. The computer program product of claim 114, wherein the rating system relates to user usage.
118. The computer program product of claim 105, wherein, when the user selected option is a second option, the resulting view selector further comprises:
- a list displayer for displaying a list of the alternative views;
- a selection receiver for receiving the user's selection of an alternative view; and
- a view assignor for assigning the resulting view as the alternative view selected by the user.
119. The computer program product of claim 105, wherein the alternative view former forms an alternative view for each applicable view type according to an associated mark, the alternative view former further comprising:
- a view type determiner for determining the applicable view types according to the dataset's data types; and
- a mark chooser for choosing the associated mark for each applicable view type.
120. The computer program product of claim 119, wherein the view type determiner uses a set of criteria.
121. The computer program product of claim 120, wherein the set of criteria is predetermined.
122. The computer program product of claim 120, wherein the set of criteria relates to user preference.
123. The computer program product of claim 120, wherein the set of criteria relates to user usage.
124. The computer program product of claim 120, wherein an applicable view type is a text table and the associated mark for such an applicable view type is text, if all the data contained in the dataset are aggregated according to their fields, and no data type is independently quantitative.
125. The computer program product of claim 120, wherein an applicable view type is a heat map and the associated mark for such an applicable view type is a colored shape, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, no more than two data types are dependently quantitative, and no data type is independently quantitative.
126. The computer program product of claim 120, wherein an applicable view type is a Gantt plot and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, less than three data types are dependently quantitative, and at least one data type is either independently quantitative or a date.
127. The computer program product of claim 120, wherein an applicable view type is a side-by-side bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, at least one data type is dependently quantitative, and no data type is independently quantitative.
128. The computer program product of claim 120, wherein an applicable view type is a stacked bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, no more than two data types are ordinal, and no data types are independently quantitative.
129. The computer program product of claim 120, wherein an applicable view type is a measure bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in dataset are aggregated according to their fields, at least one data type is ordinal, no more than two data types are dependently quantitative, and no data types are independently quantitative.
130. The computer program product of claim 120, wherein an applicable view type is a dimension line graph and the associated mark for such an applicable view type is a line, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, at least one data type is a date, and no data types are independently quantitative.
131. The computer program product of claim 120, wherein an applicable view type is a measure line graph and the associated mark for such an applicable view type is a line, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, and at least one data type is either independently quantitative or a date.
132. The computer program product of claim 120, wherein an applicable view type is a circle chart and the associated mark for such an applicable view type is a circle, if at least one data type is ordinal, at least one data type is dependently quantitative, and no data type can be aggregated or is independently quantitative.
133. A computer system for automatically and visually displaying a graphical representation of a dataset, the computer system comprising:
- a central processing unit;
- a memory, coupled to the central processing unit, the memory storing:
- the dataset;
- a programming module comprising, comprising: instructions for displaying a graphical user interface window including a schema display region and a data visualization region, wherein the schema display region includes multiple terms corresponding to a plurality of fields of the dataset, wherein each field defines a respective category of the dataset; instructions for receiving a user selection of at least a subset of multiple terms in the schema display region and a user ordering of the user-selected terms in the data visualization region, wherein each field has an associated data type; instructions for forming a plurality of alternative views using the dataset based on the user ordering of the user-selected terms; instructions for determining a rank for each of the plurality of alternative views based on the user-specified order and the associated data types of the user-selected terms and a rating system and selecting a resulting view from the plurality of alternative views for displaying the dataset based on comparing the rank of the resulting view with the rank of at least another view of the plurality of alternative views; and instructions for displaying the dataset or a transformation of the dataset in the data visualization region according to the resulting view.
134. The computer system of claim 133, wherein the instructions for selecting further comprises:
- with no further user instruction: instructions for choosing one or more rules that are associated with adding the user selected fields; and instructions for applying the one or more rules to determine the resulting view's view type.
135. The computer system of claim 134, further comprising, before the instructions for choosing:
- instructions for determining whether a first view exists;
- wherein the one or more rules selected is also based upon the first view, if the first view is determined to exist.
136. The computer system of claim 133, wherein the dataset is a database.
137. The computer system of claim 136, wherein the database is remote.
138. The computer system of claim 136, wherein the database is a relational database.
139. The computer system of claim 136, wherein the database is a multidimensional database.
140. The computer system of claim 136, wherein the database contains semantic extractions of another database.
141. The computer system of claim 133, wherein the dataset is retrieved from a remote database.
142. The computer system of claim 134, wherein the rules are predetermined.
143. The computer system of claim 134, wherein the rules relate to user preference.
144. The computer system of claim 134, wherein the rules relate to user usage.
145. The computer system of claim 133, the instructions for selecting further comprising:
- instructions for computing the cardinalities of all the fields in the resulting view.
146. The computer system of claim 134, wherein the rules relate to the cardinalities of all the fields in the resulting view.
147. The computer system of claim 133, the instructions for selecting further comprising:
- instructions for computing the functional dependencies of all the fields in the resulting view, using an algorithm.
148. The computer system of claim 134, wherein the rules relate to the functional dependency of all the fields in the resulting view.
149. The computer system of claim 133, the instructions for displaying the dataset further comprising:
- instructions for choosing a mark for the resulting view; and
- instructions for rendering the dataset according to the mark.
150. The computer system of claim 133, wherein the instructions for receiving is accomplished when the user drags and drops a representation of a field.
151. The computer system of claim 133, wherein the instructions for receiving is accomplished when the user double clicks a representation of a field.
152. The computer system of claim 133, wherein the instructions for receiving is accomplished when the user right clicks a representation of a field and selects an option to drop or add the field.
153. The computer system of claim 133, wherein the instructions for receiving is accomplished when the user inputs text to designate the plurality of user selected fields and their order.
154. The computer system of claim 134, wherein the one or more rules state that the resulting view is a text table when the resulting view's first field is of the dimension data type.
155. The computer system of claim 134, wherein the one or more rules state that the resulting view is a bar chart when the resulting view's first field is of the measure data type, the resulting view's second field is of the dimension data type, and the resulting view has no field of the date dimension data type.
156. The computer system of claim 134, wherein the one or more rules state that the resulting view is a line when the resulting view's first field is of the measure data type, the resulting view's second field is of the dimension data type, and the resulting view has at least one field of the date dimension data type.
157. The computer system of claim 134, wherein the one or more rules state that the resulting view is a discrete line when the resulting view's first field is of the date dimension data type, and the resulting view's second field is of the measure data type.
158. The computer system of claim 134, wherein the one or more rules state that the resulting view is a continuous line when the resulting view's first field is of the continuous dimension data type, and the resulting view's second field is of the measure data type.
159. The computer system of claim 134, wherein the one or more rules state that the resulting view is a scatter plot when the resulting view's first and second fields are of the measure data type, and the resulting view's subsequent fields are of the dimension data type.
160. The computer system of claim 134, wherein the one or more rules state that the resulting view is a scatter matrix when the resulting view's first, second, and a subsequent field is of the measure data type.
161. The computer system of claim 149, wherein when the resulting view is a text table, the mark is text.
162. The computer system of claim 149, wherein when the resulting view is a heat map, the mark is a square.
163. The computer system of claim 149, wherein when the resulting view is a bar chart, the mark is a bar.
164. The computer system of claim 149, wherein when the resulting view is a line graph, the mark is a line segment.
165. The computer system of claim 149, wherein when the resulting view is a circle chart, the mark is a circle.
166. The computer system of claim 149, wherein when the resulting view is a Gantt plot, the mark is a bar.
167. The computer system of claim 149, wherein when the resulting view is a scatter plot, the mark is a point or shape.
168. The computer system of claim 133, wherein the selecting step uses heuristics.
169. A computer system for automatically and visually displaying a graphical representation of a dataset with a plurality of tuples, the computer system comprising:
- a central processing unit;
- a memory, coupled to the central processing unit, the memory storing:
- the dataset;
- a programming module comprising: instructions for displaying a graphical user interface window including a schema display region and a data visualization region, wherein the schema display region includes multiple terms corresponding to a plurality of fields of the dataset, wherein each field defines a respective category of the dataset; instructions for forming a plurality of alternative views in response to a user instruction for generating a graphical representation of the dataset, each alternative view showing all tuples, or a transformation of all tuples, in the dataset, wherein the user instruction includes a user selection of at least a subset of multiple terms in the schema display region and a user ordering of the user-selected terms in the data visualization region, each of the user-selected terms having an associated data type, and determining a rank for each of the plurality of alternative views based on the user-specified order and the associated data types of the user-selected terms and a rating system, wherein the user instruction includes a user selection of at least a subset of multiple terms in the schema display region and a user ordering of the user-selected terms in the data visualization region, each of the user-selected terms having an associated data type; instructions for selecting a resulting view from the plurality of rated alternative views in response to the user instruction, based upon a user selected option and comparing the rank of the resulting view with the rank of at least another view of the plurality of alternative views; and instructions for displaying the dataset or a transformation of the dataset in the data visualization region according to the resulting view in response to the user instruction.
170. The computer system of claim 169, wherein the dataset is a database.
171. The computer system of claim 170, wherein the database is remote.
172. The computer system of claim 170, wherein the database is a relational database.
173. The computer system of claim 170, wherein the database is a multidimensional database.
174. The computer system of claim 170, wherein the database contains semantic extractions of another database.
175. The computer system of claim 169, wherein the dataset is retrieved from a remote database.
176. The computer system of claim 169, wherein the instructions for forming further comprises:
- instructions for computing the cardinality of the fields in the plurality of tuples.
177. The computer system of claim 169, wherein the instructions for forming further comprises:
- instructions for computing the functional dependency on the plurality of tuples, using an algorithm.
178. The computer system of claim 169, wherein, when the user selected option is a first option, the instructions for selecting further comprises:
- instructions for ranking the plurality of alternative views according to a rating system; and
- instructions for assigning the resulting view as the highest ranked alternative view.
179. The computer system of claim 178, wherein the rating system is predetermined.
180. The computer system of claim 178, wherein the rating system relates to user preference.
181. The computer system of claim 178, wherein the rating system relates to user usage.
182. The computer system of claim 169, wherein, when the user selected option is a second option, the instructions for selecting further comprises:
- instructions for displaying a list of the alternative views;
- instructions for receiving the user's selection of an alternative view; and
- instructions for assigning the resulting view as the alternative view selected by the user.
183. The computer system of claim 169, wherein the instructions for forming further comprises:
- instructions for determining the applicable view types according to the dataset's data types;
- instructions for identifying the associated mark for each applicable view type; and
- instructions for forming an alternative view for each applicable view type according to the associated mark.
184. The computer system of claim 183, wherein the instructions for determining is accomplished using a set of criteria.
185. The computer system of claim 184, wherein the set of criteria is predetermined.
186. The computer system of claim 184, wherein the set of criteria relates to user preference.
187. The computer system of claim 184, wherein the set of criteria relates to user usage.
188. The computer system of claim 184, wherein an applicable view type is a text table and the associated mark for such an applicable view type is text, if all the data contained in the dataset are aggregated according to their fields, and no data type is independently quantitative.
189. The computer system of claim 184, wherein an applicable view type is a heat map and the associated mark for such an applicable view type is a colored shape, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, no more than two data types are dependently quantitative, and no data type is independently quantitative.
190. The computer system of claim 184, wherein an applicable view type is a Gantt plot and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, less than three data types are dependently quantitative, and at least one data type is either independently quantitative or a date.
191. The computer system of claim 184, wherein an applicable view type is a side-by-side bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is ordinal, at least one data type is dependently quantitative, and no data type is independently quantitative.
192. The computer system of claim 184, wherein an applicable view type is a stacked bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, no more than two data types are ordinal, and no data types are independently quantitative.
193. The computer system of claim 184, wherein an applicable view type is a measure bar chart and the associated mark for such an applicable view type is a bar, if all the data contained in dataset are aggregated according to their fields, at least one data type is ordinal, no more than two data types are dependently quantitative, and no data types are independently quantitative.
194. The computer system of claim 184, wherein an applicable view type is a dimension line graph and the associated mark for such an applicable view type is a line, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, at least one data type is a date, and no data types are independently quantitative.
195. The computer system of claim 184, wherein an applicable view type is a measure line graph and the associated mark for such an applicable view type is a line, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, and at least one data type is either independently quantitative or a date.
196. The computer system of claim 184, wherein an applicable view type is a circle chart and the associated mark for such an applicable view type is a circle, if at least one data type is ordinal, at least one data type is dependently quantitative, and no data type can be aggregated or is independently quantitative.
197. The computer system of claim 184, wherein an applicable view type is a single scatter chart and the associated mark for such an applicable view type is a point, if the data contained in the dataset included between two to four dependently quantitative data types, and no independently quantitative data type.
198. The computer system of claim 184, wherein an applicable view type is a matrix scatter chart and the associated mark for such an applicable view type is a point, if the dataset included at least three dependently quantitative data types, and no independently quantitative data type.
199. The computer system of claim 184, wherein an applicable view type is a histogram and the associated mark for such an applicable view type is a bar, if all the data contained in the dataset are aggregated according to their fields, at least one data type is dependently quantitative, and no data types are independently quantitative.
200. The computer system of claim 169, wherein the selecting step uses heuristics.
201. The computer implemented method of claim 1, the selecting step further comprising:
- displaying the plurality of alternative views; and
- receiving a user selection of one of the plurality of alternative views as the resulting view.
202. The computer implemented method of claim 37, the selecting step further comprising:
- displaying the plurality of alternative views; and
- receiving a user selection of one of the plurality of alternative views as the resulting view.
4800810 | January 31, 1989 | Masumoto |
5036314 | July 30, 1991 | Barillari et al. |
5060980 | October 29, 1991 | Johnson et al. |
5143888 | September 1, 1992 | Olbrich |
5144452 | September 1, 1992 | Abuyama |
5169713 | December 8, 1992 | Kumurdjian |
5265244 | November 23, 1993 | Ghosh et al. |
5265246 | November 23, 1993 | Li et al. |
5377348 | December 27, 1994 | Lau et al. |
5383029 | January 17, 1995 | Kojima |
5461708 | October 24, 1995 | Kahn |
5560007 | September 24, 1996 | Thai |
5577241 | November 19, 1996 | Spencer |
5581677 | December 3, 1996 | Myers et al. |
5664172 | September 2, 1997 | Antoshenkov |
5664182 | September 2, 1997 | Nierenberg et al. |
5668987 | September 16, 1997 | Schneider |
5794246 | August 11, 1998 | Sankaran et al. |
5864856 | January 26, 1999 | Young |
5893088 | April 6, 1999 | Hendricks et al. |
5933830 | August 3, 1999 | Williams |
6031632 | February 29, 2000 | Yoshihara et al. |
6032158 | February 29, 2000 | Mukhopadhyay et al. |
6044374 | March 28, 2000 | Nesamoney et al. |
6100901 | August 8, 2000 | Mohda et al. |
6115744 | September 5, 2000 | Robins et al. |
6154766 | November 28, 2000 | Yost et al. |
6173310 | January 9, 2001 | Yost et al. |
6188403 | February 13, 2001 | Sacerdoti et al. |
6208990 | March 27, 2001 | Suresh et al. |
6222540 | April 24, 2001 | Sacerdoti |
6247008 | June 12, 2001 | Cambot et al. |
6253257 | June 26, 2001 | Dundon |
6260050 | July 10, 2001 | Yost et al. |
6269393 | July 31, 2001 | Yost et al. |
6300957 | October 9, 2001 | Rao et al. |
6301579 | October 9, 2001 | Becker |
6317750 | November 13, 2001 | Tortolani et al. |
6327628 | December 4, 2001 | Anuff et al. |
6339775 | January 15, 2002 | Zamanian et al. |
6377259 | April 23, 2002 | Tenev et al. |
6397195 | May 28, 2002 | Pinard et al. |
6400366 | June 4, 2002 | Davies et al. |
6405195 | June 11, 2002 | Ahlberg |
6405208 | June 11, 2002 | Raghavan et al. |
6424933 | July 23, 2002 | Agrawala et al. |
6490593 | December 3, 2002 | Proctor |
6492989 | December 10, 2002 | Wilkinson |
6505205 | January 7, 2003 | Kothuri et al. |
6522342 | February 18, 2003 | Gagnon et al. |
6529217 | March 4, 2003 | Maguire et al. |
6590577 | July 8, 2003 | Yonts |
6611825 | August 26, 2003 | Billheimer et al. |
6643646 | November 4, 2003 | Su et al. |
6701485 | March 2, 2004 | Igra et al. |
6707454 | March 16, 2004 | Barg et al. |
6714897 | March 30, 2004 | Whitney et al. |
6725230 | April 20, 2004 | Ruth et al. |
6750864 | June 15, 2004 | Anwar |
6763308 | July 13, 2004 | Chu et al. |
6768986 | July 27, 2004 | Cras et al. |
6867788 | March 15, 2005 | Takeda |
6906717 | June 14, 2005 | Couckuyt et al. |
6920608 | July 19, 2005 | Davis |
6928433 | August 9, 2005 | Goodman et al. |
6985905 | January 10, 2006 | Prompt et al. |
7009609 | March 7, 2006 | Miyadai |
7089266 | August 8, 2006 | Stolte et al. |
7117058 | October 3, 2006 | Lin et al. |
7168035 | January 23, 2007 | Bell et al. |
7181450 | February 20, 2007 | Malloy et al. |
7250951 | July 31, 2007 | Hurley et al. |
7315305 | January 1, 2008 | Crotty et al. |
7379601 | May 27, 2008 | Yang et al. |
7630971 | December 8, 2009 | Arrouye et al. |
7694278 | April 6, 2010 | Pasumansky et al. |
7725483 | May 25, 2010 | Poyourow et al. |
20020016699 | February 7, 2002 | Hoggart et al. |
20020118192 | August 29, 2002 | Couckuyt et al. |
20020123865 | September 5, 2002 | Whitney et al. |
20020154118 | October 24, 2002 | McCarthy et al. |
20030042928 | March 6, 2003 | Tsai |
20030200034 | October 23, 2003 | Fellenberg et al. |
20030204511 | October 30, 2003 | Brundage et al. |
20040183800 | September 23, 2004 | Peterson |
20040224577 | November 11, 2004 | Kaji |
20040227759 | November 18, 2004 | McKnight et al. |
20040243593 | December 2, 2004 | Stolte et al. |
20050035966 | February 17, 2005 | Pasquarette et al. |
20050035967 | February 17, 2005 | Joffrain et al. |
20050060300 | March 17, 2005 | Stolte et al. |
20050099423 | May 12, 2005 | Brauss |
20050234688 | October 20, 2005 | Pinto et al. |
20050234920 | October 20, 2005 | Rhodes |
20060020586 | January 26, 2006 | Prompt et al. |
20060031187 | February 9, 2006 | Pyrce et al. |
20060053363 | March 9, 2006 | Bargh et al. |
20060100873 | May 11, 2006 | Bittner et al. |
20060101391 | May 11, 2006 | Ulke et al. |
20060129913 | June 15, 2006 | Vigesaa et al. |
20060136825 | June 22, 2006 | Cory et al. |
20060206512 | September 14, 2006 | Hanrahan et al. |
20070055487 | March 8, 2007 | Habitz et al. |
20070061344 | March 15, 2007 | Dickerman et al. |
20070112844 | May 17, 2007 | Tribble et al. |
20080133573 | June 5, 2008 | Haft et al. |
215657 | January 1994 | HU |
WO2006/-060773 | June 2006 | WO |
- Screen Dumps for Microsoft Office Excel 2003 SP2, figures 1-24, 10 pp. 1-19.
- Becker et al., “Trellis Graphics Displays: A Multi-Dimensional Data Visualization Tool for Data Mining,” Third Annual Conference on Knowledge Discovery in Databases, Aug. 1997.
- Becker, “Visualizing Decision Table Classifiers”, Proceedings IEEE Symposium on Information Visualization, pp. 102-105, 1998.
- Bosch et al., “Performance Analysis and Visualization of Parallel Systems Using SimOS and Rivet: A Case Study,” Proceedings of the Sixth IEEE International Symposium on High-Performance Computer Architecture, pp. 360-371, Jan. 2000.
- Bosch et al., “Rivet: A Flexible Environment for Computer Systems Visualization,” Computer Graphics.34, pp. 68-73, Feb. 2000.
- Brunk et al., “MineSet: An Integrated System for Data Mining,” Proceeding of the 3rd International Conference on Knowledge Discovery and Data Mining, pp. 135-138, AAA1 Press, 1997.
- Derthick et al., “An Interactive Visual Query Environment for Exploring Data,” Proceedings of ACM SIGGRAPH Symposium on User Interface Software & Technology, pp. 189-198, 1997.
- Freeze, Unlocking OLAP with Microsoft SQL Server and Excel 2000, IDG Books Worldwide, Foster City, CA, pp. 155-332 and 379-422.
- Fua et al., “Navigating Hierarchies with Structure-Based Brushes,” Proc. of Infovis '99 (San Francisco, CA, USA, 1999), IEEE Computer Soc Press.
- Goldstein et al., “A Framework for Knowledge-Based Interactive Data Exploration,” Journal of Visual Languages and Computing 5, pp. 339-363, Dec. 1994.
- Gray et al., “Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total,” Data Mining and Knowledge Discovery 1, pp. 29-53, 1997.
- Healey, “On the Use of Perceptual Cues and Data Mining for Effective Visualization of Scientific Datasets,” Proceedings Graphics Interface '98, pp. 177-187.
- Kohavi, “Data Mining and Visualization,” Frontiers of Engineering: Reports of leading-Edge Engineering from the 200 NAE Sym.
- Livny et al., “DEVise: Integrated Querying and Visual Exploration of Large Datasets,” Proceedings of ACM SIGMOD, pp. 301-312, May 1997.
- MacDonald, “Creating Basic Charts,” Excel 2003, Chapter 9, pp. 298-342, 2006.
- MacKinlay, J., “Automating the Design of Graphical Presentations of Relational Information,” ACM Transactionss on Graphics 5(2), pp. 110-141, 1986.
- Perlin et al., “An Alternative Approach to the Computer Interface,” Proc. of the 20th International Conference on Computer Graphics and Interactive Techniques, pp. 57-64, 1993.
- Rao et al., “The Table Lens: Merging Graphical and Symbolic Representations in an Interactive Focus+Context Visualization for Tabular Information,” Proc. of ACM SIGCHI, Apr. 1994, pp. 318-322.
- Roth et al., “Interactive Graphics Design Using Automatic Presentation Knowledge,” Proceedings of the Conference on Human Factors in Computer Systems, Proc. SIGCHI '94, pp. 112-117, 1994.
- Roth et al., “Visage: A User Interface Environment for Exploring Information” Proceedings of Information Visualization Symposium 1996, pp. 3-12, Mar. 12, 1999.
- Spenke et al., Focus: The Interactive Table for Product Comparison and Selection, Proc. of ACM Symposium on User Interface Software and Technology, Nov. 1996.
- Stevens, “On the Theory of Scales of Measurement,” Science vol. 103, No. 2684 pp. 677-680, Jun. 7, 1946.
- Stolte et al, “Query, Analysis, and Visualization of Hierarchically Structured Data Using Polaris,” Proceedings of the Eight ACM SIGKDD '02, International Conference on Knowledge Discovery and Data Mining, Jul. 2002.
- Stolte et al., “Multiscale Visualization Using Data Cubes,” Proceedings of the Eighth IEEE Symposium on Information Visualization, 2002.
- Stolte et al., “Polaris: A System for Query, Analysis, and Visualization of Multidimensional Relational Databases,” IEEE Transactions on Visualization and Computer Graphics 8(1), pp. 52-65, Jan. 2002.
- Stolte et al., “Visualizing Application Behavior on Superscalar Processors,” Proceedings of the IEEE Symposium on Information Visualization, pp. 10-17, 1999.
- Thearling et al., “Visualizing Data Mining Models,” Pub. in Information Visualization in Data Mining and Knowledge Discovery, Fayyad, Grinstein and Wierse, eds., Morgan Kaufman, 2001.
- Welling et al., “Visualization of Large Multi-Dimensional Datasets,” arXiv: astro-ph/0008186, Aug. 11, 2000.
- Wilkinson et al., “nViZn: An Algebra-Based Visualization System,” Smart Graphics Symposium UK, Mar. 21-23, 2001, Hawthorne, NY USA.
- Wilkinson, “Statistics and Computing—The Grammar of Graphics,” Springer-Verlag, Inc., New York, 1999.
- Notice of Allowance for U.S. Appl. No. 10/453,834 dated Mar. 27, 2006.
- Office Action for U.S. Appl. No. 10/667,194 dated Jun. 26, 2006.
- Office Action for U.S. Appl. No. 10/667,194 dated Jan. 18, 2007.
- Office Action for U.S. Appl. No. 10/667,194 dated Aug. 14, 2007.
- Office Action for U.S. Appl. No. 10/667,194 dated Jan. 7, 2008.
- Office Action for U.S. Appl. No. 10/667,194 dated Aug. 14, 2008.
- Office Action for U.S. Appl. No. 10/667,194 dated Feb. 9, 2009.
- Office Action for U.S. Appl. No. 11/005,652 dated Dec. 27, 2007.
- Office Action for U.S. Appl. No. 11/005,652 dated Jul. 24, 2008.
- Office Action for U.S. Appl. No. 11/005,652 dated Feb. 20, 2009.
- Office Action for U.S. Appl. No. 11/488,407 dated Apr. 3, 2009.
- Office Action for U.S. Appl. No. 11/787,761 dated Jun. 12, 2008.
- Office Action for U.S. Appl. No. 11/787,761 dated Dec. 17, 2008.
- International Search Report and Written Opinion for PCT/US2004/30396 dated Aug. 24, 2006.
- International Preliminary Report on Patentability for PCT/US2004/30396 dated Apr. 19, 2007.
- International Search Report and Written Opinion for PCT/US2005/043937 dated Apr. 18, 2007.
- International Preliminary Report on Patentability for PCT/US2005/043937 dated Jun. 14, 2007.
- International Search Report and Written Opinion for PCT/US2007/009810 dated Jul. 7, 2008.
- International Preliminary Report on Patentability for PCT/US2007/009810 dated Oct. 30, 2008.
- International Search Report and Written Opinion for PCT/US2006/35300 dated Jul. 7, 2008.
- International Search Report and Written Opinion for PCT/US04/18217 dated Feb. 7, 2006.
- International Preliminary Report on Patentability for PCT/US04/18217 dated Oct. 19, 2006.
- Hungarian Search Report for HU P0700460 dated Oct. 9, 2007.
- Supplementary European Search Report for EP 04754739.3, dated Dec. 17, 2007.
- Specification for U.S. Appl. No. 10/453,834, filed Jun. 2, 2003.
- Specification for U.S. Appl. No. 11/005,652, filed Dec. 2, 2004.
- Specification for U.S. Appl. No. 11/223,658, filed Sep. 5, 2005.
- Notice of Allowance, U.S. Appl. No. 11/488,407, Dec. 29, 2009, 8 pages.
- Notice of Allowance, U.S. Appl. No. 11/005,652, May 18, 2010, 4 pages.
- Notice of Allowance, U.S. Appl. No. 10/667,194, Mar. 5, 2010, 4 pages.
- Office Action, U.S. Appl. No. 10/667,194, Sep. 25, 2009, 10 pages.
- Office Action, U.S. Appl. No. 11/005,652, Oct. 23, 2009, 14 pages.
- Office Action, U.S. Appl. No. 11/787,761, Jul. 8, 2009, 14 pages.
- Office Action, U.S. Appl. No. 11/787,761, Nov. 30, 2009, 14 pages.
- Office Action, U.S. Appl. No. 11/787,761, Apr. 28, 2010, 10 pages.
- Office Action, U.S. Appl. No. 11/787,761, Sep. 14, 2010, 12 pages.
- International Preliminary Report on Patentability, PCT/US2006/035300, Mar. 24, 2009, 5 pages.
Type: Grant
Filed: Sep 9, 2005
Date of Patent: Jan 17, 2012
Patent Publication Number: 20070061611
Assignee: Tableau Software LLC (Seattle, WA)
Inventors: Jock Douglas Mackinlay (Bellevue, WA), Christopher Richard Stolte (Seattle, WA), Patrick Hanrahan (Portola Valley, CA)
Primary Examiner: Simon Ke
Assistant Examiner: Erik Stitt
Attorney: Morgan, Lewis & Bockius LLP
Application Number: 11/223,658
International Classification: G06F 3/048 (20060101); G06F 3/00 (20060101); G06F 17/22 (20060101); G06Q 40/00 (20060101);