Selecting good views of high-dimensional data using class consistency
Mike Sips, Boris Neubert, John P. Lewis, Pat Hanrahan
In Computer Graphics Forum, 28(3), 2009.
Abstract: Many visualization techniques involve mapping high-dimensional data spaces to lower-dimensional views. Unfortunately, mapping a high-dimensional data space into a scatterplot involves a loss of information; or, even worse, it can give a misleading picture of valuable structure in higher dimensions. In this paper, we propose class consistency as a measure of the quality of the mapping. Class consistency enforces the constraint that classes of n–D data are shown clearly in 2–D scatterplots. We propose two quantitative measures of class consistency, one based on the distance to the class's center of gravity, and another based on the entropies of the spatial distributions of classes. We performed an experiment where users choose good views, and show that class consistency has good precision and recall. We also evaluate both consistency measures over a range of data sets and show that these measures are efficient and robust.
Keyword(s): Data Mining [I.5.3]: Clustering, User Interfaces [H.5.2]: Evaluation/methodology
Article URL: http://dx.doi.org/10.1111/j.1467-8659.2009.01467.x
BibTeX format:
@article{CGF:CGF1467,
  author = {Mike Sips and Boris Neubert and John P. Lewis and Pat Hanrahan},
  title = {Selecting good views of high-dimensional data using class consistency},
  journal = {Computer Graphics Forum},
  volume = {28},
  number = {3},
  pages = {831--838},
  year = {2009},
}
Search for more articles by Mike Sips.
Search for more articles by Boris Neubert.
Search for more articles by John P. Lewis.
Search for more articles by Pat Hanrahan.

Return to the search page.


graphbib: Powered by "bibsql" and "SQLite3."