<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Justin Talbot &#187; visualization</title>
	<atom:link href="http://www.justintalbot.com/category/visualization/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.justintalbot.com</link>
	<description></description>
	<lastBuildDate>Wed, 07 Sep 2011 05:26:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Arc Length-based Aspect Ratio Selection</title>
		<link>http://www.justintalbot.com/2011/08/arc-length-based-aspect-ratio-selection/</link>
		<comments>http://www.justintalbot.com/2011/08/arc-length-based-aspect-ratio-selection/#comments</comments>
		<pubDate>Wed, 31 Aug 2011 20:15:43 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.justintalbot.com/?p=205</guid>
		<description><![CDATA[Here&#8217;s a preprint of our paper on aspect ratio selection which will appear in InfoVis 2011. In it we propose a new criteria for banking data plots, building on previous ideas from Bill Cleveland and Jeff Heer and Maneesh Agrawala. We frame the aspect ratio selection problem as one of minimizing the length of the [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a <a href="http://vis.stanford.edu/papers/arclength-banking">preprint</a> of our paper on aspect ratio selection which will appear in InfoVis 2011. In it we propose a new criteria for banking data plots, building on previous ideas from Bill Cleveland and Jeff Heer and Maneesh Agrawala.</p>
<p>We frame the aspect ratio selection problem as one of minimizing the length of the data curve while keeping the area of the plot constant. This leads to a method that is substantially more robust than previous approaches. We&#8217;re also able to demonstrate empirically that the resulting aspect ratios are a compromise between those suggested by previous methods. As shown below, the arc length method can also effectively bank both standard line charts (in this case a loess regression line) as wells as contour charts.</p>
<p><img src="http://vis.stanford.edu/images/figures/arclength-banking.png" alt="Arc-length banking example" width=500/></p>
<p>Perhaps the most surprising result is that good aspect ratios can be selected without explicit reference to the slopes or orientations of the line segments within the plot.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2011/08/arc-length-based-aspect-ratio-selection/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>C# code for labeling paper</title>
		<link>http://www.justintalbot.com/2011/07/c-code-for-labeling-paper/</link>
		<comments>http://www.justintalbot.com/2011/07/c-code-for-labeling-paper/#comments</comments>
		<pubDate>Sat, 23 Jul 2011 03:57:02 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.justintalbot.com/?p=200</guid>
		<description><![CDATA[I&#8217;ve finally had time to pull the labeling algorithm out of my much larger visualization package. It&#8217;s now up on github: https://github.com/jtalbot/Labeling. This implements all parts of the labeling paper, including the formatting variations. Let me know if you run into any problems with it or have any suggestions for improvement.]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve finally had time to pull the labeling algorithm out of my much larger visualization package. It&#8217;s now up on github: <a href="https://github.com/jtalbot/Labeling">https://github.com/jtalbot/Labeling</a>. This implements all parts of the labeling paper, including the formatting variations.</p>
<p>Let me know if you run into any problems with it or have any suggestions for improvement.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2011/07/c-code-for-labeling-paper/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>R labeling package released on CRAN</title>
		<link>http://www.justintalbot.com/2010/12/r-labeling-package-released-on-cran/</link>
		<comments>http://www.justintalbot.com/2010/12/r-labeling-package-released-on-cran/#comments</comments>
		<pubDate>Wed, 08 Dec 2010 18:23:46 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.justintalbot.com/?p=192</guid>
		<description><![CDATA[Version 0.1 of the labeling package has been released on CRAN.]]></description>
			<content:encoded><![CDATA[<p>Version 0.1 of the labeling package has been released on <a href="http://cran.at.r-project.org/web/packages/labeling/index.html">CRAN</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2010/12/r-labeling-package-released-on-cran/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>R code for our axis labeling algorithm</title>
		<link>http://www.justintalbot.com/2010/09/r-code-for-our-axis-labeling-algorithm/</link>
		<comments>http://www.justintalbot.com/2010/09/r-code-for-our-axis-labeling-algorithm/#comments</comments>
		<pubDate>Fri, 03 Sep 2010 03:52:00 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.justintalbot.com/?p=178</guid>
		<description><![CDATA[The R version of our labeling code is now hosted at R-forge. You can get it here or install it from within R using install.packages("labeling", repos="http://R-Forge.R-project.org"). A few small bugs in the implementation of our algorithm have been fixed thanks to feedback from Ahmet Karahan who is working on a Java version. I have also [...]]]></description>
			<content:encoded><![CDATA[<p>The R version of our labeling code is now hosted at R-forge. You can get it <a href="https://r-forge.r-project.org/R/?group_id=872 ">here</a> or install it from within R using <code>install.packages("labeling",  repos="http://R-Forge.R-project.org")</code>.</p>
<p>A few small bugs in the implementation of our algorithm have been fixed thanks to feedback from Ahmet Karahan who is working on a Java version. I have also added a number of other labeling algorithms that have been proposed or used in the past, including those by Sparks, Thayer, and Nelder (from about 40 years ago), and adaptations of the matplotlib, gnuplot, and R&#8217;s pretty labeling functions.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2010/09/r-code-for-our-axis-labeling-algorithm/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Monitoring Hadoop</title>
		<link>http://www.justintalbot.com/2010/08/monitoring-hadoop/</link>
		<comments>http://www.justintalbot.com/2010/08/monitoring-hadoop/#comments</comments>
		<pubDate>Tue, 24 Aug 2010 19:53:28 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.justintalbot.com/?p=167</guid>
		<description><![CDATA[As a side project this summer, I implemented a simple visual interface for HOP, an extended version of Hadoop. This was used by the HOP creators in their demo of HOP at this summer&#8217;s SIGMOD. The graphical elements were produced using Protovis since I needed an excuse to play around with it. We ran into minor [...]]]></description>
			<content:encoded><![CDATA[<p>As a side project this summer, I implemented a simple visual interface for <a href="http://code.google.com/p/hop/">HOP</a>, an extended version of Hadoop. This was used by the HOP creators in their <a href="http://neilconway.org/docs/sigmod2010_hop_demo.pdf">demo of HOP</a> at this summer&#8217;s SIGMOD.</p>
<div id="attachment_168" class="wp-caption aligncenter" style="width: 304px"><a href="http://www.justintalbot.com/wp-content/uploads/2010/08/hop_screenshot.png"><img class="size-medium wp-image-168" title="hop_screenshot" src="http://www.justintalbot.com/wp-content/uploads/2010/08/hop_screenshot-294x300.png" alt="Screenshot of hop visual interface" width="294" height="300" /></a><p class="wp-caption-text">Hop visual interface</p></div>
<p>The graphical elements were produced using <a href="http://vis.stanford.edu/protovis/">Protovis</a> since I needed an excuse to play around with it. We ran into minor performance problems using Protovis for so many plots in a single page. In a production system it would be wiser to generate and cache the plots on the server side.</p>
<p><strong>Update</strong>: The screenshot shows a task scheduling imbalance bug that we found in HOP using the visual interface.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2010/08/monitoring-hadoop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>An Extension of Wilkinson’s Algorithm for Positioning Tick Labels on Axes</title>
		<link>http://www.justintalbot.com/2010/07/an-extension-of-wilkinson%e2%80%99s-algorithm-for-positioning-tick-labels-on-axes/</link>
		<comments>http://www.justintalbot.com/2010/07/an-extension-of-wilkinson%e2%80%99s-algorithm-for-positioning-tick-labels-on-axes/#comments</comments>
		<pubDate>Fri, 30 Jul 2010 21:55:01 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.justintalbot.com/?p=157</guid>
		<description><![CDATA[Here&#8217;s a preprint of our paper on selecting tick labels for axes which will appear in this year&#8217;s InfoVis! Source code of the implementation will be made available before the conference. We&#8217;re hoping to get this implemented in a number of common plotting libraries. I already have a partial matplotlib version working. I would also [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a <a href="http://graphics.stanford.edu/vis/publications/2010/labeling-preprint.pdf">preprint</a> of our paper on selecting tick labels for axes which will appear in this year&#8217;s InfoVis! Source code of the implementation will be made available before the conference. We&#8217;re hoping to get this implemented in a number of common plotting libraries. I already have a partial matplotlib version working. I would also like to have one for ggplot. Other suggestions are welcome.</p>
<p>The abstract:</p>
<blockquote>
<div id="_mcePaste">The non-data components of a visualization, such as axes and legends, can often be just as important as the data itself. They provide contextual information essential to interpreting the data. In this paper, we describe an automated system for choosing positions and labels for axis tick marks. Our system extends Wilkinson’s optimization-based labeling approach to create a more robust, full-featured axis labeler. We define an expanded space of axis labelings by automatically generating additional nice numbers as needed and by permitting the extreme labels to occur inside the data range. These changes provide flexibility in problematic cases, without degrading quality elsewhere. We also propose an additional optimization criterion, legibility, which allows us to simultaneously optimize over label formatting, font size, and orientation. To solve this revised optimization problem, we describe the optimization function and an efficient search algorithm. Finally, we compare our method to previous work using both quantitative and qualitative metrics. This paper is a good example of how ideas from automated graphic design can be applied to information visualization.</div>
</blockquote>
<p><strong>Update</strong>: We&#8217;ve released a preliminary <a href="http://www.justintalbot.com/research/axis-labeling/">R package</a> implementing the three labeling algorithms we compared in the paper. Feedback is appreciated. The final version should be released by InfoVis (in October).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2010/07/an-extension-of-wilkinson%e2%80%99s-algorithm-for-positioning-tick-labels-on-axes/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Gallup-Healthways Well-Being</title>
		<link>http://www.justintalbot.com/2009/03/gallup-healthways-well-being/</link>
		<comments>http://www.justintalbot.com/2009/03/gallup-healthways-well-being/#comments</comments>
		<pubDate>Thu, 12 Mar 2009 04:38:07 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://justintalbot.org/?p=89</guid>
		<description><![CDATA[Will Wilkinson points to the Gallup-Healthways Well-Being Index which purports to measure overall health (&#8220;not only the absence of infirmity and disease, but also a state of physical, mental, and social well-being&#8221;) at the congressional district level for the United States. Will hypothesizes that Utah&#8217;s high score may be due to &#8220;a skoche of culture-driven [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-92" title="wellness1" src="http://justintalbot.org/wp-content/uploads/2009/03/wellness1.png" alt="wellness1" width="292" height="640" />Will Wilkinson <a href="http://www.willwilkinson.net/flybottle/2009/03/10/united-states-of-happiness/">points to</a> the <a href="http://www.ahiphiwire.org/wellbeing/">Gallup-Healthways Well-Being Index</a> which purports to measure overall health (&#8220;not only the absence of infirmity and disease, but also a state of physical, mental, and social well-being&#8221;) at the congressional district level for the United States. Will hypothesizes that Utah&#8217;s high score may be due to &#8220;a skoche of culture-driven upward inflation&#8221; (Mormons overstating their happiness).</p>
<p>Fortunately, the components of the Well-Being Index are reported as well. Two components, Life Evaluation and Emotional Health, measure self-reported happiness. If Will&#8217;s hypothesis were correct, we would expect these components to account for a disproportionate share of Utah&#8217;s overall index. In the scatterplots to the right, the three Utah congressional districts are highlighted in orange. Contrary to Will, Utah is above average only in the Work Quality component. On all the others, including Life Evaluation and Emotional Health, Utah is average or below average.</p>
<p><a href="http://justintalbot.org/wp-content/uploads/2009/03/wellness.xlsx">Wellness data</a> in Excel, since I couldn&#8217;t figure out how to get it from the Gallup-Healthways site. The visualization was done in Tableau.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2009/03/gallup-healthways-well-being/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>our day, visualizing the American Time Use Survey</title>
		<link>http://www.justintalbot.com/2009/02/our-day-visualizing-the-american-time-use-survey/</link>
		<comments>http://www.justintalbot.com/2009/02/our-day-visualizing-the-american-time-use-survey/#comments</comments>
		<pubDate>Sat, 14 Feb 2009 02:18:45 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://justintalbot.org/?p=85</guid>
		<description><![CDATA[For Jeff&#8217;s class I created an interactive visualization of the American Time Use Survey. I got sick last week so didn&#8217;t have a lot of time to work on it. As a result it turned out somewhat derivative of the Baby Name Voyager and other stacked area plots. That said, I think it lets you [...]]]></description>
			<content:encoded><![CDATA[<p>For Jeff&#8217;s class I created an <a href="http://research.justintalbot.org/ourday">interactive visualization</a> of the American Time Use Survey. I got sick last week so didn&#8217;t have a lot of time to work on it. As a result it turned out somewhat derivative of the <a href="http://www.babynamewizard.com/voyager">Baby Name Voyager</a> and other stacked area plots.</p>
<p>That said, I think it lets you find some rather interesting patterns in how people use their time. Most noticeable is the extra hour or so that people sleep in on the weekends.</p>
<p><a href="http://research.justintalbot.org/ourday"><img class="alignnone size-medium wp-image-86" title="ourday" src="http://justintalbot.org/wp-content/uploads/2009/02/ourday-300x159.png" alt="ourday" width="300" height="159" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2009/02/our-day-visualizing-the-american-time-use-survey/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Fundamental Statistical Concepts in Presenting Data: Principles for Constructing Better Graphics</title>
		<link>http://www.justintalbot.com/2009/02/fundamental-statistical-concepts-in-presenting-data-principles-for-constructing-better-graphics/</link>
		<comments>http://www.justintalbot.com/2009/02/fundamental-statistical-concepts-in-presenting-data-principles-for-constructing-better-graphics/#comments</comments>
		<pubDate>Sat, 14 Feb 2009 02:11:57 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://justintalbot.org/?p=82</guid>
		<description><![CDATA[Via Andrew Gelman I came across this long paper (updated version) on statistical visualization by Rafe Donahue. I haven&#8217;t read it through carefully yet, but I enjoyed the examples of visualizations from his children&#8217;s schoolwork. He criticizes boxplots, which caused a discussion in the comments to Andrew&#8217;s post. I read Tukey&#8217;s EDA recently and was [...]]]></description>
			<content:encoded><![CDATA[<p>Via <a href="http://www.stat.columbia.edu/~cook/movabletype/archives/2009/02/fundamental-sta.html">Andrew Gelman</a> I came across <a href="http://biostat.mc.vanderbilt.edu/twiki/pub/Main/RafeDonahue/graphicshandout20080717_01.pdf">this long paper</a> (<a href="http://biostat.mc.vanderbilt.edu/wiki/pub/Main/RafeDonahue/fscipdpfcbg_currentversion.pdf">updated version</a>) on statistical visualization by Rafe Donahue. I haven&#8217;t read it through carefully yet, but I enjoyed the examples of visualizations from his children&#8217;s schoolwork.</p>
<p>He criticizes boxplots, which caused a discussion in the comments to Andrew&#8217;s post. I read Tukey&#8217;s EDA recently and was surprised to see how much of Tukey&#8217;s work was focused on visualization by hand. The boxplot is a sensible visualization when you had to compute and plot manually. Using only 5 numbers it portrayed much of what was important about the data. However, now that plotting is cheap, it makes a lot more sense to just plot all the data.</p>
<p>In general, summaries, visual or otherwise, which assume a single mode, or worse normality, should be treated with a great deal of caution.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2009/02/fundamental-statistical-concepts-in-presenting-data-principles-for-constructing-better-graphics/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Exploratory Model Analysis</title>
		<link>http://www.justintalbot.com/2009/01/exploratory-model-analysis/</link>
		<comments>http://www.justintalbot.com/2009/01/exploratory-model-analysis/#comments</comments>
		<pubDate>Wed, 21 Jan 2009 04:17:28 +0000</pubDate>
		<dc:creator>Justin Talbot</dc:creator>
				<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://justintalbot.org/?p=65</guid>
		<description><![CDATA[I&#8217;ve recently come across a few papers on Exploratory Model Analysis. I wasn&#8217;t familiar with this work when writing the EnsembleMatrix paper, but they are very closely related. I was working with a ML researcher while designing the EnsembleMatrix visual interface and so did quite of bit of looking around in the ML literature. EMA [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve recently come across a few papers on Exploratory Model Analysis. I wasn&#8217;t familiar with this work when writing the EnsembleMatrix paper, but they are very closely related. I was working with a ML researcher while designing the EnsembleMatrix visual interface and so did quite of bit of looking around in the ML literature. EMA is emerging in statistics and so didn&#8217;t appear in my search.</p>
<p>Here are a few pointers:<br />
<a href="http://portal.acm.org/citation.cfm?id=942045">Parallel coordinates for exploratory modelling analysis</a> (Antony Unwin)<br />
<a href="http://www.springerlink.com/index/v7wx480737343288.pdf">Exploratory modelling analysis: visualizing the value of variables</a> (Antony Unwin)<br />
<a href="http://www.stat.iastate.edu/preprint/articles/2007-04.pdf">Meifly: Models explored interactively</a> (Hadley Wickham)</p>
<p>I tried installing meifly, but it appears to depend on ggplot which is no longer available since ggplot2 has been released.</p>
<p><a href="http://www.stat.columbia.edu/%7Egelman/research/published/p755.pdf">[2004] Exploratory data analysis for complex models (with discussion)</a> (Andrew Gelman)<br />
<a href="http://www.stat.columbia.edu/%7Egelman/research/published/p780.pdf">Discussion of this paper by Andreas Buja</a> (Andreas Buja)<br />
<a href="http://www.stat.columbia.edu/%7Egelman/research/published/p785.pdf">Rejoinder to discussion</a> (Andrew Gelman)</p>
<p>This discussion is quite inspiring. The idea that visualizations can be thought of a statistical tests was quite eye opening. I think that this suggests quite a few directions for research in InfoVis. However, there hasn&#8217;t been much work in this area in the 4 years since the paper came out. Why? Perhaps the artificial division between InfoVis and statistical visualization has kept it from being noticed. Perhaps it&#8217;s just very hard</p>
]]></content:encoded>
			<wfw:commentRss>http://www.justintalbot.com/2009/01/exploratory-model-analysis/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

