<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>Visual Business Intelligence</title>
	<atom:link href="http://www.perceptualedge.com/blog/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://www.perceptualedge.com/blog</link>
	<description>A blog by Stephen Few</description>
	<pubDate>Thu, 17 Jun 2010 22:47:14 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.3</generator>
	<language>en</language>
			<item>
		<title>Business Intelligence Industry – Get to Know Your Real Customers</title>
		<link>http://www.perceptualedge.com/blog/?p=814</link>
		<comments>http://www.perceptualedge.com/blog/?p=814#comments</comments>
		<pubDate>Thu, 17 Jun 2010 22:47:14 +0000</pubDate>
		<dc:creator>Stephen Few</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=814</guid>
		<description><![CDATA[The BI industry has always failed to understand and support its real customers. With few exceptions, BI product vendors and consultancies continue to be acquainted primarily with IT. This is a comfortable, compatible relationship, for BI and IT both tend to see the world from an engineering-oriented, techno-centric perspective. But the BI industry&#8217;s real customers [...]]]></description>
			<content:encoded><![CDATA[<p>The BI industry has always failed to understand and support its real customers. With few exceptions, BI product vendors and consultancies continue to be acquainted primarily with IT. This is a comfortable, compatible relationship, for BI and IT both tend to see the world from an engineering-oriented, techno-centric perspective. But the BI industry&#8217;s real customers are the folks who actually use BI tools to transform data into the meaningful information they need to make better decisions. Although some of these folks work in IT, most do not. Most are not software engineers. Most are not technologists. Most are people who have a job to do that requires an awareness of what&#8217;s going on and how they might influence it, which is primarily gleaned from data. To do this, they need tools that enlighten.</p>
<p>In the past, when the BI industry focused exclusively on building an infrastructure for decision support by developing technologies that acquire, improve, store, and dispense massive amounts of data at high speeds, it was perhaps legitimate to engage primarily with IT. Today, however, the BI industry can no longer sit comfortably in locked rooms filled with servers, discussing bits and bytes with their IT comrades. Most organizations that have purchased BI solutions now know that they need more than BI infrastructure—they need to make sense of all that data they&#8217;re collecting, most of which today serves as a massive paper weight. Unfortunately, the BI vendors that helped build the infrastructure can&#8217;t use the same perspective, knowledge, and skills that made them successful in the past to produce data sensemaking (analytics) and communication tools. They must now shift from an engineering-oriented, techno-centric mindset to one that is design-oriented and human-centric. They must venture into unfamiliar territory. If they don&#8217;t, they&#8217;ll be left behind. Unfortunately, most of the major BI players haven&#8217;t realized this yet. Before they can begin to make the shift, they must first wake up.</p>
<p>I was prompted to write these words when I read a recent blog post by Boris Evelson of Forrester Research entitled &#8220;<a href="http://blogs.forrester.com/boris_evelson/10-06-07-bi_vs_analytics" target="_blank">BI vs. Analytics</a>.&#8221; Despite my <a href="http://www.perceptualedge.com/blog/?p=637" target="_blank">impassioned disagreement</a> with Evelson several months ago when he attempted to list the features of &#8220;advanced data visualization solutions&#8221; without first developing an understanding of data visualization, I found myself shouting &#8220;Amen&#8221; when I read the first two sentences of his recent blog entry:</p>
<blockquote><p><em>In my definition—and believe it, I am fighting and defending it every day—analytics has always been, and will always be part of BI. </em></p></blockquote>
<p>Indeed it has, at least by definition. Unfortunately, only in recent years have a few vendors managed to make analytics a part of BI in terms of actual analytical functionality. As I continued to read Evelson&#8217;s blog, however, I soon stumbled over the following statement: &#8220;Today most of the top BI vendors do have&#8230;advanced analytics&#8230;functionality, so it&#8217;s really a commodity now.&#8221; Apparently Evelson and I still see things quite differently. Analytics are now being claimed but not actually supported by most BI vendors. What most of them call analytics is so far from actual data sensemaking, it would be amusing if it weren&#8217;t so tragic. Analytics is not and never will be a commodity (that is, a good &#8220;which is supplied without qualitative differentiation across a market,&#8221; according to Wikipedia).</p>
<p>Evelson is not unique as a BI industry thought leader who fails to understand analytics. Few BI industry analysts and thought leaders have ever actually done the work of a data analyst. They&#8217;ve written ETL code, they&#8217;ve planned and managed BI implementations, they&#8217;ve developed reports, they&#8217;ve developed BI methodologies and strategies, and they&#8217;ve learned the intricacies of BI technologies, but they&#8217;ve never actually dipped below the surface of data sensemaking. What I&#8217;m saying is that most of BI&#8217;s prominent voices have at best a vague understanding of analytics, so they&#8217;re not the people you ought to be listening to for insight and advice in this particular realm. Only a few new experts with actual experience in analytics have raised their voices within BI circles in recent years—people like Tom Davenport and Jeanne Harris, the authors of <em><a href="http://www.amazon.com/gp/product/1422103323?ie=UTF8&amp;tag=perceedge-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=1422103323" target="_blank">Competing on Analytics</a></em> and <em><a href="http://www.amazon.com/gp/product/1422177696?ie=UTF8&amp;tag=perceedge-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=1422177696" target="_blank">Analytics at Work</a>. </em>Their efforts are complementing statisticians and information visualization experts to raise the banner of BI&#8217;s ultimate purpose: data sensemaking. These are the voices that must be raised to a higher volume than those of the past if BI hopes to fulfill its original promise and ultimate goal—helping organizations function more intelligently by basing their decisions on evidence contained in data. The opportunity is now; the door is open. Not everyone in the BI industry, however, will walk through it.</p>
<p>Take care,</p>
<p><img class="alignnone size-full wp-image-18" title="Signature" src="http://perceptualedge.com/blog/wp-content/uploads/2006/11/Signature.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=814</wfw:commentRss>
		</item>
		<item>
		<title>Circle-Lust Continues</title>
		<link>http://www.perceptualedge.com/blog/?p=799</link>
		<comments>http://www.perceptualedge.com/blog/?p=799#comments</comments>
		<pubDate>Thu, 27 May 2010 17:53:55 +0000</pubDate>
		<dc:creator>Bryan Pierce</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=799</guid>
		<description><![CDATA[This blog entry was written by Bryan Pierce of Perceptual Edge.
Last week Stephen published an article entitled, &#8220;Our Irresistible Fascination with All Things Circular,&#8221; which describes how people&#8217;s seemingly innate love for circles has led to the creation of many dysfunctional graphs, such as pie charts. Today, another example of a poorly designed circular graph [...]]]></description>
			<content:encoded><![CDATA[<p><em>This blog entry was written by Bryan Pierce of Perceptual Edge.</em></p>
<p>Last week Stephen published an article entitled, &#8220;<a href="http://www.perceptualedge.com/articles/visual_business_intelligence/our_fascination_with_all_things_circular.pdf" target="_blank">Our Irresistible Fascination with All Things Circular</a>,&#8221; which describes how people&#8217;s seemingly innate love for circles has led to the creation of many dysfunctional graphs, such as pie charts. Today, another example of a poorly designed circular graph came to our attention. A couple months ago, Sunlight Labs hosted a contest called &#8220;Design for America,&#8221; which asked designers to create displays of government information for the purpose of making &#8220;government data more accessible and comprehensible to the American public.&#8221; A couple days ago, they announced the <a href="http://civsourceonline.com/2010/05/25/design-for-america-winners-announced-at-gov-2-0-expo/" target="_blank">winners</a>. In the data visualization category there are plenty of examples of what <em>not</em> to do, the worst of which appears below.</p>
<div style="text-align:center;"><a href="http://www.pitchinteractive.com/usbudget/" target="_blank"><img title="Circular Graph Displaying Government Spending and Media Coverage" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/05/circular-graphs.jpg" alt="" /></a></div>
<p>This display is supposed to be used to compare the 2009 US Federal Contract Spending for several sectors to the amount of Media Coverage that those sectors received during the year. As you can see, the designers seem to have fallen into the same sort of circle-lust that Stephen wrote about last week. In this case, the circular shape seems to be entirely arbitrary, because the quantitative data is encoded only by the thickness of the rings. These circles serve the same purpose as stacked-bar graphs; they&#8217;ve just been stretched out and distorted into a circular shape.</p>
<p>Ignoring the uselessness of the circular design for a moment, what does this visualization tell us? The only thing it tells me is that Defense spending was vastly under-reported in the media during 2009 while Health and Energy spending were comparatively over-reported. Without a lot of effort, I can&#8217;t make meaningful comparisons between the information in the other sectors, because they&#8217;re too small and hard to see, and I can&#8217;t even make comparisons between the three largest sectors with much accuracy. It&#8217;s also difficult to read the names of the smaller sectors because they overlap.</p>
<p>Although it might not be as sexy, two horizontal bar graphs next to one another would work better: one for Federal Contract Spending and one for Media Coverage. The Federal Contract Spending graph could be sorted from highest to lowest and the Media Coverage graph could present the bars in the same order. This would make it very easy to compare a sector&#8217;s spending and media coverage (because they&#8217;d be aligned in a row), it would make exceptions jump out (because there&#8217;d be a difference in the length of the bar in the Media Coverage graph compared to its neighboring bars), and it would be easy to read the names of all the sectors. It would still be hard to decode the contract spending in some of the smaller sectors accurately (because their bars would be so much smaller than the Dept. of Defense bar), but at least all of the bars would share a labeled quantitative scale, which would make the task easier.</p>
<p>Another useful alternative, which would put even more focus onto the relationship between Federal Contract Spending and Media Coverage, while making the exceptions jump out, would be a scatterplot that displayed Federal Contract Spending on the x-axis and Media Coverage on the y-axis.</p>
<p>It is unfortunate that most of the winners of Design for America contest don&#8217;t represent useful designs. The fact that the circular graph above was a winner either means that the judges of the contest had a terrible selection of designs to choose from, or that the judges don&#8217;t understand data visualization. This is sad, not just because people are being given $5,000 prizes for impoverished displays, but because this information is important and it could benefit people if it was presented in a useful way.</p>
<p>-Bryan</p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=799</wfw:commentRss>
		</item>
		<item>
		<title>BP Oil Collection – Is the Effort Really Improving?</title>
		<link>http://www.perceptualedge.com/blog/?p=790</link>
		<comments>http://www.perceptualedge.com/blog/?p=790#comments</comments>
		<pubDate>Wed, 26 May 2010 22:10:37 +0000</pubDate>
		<dc:creator>Stephen Few</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=790</guid>
		<description><![CDATA[A colleague sent me a link to Rachel Maddow&#8217;s website today where she features a graph that was used by BP senior vice president Kent Wells to show how the company&#8217;s efforts to collect the oil that&#8217;s spewing into the ocean at a rate of several thousands of barrels per day is improving. He talks [...]]]></description>
			<content:encoded><![CDATA[<p>A colleague sent me a link to <a href="http://maddowblog.msnbc.msn.com/_news/2010/05/26/4360037-a-misleading-graph-from-bp" target="_blank"><span style="text-decoration: underline;">Rachel Maddow&#8217;s website</span></a> today where she features a graph that was used by BP senior vice president Kent Wells to show how the company&#8217;s efforts to collect the oil that&#8217;s spewing into the ocean at a rate of several thousands of barrels per day is improving. He talks about adjustments that they&#8217;ve made to the siphon, then says &#8220;Here you can see how we&#8217;ve continued to ramp up.&#8221; But is this really what&#8217;s happening?</p>
<div style="text-align:center;"><a href="http://www.perceptualedge.com/blog/wp-content/uploads/2010/05/bp-oil-collection-graph.png"><img title="BP Oil Collection Graph" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/05/bp-oil-collection-graph-small.jpg" alt="" /></a></div>
<p>Although the graph doesn&#8217;t outright lie, BP is relying on the viewer&#8217;s assumption that a series of bars that increases in height represents an increase in performance. In this case it does not, however, because the bars display the cumulative amount of oil collected per day, not the daily amount. In my graph below, which shows daily oil collection, the story is obviously quite different.</p>
<div style="text-align:center;"><img title="BP Oil Collection Redesign" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/05/bp-oil-collection-2.jpg" alt="" /></div>
<p>While the amount of collection increased in the beginning, it has decreased or held steady for the last four days and is now well below the average amount of daily collection for this period as a whole. Things are definitely not getting better. How do you spin bad news like this? One way is to create a misleading graph, but cover your ass by doing it in a way that isn&#8217;t an outright lie.</p>
<p>Take care,</p>
<p><img class="alignnone size-full wp-image-20" title="Signature" src="http://perceptualedge.com/blog/wp-content/uploads/2006/11/Signature1.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=790</wfw:commentRss>
		</item>
		<item>
		<title>Oracle—Have you no shame?</title>
		<link>http://www.perceptualedge.com/blog/?p=784</link>
		<comments>http://www.perceptualedge.com/blog/?p=784#comments</comments>
		<pubDate>Thu, 29 Apr 2010 19:13:10 +0000</pubDate>
		<dc:creator>Stephen Few</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=784</guid>
		<description><![CDATA[Oracle Corporation takes its name from the treasured advice-givers of ancient Greece. Its name is ironic at times, however, when its advice is far from sage. When it comes to data visualization, and dashboard design in particular, Oracle gives some downright awful advice.
I received an email from one of my readers who uses Oracle&#8217;s OBIEE [...]]]></description>
			<content:encoded><![CDATA[<p>Oracle Corporation takes its name from the treasured advice-givers of ancient Greece. Its name is ironic at times, however, when its advice is far from sage. When it comes to data visualization, and dashboard design in particular, Oracle gives some downright awful advice.</p>
<p>I received an email from one of my readers who uses Oracle&#8217;s OBIEE tool to develop applications for his customers. He attached the following graph as an example of what Oracle teaches people when they attend the online course &#8220;Oracle BI Enterprise Edition – Build Good Dashboards&#8221;:</p>
<div style="text-align:center;"><img title="Graph from Oracle" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/04/oracle-graph.jpg" alt="" /></div>
<p>Based on this graph, I&#8217;m guessing that Oracle now outsources the development of its courses to the primate house of the local zoo. Although I haven&#8217;t seen the course myself, I&#8217;m told that this graph is typical. If this is what a leading Business Intelligence software vendor considers an effective way to display data, it&#8217;s no wonder that people are frustrated with the industry.</p>
<p>Almost every aspect of this graph fails miserably.</p>
<ul>
<li> It has been complicated by a 3-D rendering of the plot area and the bars (or tubes in this case), which does nothing but make it harder to interpret the values. Notice that the quantitative scale for &#8220;Dollars&#8221; and &#8220;Year Ago Dollars&#8221; on the left axis is aligned with the front of the graph, but the scale for &#8220;Forecasted Dollars&#8221; and &#8220;Forecasted units&#8221; on the right is aligned with the back of the graph.</li>
<li> I assume that the quantitative scale on the left is for dollars and the one on the right is for units. If this is the case, however, the title that appears on the right-&#8221;Forecasted Dollars, Forecasted Units&#8221;-is incorrect.</li>
<li> A dual-scaled graph—one quantitative scale on the left for dollars and one on the right for units—should usually be avoided, especially on dashboards, because it can be confusing and misleading. For example, notice that the forecasted units line intersects the dollars&#8217; bars, which would naturally incline anyone viewing the graph to compare their magnitudes, yet this would be entirely meaningless, because magnitude comparisons can&#8217;t be made between them when they have entirely different scales and units of measure.</li>
<li> Forecasted units would not be useful on this graph without including actual units, which has apparently been forgotten.</li>
<li> Lines representing &#8220;Forecasted Dollars&#8221; and &#8220;Forecasted Units&#8221; have been used to connect values per region, which makes no sense. The patterns formed by the lines are completely arbitrary and could be changed by sorting the regions in a different order.</li>
<li> The lines have large, clutter-inducing data points along them.</li>
<li> The lines appear to have some sort of drop shadow or lighting effect, which makes it look as if there are four lines rather than two.</li>
<li> &#8220;Dollars&#8221; for the current year and &#8220;Year Ago Dollars&#8221; are meant to be compared, not summed. By using stacked bars rather than placing separate bars side by side for the current year&#8217;s dollars and the previous year&#8217;s dollars, the comparison is difficult to make. The bars as a whole, consisting of both years stacked on one another, represent a sum that is useless in this situation.</li>
<li> Given the fact that the X-axis has the title &#8220;Region&#8221;, there is no reason to clutter the graph by including &#8220;REGION&#8221; in each of the labels.</li>
<li> The prominent vertical grid lines that separate the regions are unnecessary, resulting in clutter.</li>
<li> The tick marks along both vertical axes are unnecessary, because gridlines appear in the graph at the same positions.</li>
<li> The minor tick marks on the right-hand vertical axes are darker than the major tick marks.</li>
<li> The positions of the two Y-axis titles are inconsistent, resulting in a sloppy appearance.</li>
</ul>
<p>It is as if the person who created this &#8220;Good Dashboards&#8221; example of a graph did everything possible to make it as ineffective as possible.</p>
<p>How can a vendor that claims to understand data and presumes to teach people best practices in its use know so little? Oracle, you should be embarrassed.</p>
<p>Take care,</p>
<p><img class="alignnone size-full wp-image-18" title="Signature" src="http://perceptualedge.com/blog/wp-content/uploads/2006/11/Signature.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=784</wfw:commentRss>
		</item>
		<item>
		<title>The Unprecedented is Overrated</title>
		<link>http://www.perceptualedge.com/blog/?p=774</link>
		<comments>http://www.perceptualedge.com/blog/?p=774#comments</comments>
		<pubDate>Mon, 12 Apr 2010 19:04:03 +0000</pubDate>
		<dc:creator>Stephen Few</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=774</guid>
		<description><![CDATA[I was invited to speak at a recent TEDx event in Berkeley, but I withdrew late in the game when the TED folks asked me to sign a contract that would have given them the right to edit my talk however they wished without my permission. This is something that I never allow, because I&#8217;ve [...]]]></description>
			<content:encoded><![CDATA[<p>I was invited to speak at a recent TEDx event in Berkeley, but I withdrew late in the game when the TED folks asked me to sign a contract that would have given them the right to edit my talk however they wished without my permission. This is something that I never allow, because I&#8217;ve learned the hard way that even people with good intentions can screw things up by making bad edits. I&#8217;m writing today, not to talk about the rights of content creators to their work, but about the theme of this TEDx event, which struck me as misguided. I and the other speakers were asked to tie our talks to the theme &#8221;Doing the Unprecedented.&#8221; When I received this request from the event coordinator (TED calls them &#8220;curators&#8221;), I told her that I would tie my talk to this theme by making the case that doing the unprecedented is highly overrated.</p>
<div style="text-align:center;"><img title="Doing the Unprecedented" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/04/doing-the-unprecedented.jpg" alt="" /></div>
<p>Most of what we can do to make the world a better place involves, not doing the unprecedented, but <strong>doing what matters and what works</strong>, whether unprecedented or not. This might not be as exciting as the unprecedented, but it&#8217;s desperately needed. I believe that too many opportunities are wasted because we glorify the unprecedented for its own sake.</p>
<p>In the United States over 150,000 people die each year due to post-surgical complications. That&#8217;s three times the number of traffic fatalities. What makes this even more shocking, however, is the fact that half of these post-surgical deaths could have been prevented, not by doing the unprecedented, but by doing what medical professionals already know, but often fail to do.</p>
<div style="text-align:center;"><img title="Post Surgical Deaths" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/04/post-surgical-deaths.jpg" alt="" /></div>
<p>Many of these surgical failures are caused by the complexity of the work. When tasks are complex and you&#8217;re working under stress amidst distractions, it&#8217;s hard to remember everything that should be done. A movement is now underway to solve this problem, which involves nothing unprecedented, but something that another highly skilled group of professionals—pilots—have been doing for many years. Surgical teams are beginning to use checklists.</p>
<div style="text-align:center;"><img title="Checklist" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/04/checklist.jpg" alt="" /></div>
<p>Atul Gawande, who led the effort to create this surgical safety checklist for the World Health Organization, writes convincingly about the need for checklists in all professions that deal with complexity in his new book <em>The Checklist Manifesto: How to Get Things Right</em>.</p>
<div style="text-align:center;"><img title="Checklist Manifesto" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/04/checklist-manifesto.jpg" alt="" /></div>
<p>In the field of data visualization, failures are more common today than successes, not due to complexity, but to the fact that few people have been trained in the simple principles and practices of graph design. As a result, they rely on software tools to do the work for them and most of those tools lead them astray, encouraging them to produce silly, useless displays like this.</p>
<div style="text-align:center;"><img title="Silly Graph" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/04/silly-graph.jpg" alt="" /></div>
<p>This is a travesty, because we are living at a time when we could be making tremendous use of data to inform better decisions, and most of the rules for doing this well have been known for years.</p>
<p>Here&#8217;s an example of one of the earliest quantitative graphs, hand drawn by William Playfair in 1786. In his time, Playfair did the unprecedented by inventing or greatly improving many of the quantitative graphs that we use today.</p>
<div style="text-align:center;"><img title="Playfair" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/04/playfair.jpg" alt="" /></div>
<p>Back in 1983, Edward Tufte published his first book, <em>The Visual Display of Quantitative Information</em>, in response to the problem of ineffectively designed graphs. And yet, despite Tufte&#8217;s efforts, plus my own and the work of several others since, it appears that graphical communication skills in general might actually be declining. Problems like this silly pie chart on Fox News, which adds up to 193%, are far too common.</p>
<div style="text-align:center;"><img title="Fox News Graph" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/04/fox-news-graph.jpg" alt="" /></div>
<p>When did we lose sight of the fact that data displays are about data, expressed clearly, accurately, simply, and meaningfully? When did Business Intelligence (BI) take a wrong turn down the path to business stupidity? In our efforts to do the unprecedented, to make ourselves look impressive by decorating our data in impoverishing ways, we&#8217;ve adopted practices that make us dumb. Most of the principles for doing this right have been known for a long time. Let&#8217;s save the unprecedented for situations that demand it. For most data sense-making and presentation, let&#8217;s do what&#8217;s needed and what works.</p>
<p>Take care,</p>
<p><img class="alignnone size-full wp-image-18" title="Signature" src="http://perceptualedge.com/blog/wp-content/uploads/2006/11/Signature.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=774</wfw:commentRss>
		</item>
		<item>
		<title>The Vicious Cycle of Data Impoverishment</title>
		<link>http://www.perceptualedge.com/blog/?p=756</link>
		<comments>http://www.perceptualedge.com/blog/?p=756#comments</comments>
		<pubDate>Mon, 29 Mar 2010 22:38:08 +0000</pubDate>
		<dc:creator>Stephen Few</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=756</guid>
		<description><![CDATA[It is not difficult to present data in clear, accurate, and meaningful ways. The skills that are required to do this can be easily learned. They involve simple principles, most of which have been taught for many years. So why are most data displays so impoverished?
One reason is that we live in a time when [...]]]></description>
			<content:encoded><![CDATA[<p>It is not difficult to present data in clear, accurate, and meaningful ways. The skills that are required to do this can be easily learned. They involve simple principles, most of which have been taught for many years. So why are most data displays so impoverished?</p>
<p>One reason is that we live in a time when people use their computers as a replacement for skills. If you rely on Excel or almost any other software tool that creates charts to do the work for you, then your data displays will fail miserably. Effective data visualization practices are only built into a few products that are available today. Most software vendors have decided that they can satisfy us with razzle dazzle—pie charts that spin and bars shaped like pyramids—and so far we haven&#8217;t discouraged them by refusing to buy their silly products.</p>
<p>We are stuck in a vicious cycle of data impoverishment. Vendors show us bad examples of data visualization and we emulate them in our work. When vendors then look at what their customers are doing, they see examples that lead them to give us more of the same. Only a few vendors care enough for their customers to avoid the silly stuff that undermines our efforts.</p>
<p>This morning, I was faced with a fresh reminder of the current state of data impoverishment. A reader invited me to visit Microstrategy&#8217;s website to see the <a href="http://www.microstrategy.com/dashboards/#customershowcase" target="_blank">finalists of its customer dashboard competition</a>. What I found was depressing. I couldn&#8217;t find a single example of a dashboard that could be used to monitor information effectively. At best they could be used to look up a few facts when what&#8217;s actually needed is a rich set of comparisons. The problems that I found are too many to delineate, but all the dashboards suffer from a common flaw: they say too little and what they do say they say poorly.</p>
<p>To give you a sense of what I found, here are three of the winning entries:</p>
<div style="text-align:center;"><a href="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/us_postal_service_dashboard.jpg" target="_blank"><img title="US Postal Service Dashboard" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/us_postal_service_dashboard-small.jpg" alt="" /></a></div>
<p>The U.S. Postal Service is currently struggling to adapt to changes in the way that people communicate. If they find this dashboard helpful, it&#8217;s no wonder they&#8217;re struggling.</p>
<div style="text-align:center; margin-top:25px;"><a href="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/alloso_executive_dashboard.jpg" target="_blank"><img title="Alloso Executive Dashboard" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/alloso_executive_dashboard-small.jpg" alt="" /></a></div>
<p>This entire dashboard displays four measures, and none in a manner that&#8217;s particularly useful. The slight exception is that you can choose one of the four measures, such as Occupancy, which is currently selected above, to view a time-series display. Unfortunately, this combination bar and line graph performs poorly compared to a simple line graph with three lines.</p>
<div style="text-align:center; margin-top:25px;"><a href="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/herbalife_sales_dashboard.jpg" target="_blank"><img title="Herbalife Sales Dashboard" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/herbalife_sales_dashboard-small.jpg" alt="" /></a></div>
<p>Poor Eli Kiwanuka. It appears that he was photographed in a police lineup. This &#8220;Executive Dashboard Summary - Sales&#8221; displays the performance of one person only. How is this a summary of sales performance? Rather than a dashboard for monitoring performance, this provides a means to look up a few facts about individual sales people, one at a time, and requires that you wade through a series of poorly-designed, eye-assaulting graphs to painfully piece together a picture of one person&#8217;s performance.</p>
<p style="margin-top:25px;">I don&#8217;t blame Microstrategy&#8217;s customers. They&#8217;re working within the constraints of the tool and emulating impoverished examples, which is probably all they&#8217;ve ever seen. I mostly blame the folks at Microstrategy, who should know better. This competition gave the folks at Microstrategy a perfect opportunity to critique the designs that were submitted and show their customers how much better these dashboards could work if designed more effectively, assuming their software makes this possible. Did they miss this opportunity because they don&#8217;t know any better themselves?</p>
<p>Impoverished displays of data are what you get when vendors care a lot about sales but little about the real needs of their customers.</p>
<p>Take care,</p>
<p><img class="alignnone size-full wp-image-20" title="Signature" src="http://perceptualedge.com/blog/wp-content/uploads/2006/11/Signature1.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=756</wfw:commentRss>
		</item>
		<item>
		<title>Tableau Public – A Powerful New Tool for Democratizing Data</title>
		<link>http://www.perceptualedge.com/blog/?p=753</link>
		<comments>http://www.perceptualedge.com/blog/?p=753#comments</comments>
		<pubDate>Wed, 10 Mar 2010 17:14:42 +0000</pubDate>
		<dc:creator>Stephen Few</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=753</guid>
		<description><![CDATA[In the January 2010 issue of the Visual Business Intelligence Newsletter I redesigned a display of health care costs that appears on GE&#8217;s website. I did this using Tableau and provided a link in the article to a live version of the interactive display on the Web. You might have wondered what I did to [...]]]></description>
			<content:encoded><![CDATA[<p>In the January 2010 issue of the <em>Visual Business Intelligence Newsletter</em> I redesigned a display of health care costs that appears on GE&#8217;s website. I did this using Tableau and provided a link in the article to a live version of the interactive display on the Web. You might have wondered what I did to make that live analytical application publicly available. The answer is that I used something brand new from Tableau called <em>Tableau Public</em>. It was developed as a means for people to share live analytical displays that are of interest to the public via the Web.</p>
<p>A few months ago, when I was approached by UNESCO to help them find a means to share worldwide education data via the Web, I put them in touch with Tableau, and you can now view that information on UNESCO&#8217;s website, thanks to Tableau Public. The service is free and includes almost all of Tableau&#8217;s usual functionality. You upload your data, build the analytical display or entire application using Tableau Public, and then take the little snippet of HTML code that is generated to embed it right into your website, even though the data and functionality is hosted in the cloud.</p>
<p>I believe in the democratization of important data. I believe in providing people with simple tools for exploring and analyzing data. Tableau Public makes this possible in a way that is more analytically powerful than any free service that&#8217;s been offered to date. I appreciate Tableau&#8217;s willingness to provide this service, because it not only helps people explore important public data, it does so in a way that demonstrates to those who are stuck with obsolete tools how much more they could do if they had a good visual analysis tool at their fingertips.</p>
<p>For a quick look at Tableau Public, watch the beautiful demonstration video that Tableau has produced, appropriately titled <a href="http://www.tableausoftware.com/public/" target="_blank">Data In, Brilliance Out</a>.</p>
<p>Take care,</p>
<p><img title="Signature" src="http://perceptualedge.com/blog/wp-content/uploads/2006/11/Signature.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=753</wfw:commentRss>
		</item>
		<item>
		<title>Big BI is Stuck: Illustrated by SAP BusinessObjects Explorer</title>
		<link>http://www.perceptualedge.com/blog/?p=727</link>
		<comments>http://www.perceptualedge.com/blog/?p=727#comments</comments>
		<pubDate>Tue, 09 Mar 2010 19:34:18 +0000</pubDate>
		<dc:creator>Stephen Few</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=727</guid>
		<description><![CDATA[The software vendors that have dominated the business intelligence market for the last 15 years or so have hit the wall and they haven&#8217;t a clue how to scale it. They&#8217;re stuck because they insist on applying the skills and methods that helped them successfully build data warehouses and production reporting systems to a radically [...]]]></description>
			<content:encoded><![CDATA[<p>The software<strong> </strong>vendors that have dominated the business intelligence market for the last 15 years or so have hit the wall and they haven&#8217;t a clue how to scale it. They&#8217;re stuck because they insist on applying the skills and methods that helped them successfully build data warehouses and production reporting systems to a radically different problem: data sense-making. Their past achievements were grand feats of engineering, solved almost entirely with technology, but data sense-making (also known as, analytics) requires a different approach—one that leads with design, not engineering, and focuses on people, their needs and abilities, not technology. Attempts of big BI companies to open the data repository for exploration have produced some embarrassing tools, and they keep on coming. One of the newest examples is <em>SAP BusinessObjects Explorer</em>.</p>
<p>In an article written for <em>SAP Insider</em>, Jeff Veis, Vice President of Industry Solutions and Strategic Initiatives for SAP BusinessObjects, sets expectations for BusinessObjects Explorer:</p>
<blockquote><p><em>When most users think of business intelligence (BI), they think of it in a very traditional sense: predefined reports that can&#8217;t account for real-time market fluctuations and that don&#8217;t allow business users to truly engage with the information.</em></p>
<p><em>SAP BusinessObjects Explorer software changes all that. Using the tool, companies can extend the reach of BI to all business users — not just a small subset of expert data analysts&#8230;SAP BusinessObjects Explorer enables deep exploration of vast amounts of data, enabling users to identify, manipulate, and act on insights that a pre-structured, traditional BI tool would be hard pressed to deliver.</em></p>
<p>(&#8221;Transform the Way Your Company Thinks about Business Intelligence&#8221;, Jeff Veis, <em>SAP Insider</em>, Jan-Feb-Mar 2010)</p></blockquote>
<p>In the same issue of SAP Insider, Jonathan D. Becher, Senior Vice President of Marketing, answered the question &#8220;Can you contrast SAP BusinessObjects Explorer to the business intelligence (BI) tools our readers now have in place?&#8221; as follows:</p>
<blockquote><p><em>SAP BusinessObjects Explorer is about data exploration, not report generation.</em></p>
<p><em>The second big differentiator is accessibility. Everybody who needs access to your company&#8217;s business data can use SAP BusinessObjects Explorer. While many of your readers work for companies that are now running BI solutions, most of the employees who need access to business data can&#8217;t use the tools. Mastering their requirements and interfaces just isn&#8217;t practical, so the BI tools remain the exclusive purview of a relatively scarce number of power users, analysts, and IT department members, and the broader business user community has to go through one of these intermediaries to get their questions answered.</em></p>
<p><em>It&#8217;s analogous to the way people made phone calls a few generations back. To place a call, you would pick up the receiver and wait for an operator to get on the line and ask to whom you&#8217;d like to speak. If that person was in a different part of the country, a series of local operators, each covering specific regions of the country, worked to facilitate the connection. There was nothing self-service about it. In a very real sense, this is the way BI — and frankly, decision making — works today.</em></p>
<p><em>SAP BusinessObjects Explorer changes this. It is so easy to use, it democratizes access to data.</em></p>
<p>(&#8221;Better Answers through Better Questions&#8221;, Jonathan D. Becher, <em>SAP Insider</em>, Jan-Feb-Mar 2010)</p></blockquote>
<p>This is roughly the same explanation that big BI companies have been giving for the last 15 years. Terms like &#8220;self-service BI&#8221; and the &#8220;democratization of data&#8221; have been used in association with every new product that they&#8217;ve introduced since the day that the term &#8220;business intelligence&#8221; was coined. Obviously, however, none of their past products have achieved this, which is why they keep coming out with new ones to cure the ills caused by every unnecessarily complicated under-performing product that they&#8217;ve delivered in the past. But if the previous generation of products didn&#8217;t achieve these goals, why should we believe that BusinessObjects Explorer will?</p>
<p>Let&#8217;s look at an example of how Becher thinks this new tool will operate in the workplace.</p>
<blockquote><p><em>It&#8217;s pretty common for participants to show up at planning meetings with their go-to PowerPoint presentation, replete with their favorite metrics about what strategic concerns the business faces. And it&#8217;s extraordinarily common for these metrics not to match up — at all. Consider a simple question: How many new customers did we acquire last quarter?</em></p>
<p><em>The operations organization posits that people who bought and then returned products do not constitute &#8220;new customers.&#8221;</em></p>
<p><em>Metrics from the head of marketing, who views returns as a quality issue, not a sales issue, do count customers who bought and subsequently returned products last quarter as &#8220;new customers.&#8221;</em></p>
<p><em>The Large Enterprise Sales organization recognizes &#8220;new customers&#8221; as only those with orders in excess of US$10,000.</em></p>
<p><em>Given that the purpose of the meeting is to devise or refine plans, do you really want to lose another planning cycle sending participants off in pursuit of a new definition of the term &#8220;new customer,&#8221; asking them to regenerate their figures? With SAP BusinessObjects Explorer, this could be done in real time, with all stakeholders looking at the same data.</em></p></blockquote>
<p>So, what this new product will finally put within our reach is the earth-shattering ability to get an answer to the question &#8220;How many new customers did we acquire last quarter?&#8221; without having to involve the IT department. Be still my heart; it&#8217;s all a-flutter. A few paragraphs later in the same article, still referring to this data sense-making miracle, Becher states: &#8220;If SAP BusinessObjects Explorer sounds revolutionary, it&#8217;s because it is revolutionary.&#8221; [Long dumbfounded silence] Huh?!!!</p>
<p>Lest we be accused of missing the real miracle here, let&#8217;s take into account Becher&#8217;s claim that this new tool will eliminate the confusion and roadblocks to consensus caused by the fact that the Operations, Marketing, and Sales departments each define new customers differently, and therefore come up with different new customer counts. This must be some magical tool if it somehow puts everyone on the same page, despite their different perspectives. If this sounds to you like marketing smoke and mirrors, you&#8217;re getting the picture.</p>
<p>And it gets better.</p>
<blockquote><p><em>Take the example one step further. Let&#8217;s say that I am the head of the Large Enterprise Sales organization, and I want to compare sales in select regions of the country. I throw in a few other requirements, and we&#8217;re no longer dealing with a standard query — so I have to enlist the help of an analyst. The analyst needs certain warehouse statistics, but finds the right data isn&#8217;t loaded, so a call goes out to a data architect, who in turn enlists the help of others to cleanse and load the data. Eventually, I get the report. And 99 times out of 100, the experience ends with something like this: &#8220;Oh! That&#8217;s not the question I meant to ask. I meant to specify New York City, not New York state, and I actually needed to account for sales that took place in the wake of a new promotional campaign.&#8221;</em></p></blockquote>
<p>With BusinessObjects Explorer, according to Becher, problems like these will go away. How? Through a new interface that will allow you to ask questions of your data similar to the way you search the Web with Google today. Anyone who understands BI, however, knows that no interface, no matter how magical, will give you access to data that isn&#8217;t available, will clean data that is dirty, or will simplify the navigation of complicated operational databases. These improvements are accomplished by a whole lot of hard work on the back end (probably done by someone in IT, because only they have access) to prepare the data for use.</p>
<p>Enough of these same old hollow claims by the big BI vendors that have been frustrating and angering users for years. Are we going to let them continue to raise our hopes and dash them forever, never going elsewhere for answers?</p>
<p>Let&#8217;s forget what SAP BusinessObjects is saying about Explorer and take an honest, objective look at it ourselves.</p>
<p><strong>Caution</strong></p>
<p>Don&#8217;t mistake what I&#8217;ve written as a case against Big BI in favor of Small BI. It is entirely possible for large BI vendors to provide effective tools for data sense-making. To do this, they need to switch from a technology-centric engineering-focused approach to a human-centric design-focus approach, and base their efforts on a deep understanding of data sense-making. Most of the small BI vendors have done no better in cracking this nut than the big guys. They might be more agile due to their small size and thus able to bring a new product to market more quickly, but when they approach the problem in the same dysfunctional way as the big guys, they fail just as miserably. Just like politicians who sell themselves as &#8220;not like the guys in Washington,&#8221; new players in the BI space often point to the failures of the big guys and then go on to do exactly the same. I am not making a case of small vs. big, but of clear-headed, informed, and effective vs. an old paradigm that doesn&#8217;t work for the challenges of data sense-making.</p>
<p><strong>Review of BusinessObjects Explorer</strong></p>
<p>As quoted above, Jeff Veis claims that <em>&#8220;</em><em>SAP BusinessObjects Explorer enables deep exploration of vast amounts of data, enabling users to identify, manipulate, and act on insights that a pre-structured, traditional BI tool would be hard pressed to deliver.&#8221;</em> To test these claims, I asked Bryan Pierce who works with me here at Perceptual Edge to access an evaluation copy of the tool on SAP&#8217;s website and put it through its paces. The following are Bryan&#8217;s findings.</p>
<blockquote><p><em>Basically, I didn&#8217;t really find anything good about SAP BusinessObjects Explorer. If it was all you had, you could use it to perform some analysis, and it might be a little easier for certain types of exploratory analysis than a tool like Excel, but compared to other tools that are actually designed for exploratory analysis, it&#8217;s a joke. Here is an example of the BusinessObjects Explorer interface:</em></p></blockquote>
<div style="text-align:center;"><a href="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/bo_explorer.jpg" target="_blank"><img title="BusinessObjects Explorer" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/bo_explorer_small1.jpg" alt="" /></a></div>
<blockquote><p><em>1)  Perhaps the single biggest problem with SAP BusinessObjects Explorer is that it only allows you to view one graph at a time. In addition to this, it only allows a maximum of three measures in a graph, so you can only have three lines in a line graph and three segments in a stacked bar graph. There is a notable—but not useful—exception to the single graph rule. If you select two or three measures and choose a pie chart or a radar chart it will create two or three graphs next to each other, although I couldn&#8217;t find a way to make them share a quantitative scale (this would only be applicable for radar charts). Unfortunately, I couldn&#8217;t find any way to get multiple versions of any useful graph types.</em></p></blockquote>
<blockquote><p><em>2)  Although you can view up to three measures at once, there&#8217;s even less functionality when viewing categorical variables. For instance, the dataset I analyzed had three years&#8217; worth of quarterly data. Using a line graph, I could view the values for the three years and I could look at a particular year and see the quarterly values, but I couldn&#8217;t find a way to view all three years at a quarterly level simultaneously (either by using a single line that spanned twelve quarters or by using three lines that each spanned four quarters). Similarly, while I could get two lines or stacked-bar segments for Profit and Expenses (both measures), I couldn&#8217;t find a way to get separate lines or stacked-bar segments for States or Cities.</em></p></blockquote>
<blockquote><p><em>3)  In attempting to determine why I couldn&#8217;t get a line graph to display quarterly sales for more than one year at a time, I uploaded a custom dataset of time-series data. It appears that BusinessObjects Explorer handles time-series data very poorly. The dataset I uploaded contained daily sales data for two products over 90 days. When I first opened it, this is how the date variable was displayed<br />
</em></p></blockquote>
<div style="text-align:center;"><em><img title="Unsorted Dates" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/date_unsorted.jpg" alt="" /></em></div>
<blockquote><p><em>As you can see, the variable has been sorted by the total sales for each date, rather than the dates themselves. To fix this, I clicked the little down arrow in the top-right corner of the image and told it to re-sort by date. This is what appeared:</em></p></blockquote>
<div style="text-align:center;"><img title="Sorted Dates" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/date_sorted.jpg" alt="" /></div>
<blockquote><p><em>For some reason, every date except for one of them disappeared; I had to close and reopen the dataset to get the other dates to reappear. So, apparently there&#8217;s something buggy with the way BusinessObjects Explorer handles dates. In fact, the only way I could get time-series data to work correctly was when I separated years, months, and days into different variables. If you do this, just make sure that you format the months as numbers, because that&#8217;s the only way they&#8217;ll sort correctly. Unless, of course, you like alphabetical time:</em></p></blockquote>
<div style="text-align:center;"><img title="Alphabetical Time" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/alphabetical_time.jpg" alt="" /></div>
<blockquote><p><em>4)  With BusinessObjects Explorer, you&#8217;re not able to customize the appearance of graphs in any way. This isn&#8217;t as important as it would be for a data presentation tool, like Excel, but even an analysis tool should let you do things like disable the gridlines or remove the data points from a line graph.</em></p></blockquote>
<blockquote><p><em>5)  Like most web-based analysis tools, it responds too slowly for seamless interaction. After a filter is applied there is short delay while the application contacts the server for the new data. This delay is only about one or two seconds long, but that&#8217;s still more than enough to hamper an analyst&#8217;s train of thought.</em></p></blockquote>
<blockquote><p><em>6)  There were several times when I switched over to time-based views where the graphs weren&#8217;t sorted in chronological order. For example, I was viewing a bar graph that showed Margin and Quantity Sold by State, which was sorted by the Margin values. I then switched from viewing the graph by State to viewing it by Quarter. The sort by Margin was still in effect so the graph displayed the bars in this order: Q4, Q2, Q1, Q3. It&#8217;s one thing to allow people to arrange time-series information in non-chronological order for those extremely rare cases when that might be useful. It&#8217;s quite another thing to allow time-series data to be arranged in this way by the software.</em></p></blockquote>
<blockquote><p><em>7)  The program includes all the standard graph types and a few unexpected ones, such as treemaps (although, how useful is a treemap when it only takes up about 1/3 of your screen space?), but it doesn&#8217;t include box plots.</em></p></blockquote>
<blockquote><p><em>8)  Speaking of treemaps, they should be used to navigate hierarchical data, for instance, to view sales and margin data at the regional, state, and city levels. However, the treemaps in BusinessObjects Explorer appear to only allow a single level of hierarchy. Here is a treemap that is displaying Sales and Margin by State:</em></p></blockquote>
<div style="text-align:center;"><a href="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/treemap.jpg" target="_blank"><img title="Treemap Small" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/treemap_small.jpg" alt="" /></a></div>
<blockquote><p><em>In a functional treemap, I should be able to display each city&#8217;s data as a smaller square within the corresponding state square and, potentially, to select a particular city and drill into it to see even more detailed data (such as sales by individual stores). Unfortunately, I could find no way to do any of this. As a side note, red and green are the worst two colors to use to encode data (in this case they&#8217;re encoding margin values from highest to lowest), because most people who are colorblind (10% of males and 1% of females) can&#8217;t distinguish between the two colors. I would have switched to more suitable colors, but, as I mentioned before, I couldn&#8217;t find a way to modify the appearance of any of the graphs.</em></p></blockquote>
<blockquote><p><em>9)  The user doesn&#8217;t have enough control of the layout of the display. When viewing both the filter controls and the visualization, the filter controls take up as much space as the visualization. You can hide the filter controls, which gives the visualization more space, but there&#8217;s no way to just reduce the filter controls&#8217; size. In addition to using too much space, the filter controls make poor use of the space they require. The filters were designed so that each filter column has the same width. As a result, there are some filters that contain large amounts of wasted space, like the Quarter filter below:</em></p></blockquote>
<div style="text-align:center;"><img title="Column Width" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/column_width.jpg" alt="" /></div>
<blockquote><p><em>If the filters had been designed to size themselves based on their contents, more filters could have fit on the screen at the same time, which would make filtering more efficient.</em></p></blockquote>
<blockquote><p><em>10)  The shape of the graph is also a problem sometimes. Because the plot area is about four times wider than it is tall, it makes certain types of graphs awkward to read, such as scatterplots (which should usually be roughly square in shape) or vertical bar graphs that only have a few bars (in which case, the bars might be wider than they are tall).</em></p></blockquote>
<blockquote><p><em>11)  In the filter section, totals are displayed next to each categorical value. For instance, here is the quarter variable:</em></p></blockquote>
<div style="text-align:center;"><img title="Inconsistent Precision" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/03/inconsistent_precision.jpg" alt="" /></div>
<blockquote><p><em>As you can see, the totals aren&#8217;t all written to the same precision. The Q1, Q3, and Q4 totals are written to the tenth of a dollar, while the Q2 total is written to the whole dollar. This means the decimal points don&#8217;t line up, which makes the numbers harder to read.</em></p></blockquote>
<p>These are the problems that Bryan found while spending two hours with the product. A deeper look would no doubt produce a longer list, but Bryan was only trying to spot the big problems that most severely undermine the products use for data exploration and analysis. Had we taken the time to compare BusinessObjects Explorer to one of the good data exploration and analysis products that are available today, such as Tableau or Spotfire, the claim by BusinessObjects that Explorer is &#8220;revolutionary&#8221; would be exposed more clearly for what it is: a sad statement about this Big BI company&#8217;s understanding of data sense-making. BusinessObjects is struggling to catch up with human-centered, design-focused companies like Tableau and Spotfire, which are running circles around them, and making them look pathetic. SAP BusinessObjects and most other Big BI companies haven&#8217;t taken the time to understand data sense-making in general, data visualization in particular, or even the real needs of their customers. They need a new mindset, but learning to see the world with new eyes is hard. By the time they figure this out and make the shift, will it be too late?</p>
<p>Take care,</p>
<p><img title="Signature" src="http://perceptualedge.com/blog/wp-content/uploads/2006/11/Signature.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=727</wfw:commentRss>
		</item>
		<item>
		<title>What can the Wall Street Journal teach us about information graphics?</title>
		<link>http://www.perceptualedge.com/blog/?p=707</link>
		<comments>http://www.perceptualedge.com/blog/?p=707#comments</comments>
		<pubDate>Mon, 22 Feb 2010 22:08:07 +0000</pubDate>
		<dc:creator>Stephen Few</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=707</guid>
		<description><![CDATA[
A new book about information graphics was published last month titled The Wall Street Journal Guide to Information Graphics, by Dona M. Wong, the graphics director for this respected newspaper. I get excited whenever a new book about data visualization is published, especially one that teaches practical techniques, because too few of us are working [...]]]></description>
			<content:encoded><![CDATA[<div style="text-align:center;"><img title="The Wall Street Journal Guide to Information Graphs" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/wsj.jpg" alt="" /></div>
<p>A new book about information graphics was published last month titled <em>The Wall Street Journal Guide to Information Graphics</em>, by Dona M. Wong, the graphics director for this respected newspaper. I get excited whenever a new book about data visualization is published, especially one that teaches practical techniques, because too few of us are working in this field. This new addition to my library has its merits, but unfortunately it has its problems as well.</p>
<p>To begin, this book is not what its advertising claims it to be. Rather than &#8220;the definitive guide to the graphic presentation of information&#8221; and &#8220;an invaluable reference work for students and professionals in all fields,&#8221; which the dust cover claims, it would be more accurately described as a graphical style guide for financial journalism. I suspect that the content of this book was in fact written by Wong originally as the graphics style guide that is used internally at <em>The Wall Street Journal</em>, and that the newspaper envisioned a new source of revenue by revising it slightly and publishing it as a book. There&#8217;s certainly nothing wrong with that, but they should have more clearly described its scope as restricted primarily to the interests of financial journalism.</p>
<p>The quality of this book that will no doubt appeal to many potential readers is, in my opinion, its fundamental failure: it includes relatively few words. Unlike her mentor, Edward Tufte, who uses words liberally and eloquently, Wong&#8217;s style of writing is closer to the bullet point approach that Tufte disdains. In this respect, it is different from my books, which have at times been criticized for having too many words. A few readers have remarked that I don&#8217;t follow my own principle of simplicity in my books because I use too many words to present the material. What they don&#8217;t appreciate is the important difference between simplicity and over-simplification. I provide the context that people need to understand what I teach. When you tell people what they should and shouldn&#8217;t do without explaining why, they can at best learn only superficially. To learn deeply, people must understand things at a conceptual level-why things work as they do. This requires more than a few words. Wong&#8217;s book has too few. In total, the book includes 120 pages of actual content, which consists mostly of figures. The fact that so many figures exist is not the problem; it is in failing to explain her recommendations that she errs. She says &#8220;Do this and don&#8217;t do that,&#8221; but rarely helps her readers understand why. One problem with this is that Wong isn&#8217;t always right, but people who are learning about information graphics for the first time won&#8217;t realize this.</p>
<p>Wong states a few rules that entirely miss the mark, but more often she emphatically states what are at best rules of thumb, which must allow many exceptions. While reading the book, I found myself frequently writing comments in the margins such as &#8220;it all depends&#8221; and even &#8220;not true.&#8221; To give you a sense of this, here are a few excerpts from the book, followed by my margin comments:</p>
<table style="border:0; color:#666666;" border="0" cellspacing="0" cellpadding="4">
<tbody>
<tr>
<td style="border-bottom:1px solid #AAAAAA;" width="210" valign="top">Wong&#8217;s Words</td>
<td style="border-bottom:1px solid #AAAAAA;" width="10" valign="top"></td>
<td style="border-bottom:1px solid #AAAAAA;" width="370" valign="top">My Margin Comments</td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
<tr>
<td width="210" valign="top">&#8220;Do not plot more than four lines on a simple [line] chart.&#8221; (p. 54)</td>
<td width="10" valign="top"></td>
<td width="370" valign="top"><strong>Rule of thumb with many exceptions.</strong> Depending on the nature of the data (for example, how close   the lines are in value and how much variability in values exists along the   lines), a graph could contain many more than four lines and still work quite   well. Also, when line graphs are used, not for comparing individual lines,   but to provide an overview in a way that features exceptions and predominant   patterns, far more than four lines can be included.</td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
<tr>
<td width="210" valign="top">&#8220;Don&#8217;t use different colors or colors on the opposite side of the color   wheel in a multiple-bar chart.&#8221; (p. 40)</td>
<td width="10" valign="top"></td>
<td width="370" valign="top"><strong>It depends. </strong>Different hues   work best for differentiating items, which is what&#8217;s usually needed in line   graphs with multiple lines, bar graphs with multiple sets of bars, and so on.<strong></strong></td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
<tr>
<td width="210" valign="top">&#8220;Choose the y-axis scale so that the height of the fever line   occupies roughly two-thirds of the chart area.&#8221; (p. 51)</td>
<td width="10" valign="top"></td>
<td width="370" valign="top"><strong>Ineffective rule. </strong>I think what Wong&#8217;s trying to do is bank the line to 45° so it&#8217;s not so flat that the trend and pattern can&#8217;t be seen,   but this approach won&#8217;t guarantee this result. Setting the y-axis scale to begin just a little below the lowest value and end just a little above the   highest value makes better use of the plot area. Once this is done, the aspect ratio of the graph (the ratio of its width to its height) can be   adjusted to prevent the slope of the line from being either too shallow or too   steep.</td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
<tr>
<td width="210" valign="top">&#8220;A segmented bar chart in general is more effective than a pie chart   at showing proportions of a whole.&#8221; (p. 79)</td>
<td width="10" valign="top"></td>
<td width="370" valign="top"><strong>Not true.</strong> Actually, for   showing a single part-to-whole relationship, a segmented (a.k.a., stacked) bar   is never more effective than a pie chart, and in my opinion, neither works as well as a standard bar graph.</td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
<tr>
<td width="210" valign="top">&#8220;Always label the value of a vertical bar if it is close to zero.&#8221;</td>
<td width="10" valign="top"></td>
<td width="370" valign="top"><strong>It depends on how the graph is   used.</strong> Labeling these values is only useful when people need precise   values, and why would this rule apply to vertical bars and not to horizontal   bars?</td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
<tr>
<td width="210" valign="top">When it is appropriate to use different color intensities to   differentiate series of bars in a bar graph, Wong states: &#8220;The shading of the   bars should move from the lightest to the darkest for easy comparison.&#8221; (p.   67)</td>
<td width="10" valign="top"></td>
<td width="370" valign="top"><strong>Huh?</strong> Why not ever from the   darkest to the lightest bars?</td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
<tr>
<td width="210" valign="top">&#8220;When plotting horizontal bars over time, the bars should be ordered   from the most recent data point [at the bottom] and go back in time   [proceeding upward].&#8221; (p. 71)</td>
<td width="10" valign="top"></td>
<td width="370" valign="top"><strong>Don&#8217;t do this.</strong> I recommend   that horizontal bars never be used for time-series data, because it is much   more natural for people to think of time as proceeding horizontally from left to   right.</td>
</tr>
<tr>
<td width="210" valign="top"></td>
<td width="10" valign="top"></td>
<td width="370" valign="top"></td>
</tr>
</tbody>
</table>
<p>This is just a sample of the problems that I noted. Another point on which Wong and I definitely disagree has to do with her recommendations for making the quantitative scales of multiple line graphs different in an effort to make them more comparable, which she addresses in four different sections of the book. In one instance, she wants to make sure that people don&#8217;t miss the fact that the following two stocks increased at much different rates, which might occur if they were shown the following graph.</p>
<div style="text-align:center;"><img title="Example of problem" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/example-problem.jpg" alt="" /></div>
<p>Her solution is to show the following graph instead.</p>
<div style="text-align:center;"><img title="Example 1" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/example-1.jpg" alt="" /></div>
<p>Although I share Wong&#8217;s concern, her solution is misleading. To feature the differences in percentage change, the same percentage scale could be used for both graphs, as shown below.</p>
<div style="text-align:center;"><img title="Example 2" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/example-2.jpg" alt="" /></div>
<p>The best solution, however, unless the differences in the magnitudes of change really don&#8217;t matter, would be to tell a richer story by presenting the following collection of graphs.</p>
<div style="text-align:center;"><img title="Combined Examples" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/combined-examples.jpg" alt="" /></div>
<p>Given the fact that Wong studied under Tufte&#8217;s supervision at Yale, I expected to find little with which I would disagree. I was surprised to discover otherwise. Despite our disagreements, I agree with most of Wong&#8217;s suggestions, but in almost all such cases she restates what I and others have said before. If you&#8217;re already an expert in data visualization, you&#8217;ll learn little from this book, except a few techniques that are specific to financial journalism. If you&#8217;re a novice hoping to learn the fundamentals of information graphics, be warned that this book advocates a few bad practices along with the good, and it rarely explains the concepts that you must understand to produce effective graphs on your own.</p>
<p>Take care,</p>
<p><img class="alignnone size-full wp-image-18" title="Signature" src="http://perceptualedge.com/blog/wp-content/uploads/2006/11/Signature.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=707</wfw:commentRss>
		</item>
		<item>
		<title>NodeXL: Network Visualizations in Excel</title>
		<link>http://www.perceptualedge.com/blog/?p=680</link>
		<comments>http://www.perceptualedge.com/blog/?p=680#comments</comments>
		<pubDate>Wed, 10 Feb 2010 21:58:29 +0000</pubDate>
		<dc:creator>Bryan Pierce</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://www.perceptualedge.com/blog/?p=680</guid>
		<description><![CDATA[This blog entry was written by Bryan Pierce of Perceptual Edge.
The chances are good that you&#8217;ve seen network visualizations before, such as the one below in which the circles and octagons represent large U.S. companies and each connecting line represents a person who sits on the board of both companies.

(This image was created by Toby [...]]]></description>
			<content:encoded><![CDATA[<p><em>This blog entry was written by Bryan Pierce of Perceptual Edge.</em></p>
<p>The chances are good that you&#8217;ve seen network visualizations before, such as the one below in which the circles and octagons represent large U.S. companies and each connecting line represents a person who sits on the board of both companies.</p>
<div style="text-align:center; color:#666666;"><a href="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/corporate_map.png" target="_blank"><img title="Corporate Map Network Visualization" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/corporate_map_small1.png" alt="" /></a></div>
<p style="text-align:center;">(This image was created by Toby Segaran: <a href="http://blog.kiwitobes.com/?p=57" target="_blank">http://blog.kiwitobes.com/?p=57</a>)</p>
<p>While these types of graphs have become more common in recent years, there&#8217;s still a good chance that you&#8217;ve never created one yourself. This is because, traditionally, to create network visualizations, you&#8217;ve either needed specialized (and often unwieldy) network visualization software or a full-featured (and usually expensive) visualization suite. That&#8217;s no longer the case. A team of contributors from several universities and research groups, including the University of Maryland and Microsoft Research, recently released <a href="http://nodexl.codeplex.com/" target="_blank">NodeXL</a>, a free add-in for Excel that allows you to create and analyze network visualizations.</p>
<p>Using NodeXL you can import data from a variety of file formats and it will automatically lay out the visualization for you, using one of twelve built-in layout algorithms. For instance, here&#8217;s one<span style="color: #008000;"> </span>with a circular layout:</p>
<div style="text-align:center;"><img title="Network Graph with Circular Layout" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/circular_layout.png" alt="" /></div>
<p>Below, the same dataset is laid out using the Harel-Koren Fast Multiscale algorithm, which is one of NodeXL’s two force-directed algorithms. Force-directed algorithms are designed to make all the lines (a.k.a. “edges”) about the same length and to minimize line crossings, which can make for a more aesthetically pleasing and readable graph.</p>
<div style="text-align:center;"><img title="Network Graph with Harel-Korem Layout" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/harel-korem_layout.png" alt="" /></div>
<p>You can also manually select and position the data points (a.k.a. &#8220;vertexes&#8221; or &#8220;nodes&#8221;). Here I&#8217;ve selected a group of nodes, which are highlighted in red, and dragged them away from the rest of the graph.</p>
<div style="text-align:center;"><img title="Network Graph with Highlighted Section" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/highlighted_section.png" alt="" /></div>
<p>Once your information has been laid out, you can start exploring and making sense of it. One useful feature of NodeXL is its implementation of dynamic filters, which is something Excel has been sorely lacking for years. For instance, the graph below shows U.S. Senators in 2007, the connecting lines represent two senators who have voted the same way at least 65% of the time, and the color of each circle represents the senator&#8217;s political party (blue for Democrat, red for Republican, and yellow for Independent).</p>
<div style="text-align:center;"><img title="Visualization of U.S. Senate Voting Data" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/us_senate.png" alt="" /></div>
<p>If we want to change how the information is filtered we can simply open the Dynamic Filters dialog box and apply or modify the filters. For instance, here I&#8217;ve used the slider below to modify the filter so it only displays connections between senators who have voted the same way at least 95% of the time:</p>
<div style="text-align:center;"><img title="Percent Agreement Slider" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/percent_agreement_slider.png" alt="" /></div>
<p>Now we&#8217;re down to just a few lines and can see that significantly more Democrats voted the same way at least 95% of the time compared to their Republican counterparts:</p>
<div style="text-align:center;"><img title="Filtered U.S. Senate Voting Visualization" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/us_senate_filtered.png" alt="" /></div>
<p>NodeXL also supports zooming, panning, scaling, and the ability to automatically or manually create clusters of similar data. Below are a couple examples from the NodeXL website to give you a taste of the visualizations that can be created with it.</p>
<div style="text-align:center;"><img title="NodeXL Example" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/nodexl_example1.gif" alt="" /></div>
<div style="text-align:center;"><img title="NodeXL Example" src="http://www.perceptualedge.com/blog/wp-content/uploads/2010/02/nodexl_example2.jpg" alt="" /></div>
<p>NodeXL is currently in beta release, so you might find a few remaining bugs here and there, but if you think network visualizations might be useful for your work, NodeXL provides a great way to get started.</p>
<p>-Bryan</p>
]]></content:encoded>
			<wfw:commentRss>http://www.perceptualedge.com/blog/?feed=rss2&amp;p=680</wfw:commentRss>
		</item>
	</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.130 seconds -->
<!-- Cached page served by WP-Cache -->
