Thanks for taking the time to read my thoughts about Visual Business Intelligence. This blog provides me (and others on occasion) with a venue for ideas and opinions that are either too urgent to wait for a full-blown article or too limited in length, scope, or development to require the larger venue. For a selection of articles, white papers, and books, please visit my library.

 

Data Visualization and the Blind

September 16th, 2013

Recently, I received an email from a fellow named Mark Ostroff who has written a guide to designing “accessible” content using the Oracle Business Intelligence Suite (OBIEE). In particular, the guide addresses issues regarding impaired vision, such as colorblindness and total blindness. Despite the fact that Mark began by saying that he and I “could be ‘twins separated at birth’ in our orientation about business intelligence,” by the second email in our conversation it became clear that he had a bone to pick. He accused me of shirking my responsibility by not teaching people to design information displays in ways that are accessible to the blind—dashboards in particular. Actually, his accusation was a bit harsher. He suggested that, by failing to teach people to design dashboards in ways that were accessible to the blind, I was encouraging my clients to break the law. Mark’s bold accusation prompted me to write about this issue.

I’ll begin by stating my fundamental position: a dashboard that is accessible to the blind is a contradiction in terms. “A dashboard is a visual display of the most important information needed to achieve one or more objectives, consolidated and arranged on a single screen so the information can be monitored at a glance” (Few, 2005). No forms of data visualization, not just dashboards jam-packed with graphics, can be made fully accessible to someone who is blind. I am not insensitive to the needs of people who are visually or otherwise impaired. I am merely pointing out what anyone who understands data visualization knows: no channel of perception other than vision can fully duplicate the contents of graphs. Similarly, what someone can communicate through the audio channel in the form of music cannot be fully expressed visually. If it could, why bother performing or recording music? Why not just distribute the written score? Vision is unique in its abilities to inform and enable thinking. Those who lack vision can develop their other senses to compensate to an amazing degree, but never in a way that fully duplicates the visual experience.

The information that is displayed in a dashboard can and should be presented to people who are blind in a different form when needed. Despite Mark’s bold challenge, current laws regarding accessibility require some organizations—mostly government—to provide the information contained in something like a dashboard in a way that is accessible to the blind, not necessarily to make the dashboard itself accessible. Unfortunately, an alternative form of presentation will not convey all of the information contained in a well-designed dashboard and it won’t communicate the information as efficiently, but if someone who is blind needs the information, it behooves us to provide a reasonable, even if imperfect, alternative. The alternative, however, will not be a dashboard. By definition, a dashboard is a visual display, because the visual channel provides the richest and most efficient means of presenting information for monitoring purposes, which no other channel can match—not even close. If airlines were required by law to provide flight-phobic customers with an earthbound form of transportation, that alternative would be called a train or a bus, not an airplane. In like manner, a means of monitoring that uses braille or a screen reader as its medium should not be called a dashboard. There’s enough confusion about the term already. Let’s not muddy it further.

When quantitative information is presented graphically, it offers the following advantages over written or spoken words and numbers:

  1. Patterns in the values are revealed
  2. Series of values (e.g., 12 months’ worth of sales revenues) are chunked together into visual objects (e.g., a line in a line graph), which makes it possible for us to see the entire series at once and compare it to other entire series of values, thus augmenting the capacity of working memory
  3. Much more information can be presented in the limited space that’s available on the page or screen
  4. The visual cortex processes the graphical information in parallel and more efficiently than the slower, sequential process that’s required for language processing

Data visualization is not only useful, it is finally being recognized as essential. It’s hard to imagine how any other channel of perception will ever be able to provide viable alternatives for these advantages of vision. It certainly isn’t possible to come close to doing this now.

I support the Americans with Disabilities Act (ADA). The ADA became law to prevent discrimination against people with disabilities. It does not, however, heal disabilities. It cannot give sight to the blind. It can require that organizations remove roadblocks to equal rights for those with disabilities and accommodate them in reasonable ways, but it should never try to equalize the playing field between those with sight and those without by forcing those with sight to wear blindfolds. Unfortunately, some efforts to expand accessibility venture into this territory, and I find that intolerable.

Mark seems to believe that all dashboards should be designed so that every bit of information is accessible to a screen reader to accommodate the needs of those without sight. To do this, a great deal of information would have to be added to dashboards and much of it would have to be expressed in inferior ways to make the contents of a dashboard accessible to a screen reader. Despite Mark’s good intention, this would result in dashboards unworthy of the name. The experience of those with sight would be unnecessarily compromised to a costly degree. I say unnecessarily, because the needs of the blind would be better served by a separate display that was designed specifically for a screen reader without compromising the design of the original dashboard. This approach, rather than the way that Mark advocates, would result in less time, effort, and cost. We should approach accessibility intelligently. What might work for a general purpose website might not work for a dashboard. One size definitely does not fit all.

It was hard for me to imagine what Mark had in mind as an accessible dashboard, so I downloaded his guide to take a look. I quickly learned that his idea of a dashboard is quite different from anything that I would qualify as such. Here’s an illustration from the guide:

What he calls a dashboard looks a lot like an online report with a couple of tables on it. A few graphs do appear in the guide, and Mark suggests that they should be made accessible to those who are colorblind in the following manner:

That’s right—according to the guide, crosshatching should be used in addition to colors. Crosshatching can create an annoying shimmering effect known as moiré vibration. This affects people who are colorblind as much as anyone. What this recommendation fails to take into account is the fact that people who are colorblind can see color (except for extremely rare cases of complete color blindness), they just can’t discriminate particular colors, primarily red and green. Avoiding combinations of colors that those who are colorblind cannot discriminate solves the problem without resorting to the scourge of crosshatching.

Despite a search, I failed to find anything in the accessibility guide that explained how information contained in graphs (i.e., images) and thus inaccessible to screen readers could be communicated to those without sight. Text descriptions can be attached to a graph that can be accessed by screen readers, but those descriptions would not contain any information about the values in the graph. Apparently, a dashboard that is accessible to the blind would need to eliminate graphics altogether. As I said before, the result would not be a dashboard. When accessibility to information in dashboards is needed by those who are blind, it currently works best to give them an alternative that displays text and tables of values formatted for easy accessibility by screen readers. A table would lack some information, such as patterns of change and the means of comparing entire series of values, but no automated presentation of the data that isn’t visual could achieve that. At best, someone could write a description of the patterns and summarize the story contained in the graph with words, but that would require human intervention, which cannot be automated—at least not yet.

We should be concerned about accessibility to information, not only for those with disabilities. Good design makes information accessible. It is a sad fact of life, however, that everything cannot be made equally accessible to everyone. People differ in ability and experience. Accessibility is achieved by understanding these differences and designing communications in a way that takes them into account. Accessibility is not achieved by slighting one audience in an attempt to meet the needs of another. So far, the business intelligence (BI) industry in general has not taken even the shared needs of humans into account, let alone the unique needs of particular groups. I’m not surprised that Oracle’s attempt to accommodate the needs of the visually impaired fails to exhibit thoughtful design. Oracle’s approach to accessibility so far is simpleminded, and certainly is not worthy of the name “business intelligence.”

Take care,

Signal Detection: An Important Skill in a Noisy World

August 12th, 2013

This summer I’ve been spending most of my time working on a new book. The current working title is Signal. As the title suggests, this book will focus on analytical techniques for detecting signals in the midst of noisy data. And guess what? All data sets are noisy. In fact, at any given moment, most of the data that we collect are noise. This will always be true, because signals in data are the exception, not the rule.

Signal detection is actually getting harder with the advent of so-called Big Data. By its very nature, most Big Data will never be anything but noise. Collecting everything possible, based on the Big Data argument that the costs of doing so are negligible and that even data that you can’t imagine as useful today could become useful tomorrow, is a dangerous premise. The costs of collecting and storing everything extend far beyond the hardware that’s used to store it. People already struggle to use data effectively. This will become dramatically harder as the volume of data grows. Finding a needle in a haystack doesn’t get easier as you’re tossing more and more hay on the pile.

Most people who are responsible for data analysis in organizations have never been trained to do this work. An insidious assumption exists, promoted by software vendors, that knowing how to use a particular data analysis software product “auto-magically” imbues one with the skills of a data analyst. Even with good software—something that’s rare—this is far from true. Just as with any area of expertise, data analysis requires training and practice, practice, practice. Because few people whose work involves data analysis possess the required skills, much time is wasted and money lost as analysts pore over data without knowing what to look for. They end up chasing patterns that mean nothing and missing those that are gold. Essentially, data analysis is the process of signal detection.

Data that do not convey useful knowledge are noise. When data are displayed, noise can exist both as data that don’t provide useful knowledge and also as useless non-data elements of the display (e.g., irrelevant visual attributes, such as a third dimension of depth in bars, meaningless color variation, and effects of light and shadow). Both sources of noise must be filtered to find and focus on the signals.

When we rely on data for decision making, what qualifies as a signal and what is merely noise? In and of themselves, data are neither. Data are merely facts. When facts are useful, they serve as signals. When they aren’t useful, data clutter the environment with distracting noise.

For data to be useful, they must:

  • Address something that matters
  • Promote understanding
  • Provide an opportunity for action to achieve or maintain a desired state

When any of these qualities is missing, data remain noise.

Signals are always signs of something in particular. In a sense, a signal is not a thing but a relationship. Data become useful knowledge of something that matters when they connect understanding to a question to form an answer. This connection (relationship) is the signal.

As I work on this book to define the nature of signals and to describe techniques for detecting them, I could benefit from your thoughts on the matter. In your experience, what data qualify as signals? How do you find them? What do you do to understand them? What do you do about them once found? What examples have you seen in your own organization or others of time wasted chasing noise? What can we do to reduce noise? Please share with me any thoughts that you have along these lines.

Take care,

A new edition of Information Dashboard Design

July 23rd, 2013

Just in case you haven’t already noticed, the new edition of Information Dashboard Design is now available!

New chapters have been added that focus on the following topics:

  • Fundamental considerations while assessing requirements
  • In-depth instruction in the design of bullet graphs
  • In-depth instruction in the design of sparklines
  • Critical steps that you should take during the design process

Examples of graphics and dashboards have been updated throughout the book and many new examples have been added, including a few more of dashboards that are well designed. In total, approximately 30% more content has been added to the book. It has been a labor of love that I hope you find useful.

Take care,

technology isn’t spelled with a capital “T”

June 26th, 2013

I recently read the most thorough, thoughtful, and cogent treatise on technology that I’ve ever encountered: To Save Everything Click Here: The Folly of Technological Solutionism, by Evgeny Morozov.

My attraction to this book is not without bias. Morozov seems to view technology—its potential for both good and ill—much as I do, but the technologies that reside within his purview, the depths to which he’s studied them, and the disciplines on which he draws to understand them, exceed my own. His approach and grasp are those of a philosopher.

Morozov decries technological solutionism.

Alas, all too often, this never-ending quest to ameliorate—or what the Canadian anthropologist Tania Murray Li, writing in a very different context, has called “the will to improve”—is shortsighted and only perfunctorily interested in the activity for which improvement is sought. Recasting all complex social situations either as neatly defined problems with definite, computable solutions or as transparent and self-evident processes that can be easily optimized—if only the right algorithms are in place!—this quest is likely to have unexpected consequences that could eventually cause more damage than the problems they seek to address.

I call the ideology that legitimizes and sanctions such aspirations “solutionism.” I borrow this unabashedly pejorative term from the world of architecture and urban planning, where it has come to refer to an unhealthy preoccupation with sexy, monumental, and narrow-minded solutions—the kind of stuff that wows audiences at TED Conferences—to problems that are extremely complex, fluid, and contentious…Design theorist Michael Dobbins has it right: solutionism presumes rather than investigates the problems that it is trying to solve, reaching “for the answer before the questions have been fully asked.” How problems are composed matters every bit as much as how problems are resolved. (pp. 5 and 6)

This book exposes the threat of solutionism and proposes healthier ways to embrace and benefit from technologies.

The ultimate goal of this book…is to uncover the attitudes, dispositions, and urges that comprise the solutionist mind-set, to show how they manifest themselves in specific projects to ameliorate the human condition, and to hint at how and why some of these attitudes, dispositions, and urges can and should be resisted, circumvented, and unlearned. For only by unlearning solutionism—that is, by transcending the limits it imposes on our imaginations and by rebelling against its value system—will we understand why attaining technological perfection, without attending to the intricacies of the human condition and accounting for the complex world of practices and traditions, might not be worth the price. (p. xv)

If you’ve spent much time listening to or reading the words of Silicon Valley’s prominent spokespersons (Kevin Kelly of Wired, Mark Zuckerberg of Facebook, Eric Schmidt of Google, to name a few) you might have noticed that they tend to speak of technology as if it were spelled with a capital “T.” For them, Technology is a sentient being with purpose that, much like the God of evangelicals, has a wonderful plan for our lives. It is our job as believers to embrace Technology and let it lead us to the promised land, for it exceeds us in wisdom and power, and is unquestionably good. I’ve provided training and consulting services for many of the technology companies that preach this gospel. During these engagements, I do my best to moderate their techno-enthusiasm and point out that technologies are just tools that provide benefit only when they are well designed, capable of helping us solve real problems, and ethically used. We have choices when we approach technologies, and we should make them thoughtfully.

Morozov addresses information technologies of all types and critiques them incisively from the perspective of history and a breadth of disciplines. Even such a given as Moore’s Law, which technologists often cite as the basis of their position, is revealed as a failed hypothesis—hardly a law.

Morozov seems to share my concerns about Big Data. Regarding the popular new trend of capturing and storing everything, he writes, “Where there is no reflection about what ought to be preserved, the records—no matter how comprehensive—might trigger fewer challenging questions about the relative significance of recorded events; the enormity of the archive might actually conceal that significance.” (p. 278) In opposition to those who fail to see the connection between the technologies of today and those of the past, he writes:

Contrary to his [David Weinberger of Harvard's Berkman Center] claim that “knowledge is now property of the network,” knowledge has always been property of the network, as even a cursory look at the first universities of the twelfth century would reveal. Once again, our digital enthusiasts mistake impressive and—yes!—interesting shifts in magnitude and order with the arrival of a new era in which the old rules no longer apply. Or, as one perceptive critic of Weinberger’s oeuvre has noted, he confuses “a shift in network architecture with the onset of networked knowledge per se.” “The Internet” is not a cause of networked knowledge; it is its consequence—an insight lost on most Internet theorists. (p. 38)

Technologists (especially technology vendors) use the term “revolution” much too loosely. What qualifies as revolutionary? Morozov argues that, “In order to be valid, any declaration of yet another technological revolution must meet two criteria: first, it needs to be cognizant of what has happened and been said before, so that the trend it’s claiming as unique is in fact unique; second, it ought to master the contemporary landscape in its entirety—it can’t just cherry-pick facts to suit its thesis.” No recent so-called revolution in technology fails to meet these criteria more severely than Big Data.

I don’t agree entirely with everything that Morozov presents in this book, but at no point did I find his reasoning unsound or uninformed. He has opened my eyes to a few issues that fall outside of my primary spheres of interest, some of which have caused me to lose a little sleep, especially ways in which technological solutionism is influencing politics. While it is true that our political systems can be improved, the notion that we can “ditch politics altogether and hope that technology—especially ‘the Internet’—can rid us of problems that politics can no longer solve or, in a milder version, that we can replace politicians and politics with technocrats and administration” is frightening. (pp. 128 and 129) “Fixing politics without first getting a thorough understanding of what it is and what it is for is still a very dangerous undertaking…Political thinking, as well as political morality, needs to be cultivated; it doesn’t occur naturally—not even to geniuses in Silicon Valley.” (p. 139)

Technologies are important. They give us opportunities to extend our reach and improve our world, but they also give us opportunities to do the opposite. Morozov understands this. He is not a Luddite; he’s a responsible technologist. I recommend that you consider what he has to say.

Take care,

Big Data and the NSA

June 18th, 2013

In a recent blog post titled “Big data NSA spying is not even an effective strategy,” Francis Gouillart raised concerns about Big Data that are very much in line with mine. Gouillart’s is a refreshing and rare voice of sanity. He’s been around long enough to recognize marketing hype when he sees it, and as an independent thinker with ethics, not a shill for technology vendors, he is one among few who are speaking the truth. Here’s a sample:

The evidence for big data is scant at best. To date, large fields of data have generated meaningful insights at times, but not on the scale many have promised…Yet, for years now, corporations and public organizations have been busy buying huge servers and business intelligence software, pushed by technology providers and consultants armed with sales pitches with colorful anecdotes such as the Moneyball story in which general manager Billy Beane triumphed by using player statistics to predict the winning strategies for the Oakland A’s baseball team. If it worked for Billy Beane, it will work for your global multinational, too, right? Well, no.

The worship of big data is not new. Twenty-five years ago, technology salespeople peddled data using an old story about a retailer that spotted a correlation between diaper purchases and beer drinking, allowing a juicy cross-promotion of the two products for young fathers. Today, most data warehouses are glorified repositories of transaction data, with very little intelligence.

Working with multinationals as a management consultant, I have chased big data insights all my life and have never found them. What I have learned, however, is that local data has a lot of value. Put another way, big data is pretty useless, but small data is a rich source of insights. The probability of discovering new relationships at a local, highly contextual level and expanding it to universal insights is much higher than of uncovering a new law from the massive crunching of large amounts of data.

Read Gouillart’s article in full and pass it on. It’s time to usher in a quiet voice of sanity in this noisy, naive world of “more is better.”

Take care,