Visual Design & Analysis

Information visualization examples that make you think!

Friday, November 30, 2007

Design & Reach

Following on from my last post, Mike Danziger and I chatted on email & he wrote up some impressions of the InfoVis conference. Stephen Few responded to some of the points, and a couple (1, 2) of subsequent postings, and some other comments (3, 4) have shown that people are interested. Sorry for being so late in responding myself - the day job sometimes gets in the way!

For me, the key contribution has been Pat Hanrahan's. I feel the same way & I'm grateful to him for providing some academic respectability to what would otherwise just be my own opinions. From my own pragmatic software industry perspective, I'd like to say something about how his suggestions could be taken forward.

Delivery mechanisms are key: to appeal to the masses one needs reach. Interactive visualizations must be delivered to people's eyes & to their fingertips. Static images in papers aren't enough: people don't have much time or patience & won't enjoy having to read lots of text in order to learn how the interaction works.

One approach is to put good visualization capability into commonly used tools such as Excel (1). That way people can manage their data themselves. Because the user has the ability to load and edit the data behind the visualizations this means a high degree of skill is needed when crafting the software so it has the necessary flexibility. Each tool has different extension points & platforms. In practical terms this means a software company is forced to choose a very small list of supported environments & work flows.

The more obvious route is to exploit the immediacy and universality of web delivery mechanisms. Thanks to Flash, Silverlight & Java there is a huge audience out there with suitable runtimes. It is good to see more and more experimental visualizations using these. (Though problems with data management are still there of course...)

Reach isn't enough: in order to bring something compelling to people one must embrace designers. Graphic designers, user experience designers, interaction designers, the works! The right kind of designers can keep a visualization clean, useful & informative but also imbue it with style, panache & memorability. There is a design revolution happening now in the software industry & it will sweep up information visualization tools along the way.

The combination of the need for reach and good design is the main reason why I'm so interested in the Adobe platform. Because they already have designers using their tools, they don't need to woo them to new platform. Add a massive install base (flash) and increasingly workable languages (mxml, as3) and it is hard to dismiss. Nice to see I'm not alone in thinking this.

Labels: ,

Thursday, November 8, 2007

Sacramento Thoughts

I got back from the IEEE Visualization conference in Sacramento a few days ago - it was highly enjoyable and I met some great people there.

I've been struggling to come to terms with the quantity of reading I now have to do. I've also found it hard to summarize my thoughts on all that I heard.

I think my personal best paper award would go to Jeff Heer's "Design Considerations for Collaborative Visual Analytics".

On a similar topic, Fernanda Viégas said something that caught my attention: instead of focusing on the classic visualization question of scaling the amount of data being visualized, the Many-Eyes project scales the size of the audience.

However, each data set in the Many-Eyes site is isolated. Processing of the data has to be done in advance in order to bring it down to a manageable size, and data sets do not have any intersection points with each other. (Although they do allow comments to refer to other data sets, along with other navigation aids.)

Classic information visualization research seems to follow a pattern something like this:
* Researcher gets hold of a dataset from somewhere.
* They consider various encodings of it.
* While doing that they achieve some level of domain knowledge.
* They develop an isolated visualization system - this is what they spend most of their time on (I can't blame them - it is the fun bit).
* They achieve some insights of their own which gives them a warm glow.
* Some short evaluation is tacked on to keep the reviewers happy when they get the paper.

From an outsider's perspective:
* In many cases the dataset is considered in isolation from other potentially interesting & relevant data sets.
* The quality of the encodings chosen depends on the knowledge of the researcher, and this can vary quite a bit.
* The system developed tends to be isolated from other applications & systems - that makes it easier to develop. Often there are no multi-user aspects, but this at least seems to be changing.
* Insights almost always are with regard to knowledge gleaned from outside the data set. E.g., a downturn in the number of farmers (in census data) could be explained by increasing agricultural mechanisation (innate knowledge), or the popularity of a certain baby name might coincide with a celebrity (search for 'Celine' here). There is often an implied "cause and effect" hypothesis in these kind of insights.

Going back to Viégas' comments I suspect that the true problem lies in scaling not just the audience - though that of course is important - but scaling on both the number and type of datasets being visualized.

The 'perfect visualization tool' would be able to cope with new data sets being thrown at it. Linkage would be automatically established between elements of the data set (e.g., Joe Bloggs from one data set would be recognised as the same Joe Blogs from another data set). The data sets could have a wide variety of schemas and come from wildly different sources. The various visualizations in the tool would be automagically updated with the relevant encoding of the new data, and new visualizations which have suddenly become appropriate would be displayed. The user would be able to reach many new insights because all the data is cross-referenced and generally speaking most insights come when combining data. Plus the visualization, being perfect, would show those insights clearly.

Mike Cammarano's talk on his work with the dbpedia data was interesting from this angle, in that the data was inherently heterogeneous & extensible. Of course, the Semantic Web research agenda is of interest here too, despite lying outside of information visualization research.

As Matthew Ericson showed, the sheer craft and skill needed to combine data well and communicate it effectively means that it is difficult to see a perfect visualization tool being realised in an automated way. I guess this makes it an interesting research area!

Another aspect of developing web-based social visualizations is that there is much more potential for gathering information about how users actually use the visualizations: server-side logs can be designed to keep track of almost every action. This would lack the rigour of a properly controlled lab experiment, but that would be counterbalanced by the sheer number of possible users, so I'd say there must be huge benefits in this approach. (And of course making sense of the logs could be another data visualization challenge!)

On a separate topic I found Stephen Few's capstone talk rather unsettling - I understand why he is so passionate about designing clear visuals, but sometimes that passion can err on the abrasive side. And that style won't endear the visualization community to the world out there. I also think he underestimates the power of playfulness and fun in reaching out to an audience - come on - Swivel's option to 'bling your graph' is just funny! Another worry is that the very Spartan style of visuals he favours actually imposes an aesthetic in its own right, for all of its good intentions and intelligent rationale. We should accept some people just won't like that aesthetic.

However, his tutorial was a really excellent Tuftean summary of all that is great and good about the subject, so I guess he can be forgiven! And when you see graphics like graphwise (thanks Nathan) you can see how much work there is to be done :-)

Labels: ,

Wednesday, August 29, 2007

Visual Variables

I've been designing some new visualizations recently, and reading around the InfoVis literature for examples to analyse and best practices to follow. I've found myself returning again and again to a diagram in MacKinlay's 1986 paper and various papers which follow it. So much so that I've reworked a version of the diagram and pinned it to the wall behind my desk for quick reference:













It shows a theoretical model for accuracy when performing reasoning tasks with an image. The model was developed empirically but for some comparisons and some analytical tasks experimentation has backed it up. I find it a handy thing to have when working out how visualizations and infographics are put together. And when I have some data that I'm designing an representation for, it helps me choose what visual variables to use.

Labels: ,

Monday, June 25, 2007

Visualization Techniques for Temporal Information

Over the last couple of years I've been collecting various articles and links on temporal information visualisation. I thought it was about time I collated them and tried to put some context and analysis around the various techniques. Here is the article (pdf, 1MB).

Labels: ,

Friday, May 25, 2007

Google Timelines

From Google Blogoscoped, a fascinating timeline feature found in google experimental. Try volcano, blair, web application.

Labels: ,

Thursday, April 5, 2007

Social Graphing

I mused last year whether we'd see Web 2.0 style entity-link-property visualizations, alla swivel, and today macroscopia arrives. The UI builder looks superficially similar to Yahoo pipes, and it uses a radial layout animation very similar to one I mentioned earlier. Try this for size.

The interaction with the graph feels good - the animation preseves context, is smooth (flash based implementation) and I like the graying of content away from the centre. It is only really good for local link exploration though - no zoom and no other layouts mean the big picture remains elusive.

Some simple search functionality might help for navigation too.

I can't help thinking one thing wrong with this is the complexity in getting the data in - it just isn't easy for your average user to grasp. Putting a good step-by-step goal focused UI on this part would make a big difference.

Labels:

Monday, April 2, 2007

A sense of universal scale

Now that's what I call zooming!

(Put sound on and use the mouse wheel...)

Labels:

Friday, March 16, 2007

The latest Jonathan Harris

Universe. Plus a video explanation.

Labels:

Friday, March 2, 2007

Geotime and Crime Mapping

A dynamic visualisation of crimes over time and space - drag the marker along the top & see what happens. Plus some theory. It isn't perfect by any means, but the ideas are nice & it may be interesting to see where they go with it...

Labels:

Monday, February 12, 2007

Flow

This charming flash game made me think whether a 'depth' metaphor could work when exposing 'drill down on information'/'collapse or summarize information' in a visual way.

Labels: ,

Thursday, February 1, 2007

Words, Dictionaries and Thesauri

There are several link-entity browsers out there for words. Here are some I've spotted:
It is interesting to compare their approaches...
  • In terms of delivery, the Flash approach of the thesaurus browser and Visuwords wins hands-down.
  • In terms of keeping one's point of reference (history, animation), Roget2000 fails miserably. The hyperbolic layout kind of works, but it isn't particularly legible.
  • Visuwords is ugly to the point of distraction, and the layouts hide the words sometimes. Good feedback when dragging though.
  • MapMyWord appears to work through image maps, and is responsive until you want to re-centre it.
  • Thinkmap is Java, uses animated radial layout to good effect, and has a plethora of options. Not that I'd actually want to pay for it though!
An honourable mention also has to go to the ObjectGraph Dictionary - great speed and usability through AJAX.

Labels: ,

Friday, January 26, 2007

Places mentioned in this book

Google Book Search now adds a map of places mentioned in the book. Try War and Peace, or some other examples.

Labels: ,

Wednesday, January 24, 2007

Many Eyes

IBM's visualization lab have published the 'Many Eyes' collaborative site, which improves on the ideas of Swivel. After data gets uploaded, you can choose several different visualization paradigms to view it.

Labels:

Friday, January 12, 2007

Justice Mapping

An elegant set of visualizations of 'justice data'.

Labels:

Thursday, January 11, 2007

Information Visualization Introduction

This google tech talk is a great introduction to information visualization theory with particular reference to node-link graphs by a rising star in the field, Tamara Munzner.

Labels:

Friday, January 5, 2007

Radial Layout

A relatively old piece of work, but this radial layout video (25MB) demonstrates how compelling it could be for data exploration. (Context). Worth noting though that it works best with a tree structure with small numbers of non-tree links, which generally are emphasised by the layout. Incremental additions/deletions of nodes are particularly smoothly dealt with.

Labels:

Trois Dimensions

A great demo of 3D mapping from France (go to about 1:30min in), no doubt to be followed in the mainstream by Virtual Earth and Google Earth/Maps sometime...

Labels:

Wednesday, January 3, 2007

Casualties

Another flash based interactive infographic which combines a time line with geographic data - of US deaths in Iraq.

Labels: ,

Thursday, December 21, 2006

Just Bad

Now and again I stumble on things which are just so bad they kind of illustrate a point: proof of the uselessness of 3D: just try to a) navigate and b) read anything. Compelling, it isn't.

Labels:

Tuesday, December 19, 2006

Grokker

It has been around a while, but I thought I'd mention Grokker, being kind of interesting & having some visualization aspects too.

Labels:

Complexification & Processing

Complexification has some fascinating combination of some maths and some visualization using the increasingly popular java based processing library.

Using processing one can package the drawings in a java applet, or there are ways to keep java at the back end while using Flash at the browser end. Interesting.

Labels:

Thursday, December 14, 2006

Swivel

This past month, the swivel phenomenon seems to be growing. The idea: people upload their spreadsheets, graph the data, and share it in a Web 2.0 kinda way.

See for example violent crime versus wine consumption.

Interesting things: no XML in sight - swivel imports .CSV files. Users search by the column headers (no semantics, etc.) See the caveats page for guidelines on importing.

See the video on this page for a more complex variant on the same theme.

If our data format was as simple could we do the same with ELP?

Labels: ,

Wednesday, December 13, 2006

Amazon Browsers for Derek

A couple of entity/link amazon browsers:

http://www.pmbrowser.info/amazon.html
http://www.flowser.com/jp/flowserST.html

The second one is better 'cos it has pretty pictures. Plus I always think Japanese looks good when on ELP diagrams :-)

Labels:

Tuesday, December 12, 2006

Practical Tip

Visualize your hard-drive usage.

Labels:

Cabs in San Francisco

Amazing real time map display of cab journeys in San Francisco.

Labels:

Visualization of Uncertain Information

I found a seminar series on visualizing uncertain information: http://www7.nationalacademies.org/bms/visualization_title_page.html

The slides by David Harris were slanted at large quantities of unstructured data, although no solution seems in sight.

Labels: ,

Musicovery

No idea what the chart means but it is easy on the eye and comes with a soundtrack...
http://www.musicovery.com/

Labels:

Friday, December 8, 2006

Friday afternoon links

Simple but very effective data exploration
http://acg.media.mit.edu/people/fry/zipdecode/

Getting dumped
http://artport.whitney.org/commissions/thedumpster/interface.html

Flags with messages
http://www.brazilianartists.net/home/flags/

Labels:

Some old links

3D Visualization that actually works
http://labs.live.com/photosynth/

Describing a set of tools, uses and roles:
http://www.flickr.com/photo_zoom.gne?id=58299511&context=photostream&size=l

Search Flickr images by drawing
http://labs.systemone.at/retrievr/

Optical Illusion
http://web.mit.edu/persci/people/adelson/checkershadow_illusion.html

Sculptures
http://www.bathsheba.com/sculpt/

A mind-boggling timeline
http://www.futureswatch.org/Timeline.htm

Historical timeline
http://www.adept-plm.com/Newsletter/napoleon.gif

Labels: