A Fall ’19 Sampler of Student InfoVis Projects

I have been teaching our graduate class on Information Visualization (CS 7450) since the fall of 2000. Of perhaps all the different projects, initiatives, and activities that I’ve been involved with here at Georgia Tech, this course is the one that I have enjoyed the most and feel closest to.

Every year about 75 students take the class. The main component of their grade, for as long as I can remember it, has been a group project, typically of three or four students. Each team designs and develops a visualization system for a domain of their choosing. More specifically, each team must identify an interesting topic and find a dataset or datasets relevant to that topic. They identify a set of questions that the data may help answer and a high-level goal or purpose for the project. The teams next develop multiple visualization design ideas. Based on feedback from the TAs and myself, they move forward to create a working visualization system. While I specify no required platform for system development, nearly all teams now use d3.

The reason for this article is to publicize some of the top projects developed this fall by our students. I’ve created a webpage with links to the systems for these projects so you can explore each one individually. This fall, the projects split into two main styles. The first is a classic exploratory/analytic system with multiple views and many controls for selection, filtering, and reorganizing the views. The second style was more of a narrative or storytelling presentation, often employing a scrollytelling technique on a web page.


The two most impressive projects from the fall were both of the exploratory system style. The first, shown just above, presents songs and artists from the Spotify music service. Viewers see the breakdown of an artist’s songs along multiple dimensions such as key, tempo, and dance-ability. It also highlights similar songs and even provides short previews of the songs when they are moused over. The second project, shown at the top of this article, focused on a different type of artist, the greatest painters of all time. The visualization presents paintings as the four key colors from each. Viewers can organize the paintings by these colors, by artist, or by the date of the painting. When the viewer selects a painting, the system identifies other similar paintings and also presents characteristics of that artist’s works. I could literally spend hours playing with each of these systems to learn more about the data they’re presenting.

Another project of this style, shown just below, presents information from one of the Democratic presidential debates, highlighting when the different candidates spoke on various key issues and how the public reacted on social media. Yet another project presents information about videos trending on YouTube for the first three months of this year. Finally, one of the most impressive engineering efforts of the term is a visualization that depicts all the details (19 variables) of 46 million parking tickets in New York City! The system uses elasticsearch to manage access to that massive database.


On the narrative/storytelling side, one of my favorites showed coffee beans from around the world and their characteristics. And this one easily could be characterized as an analytical visualization as well because it contains multiple highly interactive components that allow the viewer to explore this rich data set about coffees. Another strong project focuses on the political polarization between the Democrats and Republicans in America politics over the years. Unfortunately, it looks like the two sides are drifting further and further apart. A beautifully done and somber visualization, shown below, illustrates the scale of the Syrian refugee crisis. Two other nice projects each adopting a scrollytelling style present information about the growing scourge of plastics in our oceans and the rising popularity of TED talk videos.


I’ve chosen to highlight a subset of the projects from the class here, but this was only about half of the 19 projects in total that occurred. Others focused on topics such as human trafficking, endangered animals, faculty in computer science departments, and presidential speeches. What a varied set of domains the students addressed this fall!

In the past, I required each team write a short paper about their project. As that grew to feel more and more like a chore, a few years back I pivoted and asked each group to make a video introducing their topic and demonstrating their system. In my eyes, this has been a great success. This past fall, in particular, the production quality of these videos improved immensely. All the teams presented their videos during our three hour final exam period and I came away from the session just amazed at how good they all were. The project summary page also includes a link to the video for the featured projects.

The people deserving all the credit for these projects are the students themselves. For brevity, I haven’t included the names of the individual team members in this posting. You can learn their identities by following the links to the individual projects. (All students have given permission for them to be identified in this manner.)

Impressions from VIS ’18

The VIS ’18 Conference concluded a few weeks ago, and I finally had some time to sit down and pull together a few reflections about this year’s conference. For the second time ever, VIS ventured to Europe, and this year the meeting was held in Berlin, Germany. Attendance was an all-time record with 1256 participants. To handle such a large group, the meeting was held in the Estrel Hotel (shown above) which was rumored to be the largest hotel in Germany. It is located in the eastern part of Berlin, and it was difficult for me not to think about how different circumstances are today from some 30 years ago when few of us would have been able to set foot there.


A clear theme that permeated the conference this year was the emergence of AI, machine learning, and related technologies, and just how visualization might connect to these topics. Many researchers feel that visualization can play a key role in developing “explainable AI” in the future. Pat Hanrahan’s keynote talk at the VDS Symposium perfectly aligned to this theme and was thoughtful and inspirational as usual. He defined analytical thinking as “A structured approach to answering questions and making decisions based on facts and data” and argued for its importance in our daily lives. He also characterized “responsible analysis” as being explainable, understandable, transparent, fair, vetted, and ethical, and communicated his belief that data visualization can and should be an important component of this concept.

In another invited talk at VDS, Kirk Goldsberry of ESPN, formerly with the NBA’s San Antonio Spurs, spoke about his experiences bringing data visualization to sports analytics. He is perhaps most famous for his heatmap visualizations of shot locations in professional basketball. Much of his talk focused on why we don’t see visualization used more in sports analytics. One simple answer he gave was “politics” but he enumerated three more specific reasons: 1. The constraints of media; 2. Sports analysts don’t know how to make visualizations; and 3. Sports executives don’t demand visualizations – it’s simply not a part of their culture. He also argued that visualization experts underestimate how much general managers only care about an answer (“Just tell me how much the house should cost, dude.”) In a very pertinent metaphor, Kirk believes that visualization scientists are good take at take-off and flying the plane, but we need to be better at landing it. He also interjected what was likely my favorite quote of the entire conference when he characterized legendary NBA player and announcer Charles Barkley as “more of a qualitative social scientist.” And I learned from Kirk’s talk that Harvard University has only eliminated one academic department in its history: geography.

This year VIS hosted a day-long VisInPractice event which included many invited talks presented by visualization practitioners. I only was able to see about half the talks and they were terrific. In one, Shan He of Uber presented the company’s Kepler geovisualization toolkit and system. It looked simply fantastic and left me eager for the opportunity to try it out. In another presentation, Lisa Charlotte Rost described her former and newly updated blog review of visualization authoring tools. She analyzed the growing space of tools and highlighted the rise of “data drawing apps” such as Lyra, Data Illustrator, and Charticulator. She did conclude with some thoughts about what all visualization author tools must improve at: better user interfaces, make it easier to build artsy, responsive charts, and make the software act more as a teacher, helping the user to learn the paradigm and tool as one uses it.

I greatly enjoyed the practitioner symposium and lamented the talks during it that I was unable to see. As visualization research becomes more and more focused and narrower in scope, I tend to miss the more design-focused work developing interesting and creative visualizations that we used to see at VIS. New workshops and symposia have sprung up to fill that void, such as OpenVis, Tapestry, Information+, and eyeo. (I’m looking forward to attending Tapestry for the first time next week.)

Beyond these symposia, recently I’ve also enjoyed following on Twitter and through their blogs a number of the stars of the visualization practitioner community, people such as Lynn Cherny, Andy Kirk, Neil Richards, Cole Knaflic, Scott Murray, and John Schwabish. Heck, even though he’s a professor, I’ll lump Alberto Cairo in there too. All these people consistently develop and identify interesting, thought-provoking visualizations, usually grounded in some domain and data set. I’ve found many of the visualizations in their posts to be inspirational in my own work and I also use many of them as examples to show in my visualization classes. I’d really welcome more involvement by these folks at VIS in the future.


One small thread of an idea in visualization design that I noticed throughout the conference was the use of motion and animation. The NY Times’ March ’18 story about effects of racism was described to use the “wandering dots” technique, the Times’ former work on helping to illustrate uncertainty in elections by using a spinning roulette wheel metaphor, and HOPs (hypothetical outcome plots) that illustrate uncertainty by making random draws from a distribution and animating through the resulting different visualizations, all were examples of this idea. While some of these visualizations are now a few years old, it was interesting to me how this idea popped up in different talks throughout the conference.

As for papers that caught my eye, I tend to gravitate toward the InfoVis Conference sessions in general, so most were from there. Just a few of the many that stood out (mostly because of my own personal interests) include:

  • The Draco system for embedding visualization design principles as constraints that can drive the generation of appropriate visualizations for a given data set
  • Efforts to develop new metrics for how users interact with visualization interfaces
  • The VAP system that automatically (drawing from dblp, the vispub data set, and the keyvis data set) generates text profiles, augmented by visualizations, of visualization researchers given just their name
  • Studies of different visualizations’ efficacies on phones and watches
  • Techniques for unifying tables and tables with text in long document viewers
  • A survey and analysis of uses and popularity of visualization dashboards
  • The Charticulator system for creating visualizations from data sets without needing to program
  • The ATOM grammar and toolkit for constructing unit-style visualizations
  • Litvis.org, a site and approach for creating visualization design, explanation, narrative, and reflection notebooks, much as done with Jupyter notebooks for data analysis.

I’ll shamelessly add a plug for two papers in-part from my research group: our work on the low cost ICE-T approach to evaluating the value of a visualization and the Voder system that combines interactive data facts with visualizations to help data analysis and presentation.

Oh, I learned once again a simple maxim for the conference: If you’re going to present a paper about color, be ready for objections after your talk.

Topics receiving focus at the InfoVis Conference are always changing. For fun, I grabbed the conference session titles/topics from both this year and ten years ago to see how things have changed. Below is a list of the sessions, 2008 is on the left and 2018 on the right. Right away, one can see how much the conference has grown, almost doubling the number of sessions over the ten years. Beyond that, there is a core of consistent topics, but new themes are emerging. Two that jumped out to me include interaction with different types of displays (Immersive analytics and Devices: large & small) and the rise of perceptual/cognitive studies and uncertainty (three sessions this year).


Perhaps the topic of most discussion across the entire week was the evolution of the conference itself. A small committee has been studying the possibilities for how the meeting may evolve in the future. It has been noted that the preponderance of different conferences and symposia within the meeting may confuse newcomers who aren’t quite sure where their work fits. This review committee presented the results of their study to the conference during Wednesday’s lunch session. One potential option is to unify more, resulting in one main conference with many subareas underneath it. This seems to be the most popular potential path forward.

Many details remain to be worked out, however, and unfortunately those details (e.g., what are the subareas, what is the new conference named, how does the reviewing work, etc.) are quite challenging. The growing size of the meeting can be viewed both positively and negatively. The growth indicates the popularity and increased interest in visualization, which is terrific. However, that growth also results in more parallel sessions and conflicts, and just an overall busier and more hectic week. (Perhaps the most common sentence I heard uttered during my week there was “Oh, I missed that presentation.”)

Nonetheless, I think that keeping a large central showcase conference for our discipline, much like CHI is for the HCI research community, is likely a good thing. It provides an opportunity for many people to meet and exchange ideas. We may potentially see subareas grow and blossom into their own meetings. For CHI, related conferences such as UIST, CSCW, Multimedia, ISS, and others did just that. We even have seen this to a lesser degree in visualization with the emergence of symposia such as OpenVis, Tapestry, and Information+, as I mentioned earlier in this post. One difference in the visualization community is the presence of geographically-based conferences such as EuroVis and PacificVis that are not focused subareas, but smaller versions of the broad discipline as a whole. Well, it will certainly be interesting to see how things develop over the next few years.

Next year the VIS Conference moves to Canada as it will be held in Vancouver in October. Vancouver is a beautiful city and I would not be surprised if a new attendance record is set yet again.

Finally, I wanted to end with a picture. On Friday afternoon after the conference had ended, some colleagues and I wandered around to a number of the must-see tourist locations in the city. One was Checkpoint Charlie, the infamous crossing point between East and West Berlin during the Cold War. While there, I snapped the photo below. The text reads “You are leaving the America sector”, but I couldn’t help thinking about the irony (?) of the KFC sign just below/above it. Given enough time, I guess things can and do change.



ICE-T @ InfoVis ’18

At the recent IEEE InfoVis Conference in Berlin, my research group collaborated with colleagues on two published papers. This blog entry gives a quick and dirty overview of one of those two papers: “A Heuristic Approach to Value-Driven Evaluation of Visualizations.”


The first project was done with Emily Wall, Meeshu Agnihotri, and Alex Endert here at GT, and Laura Matzen, Kristin Divis, and Michael Haass of Sandia National Lab in New Mexico. Back in 2014 I published a paper at the BELIV Workshop that grew out of some frustrations I’d had with the evaluations one finds in many infovis papers. Often, the evaluation consists of a small set of benchmark tasks done with students at the local university. The tasks themselves typically are low-level, detailed questions about a data set that one would answer using a visualization. While there is nothing fundamentally wrong with that approach, it just seemed that it did not strike to the core of why a visualization might be helpful or what utility it could provide.  I felt that such evaluations, while reasonably assessing the usability of a visualization system, failed to assess the visualization’s true value.

Hence, I developed a formula that attempted to capture the value of a visualization. The formula consisted of four components: the time a visualization saves, the insights the visualization spurs, the essence of the data set it conveys, and the confidence about the data it evokes. This is an oversimplification, but hopefully enough for you get the basic idea. For more details about the approach, you can examine the original paper. Over time, I grew frustrated that this value equation was purely descriptive – there was no way to use it to actually evaluate the value of a visualization. This frustration motivated the project Emily and I presented at InfoVis.

We began the research by seeking to identify more specific characteristics of or statements about each of these components. We surveyed visualization literature and conducted multiple brainstorming sessions and workshops to generate ideas. Ultimately, we developed a hierarchical framework in which each of the four components contains a small set of guidelines, and each guideline contains a small set of heuristics. A visualization then will be rated using these heuristics, and the scores for each individual heuristic accumulate to provide a rating for each component and for the visualization overall. Ideally, a small set of people would rate a visualization using the framework. These people should have a background and experience in data visualization. The approach is designed to be in the family of “discount” evaluation approaches one finds in HCI, much along the lines of Jakob Nielsen’s heuristic evaluation technique.


In our InfoVis paper, we describe a user study in which we had 15 visualization experts evaluate three different visualizations of the same data set using the method. Although the experts expressed some concerns about the technique and the heuristics themselves, ratings across all the experts were quite consistent and aligned with our a priori assessments of the utility of the different visualizations compared to each other. And we used the concerns raised by the experts to refine and clarify some of the heuristics. Thus, the method seems to show promise as a relatively low-cost way of determining a visualization’s potential value or utility.

Finally, we struggled to come up with a name for the approach for quite a while. Ultimately, we took the four lead letters of the value equation’s components (TIEC) and made an anagram of them: ICE-T. To researchers struggling to find an appropriate, helpful method of evaluating their systems, hopefully the ICE-T method will be just as refreshing as a cold iced tea on a warm summer day.

You can learn more about the ICE-T approach in our InfoVis paper, hear and see a replay of the talk we gave in Berlin, or browse the project webpage we have created to help others utilize the technique. Currently, it contains a pdf summarizing the hierarchical value framework, but we are working on an interactive version of the framework (form) that also will generate a spreadsheet of result data and a report about each evaluation.

Voder @ InfoVis ’18

At the recent IEEE InfoVis Conference in Berlin, my research group collaborated with colleagues on two published papers. This blog entry gives a quick and dirty overview of one of those two papers: “Augmenting Visualizations with Interactive Data Facts to Facilitate Interpretation and Communication.”

In the broad field of data analysis, recently there has been an increasing effort to “automatically” generate insights about a data set. Sophisticated techniques from the database and AI communities help generate these insightful observations about the data, usually in a natural language expository form. Now, precisely what constitutes an “insight” is a matter of debate, something that I explored in a previous column. In our research, we choose to use the term “data fact” instead, reserving “insight” for deeper and more meaningful realizations about a data set.

Our paper at InfoVis was the lead effort of PhD student Arjun Srinivasan, with help from Steve Drucker at Microsoft, and Alex Endert and me here at GT. The key contribution of the work is to think of these data facts that can be generated for a data set not as static utterances, but as interactive components of a more comprehensive data analysis system. We built a system called Voder1 that illustrates this principle in action.

When an investigator specifies/creates a chart that visualizes variables of interest, Voder generates data facts corresponding to those variables. As the investigator moves the cursor over the facts, the visualization changes (perhaps just a highlight) to emphasize and help explain the fact being examined. Furthermore, Voder presents alternative visualizations that also illustrate the fact, and it gives the investigator different options in how to embellish the visualization to communicate the fact.


Voder also provides a search capability in which the investigator can type in terms such as variables on the data set or analytic queries (e.g., “correlation”, “outlier”), then Voder generates visualizations and data facts pertinent to the query terms. Thus, the system facilitates a flexible data analysis process that can start with visualizations, with data facts, or with keyword searches, and supports easy, fluid transitions between each of these aspects. Voder also provides a “presentation” mode where interactive data facts and visualizations can be compiled as slide decks or dashboards.

A formative user study of the system with people of varying visualization backgrounds identified a great deal of promise for the approach. Less-experienced participants appreciated the help Voder provided for interacting with visualizations. Experts appreciated that too, but also hoped for deeper observations in the data facts. Attendees that we spoke to after the talk at InfoVis expressed excitement about the potential of the system for assisting visualization literacy and education as well.

You can learn more about Voder in our InfoVis paper, hear and see a replay of Arjun’s talk from the conference, or see a video of the system at the project webpage.


1Voder is a disc-shaped voice-box translation device from Star Trek. It was also Bell Lab’s device that was the first machine to generate human language.

What’s an Insight?

One of the key notions of data visualization is that it can inspire insight about the data being presented. The idea of generating or spurring insights has been a core objective that visualization developers strive to achieve. But just what is an insight? How do we identify the insights that a visualization inspires? This is a tough question that the visualization research community has been grappling lightbulbwith for quite a while.

I had cause to revisit that question late last fall when the topic of our weekly Visualization Group meeting was a paper from SIGMOD ’17, “Extracting Top-K Insights from Multi-dimensional Data”, by Tang, Han, Yiu, Ding, and Zhang.1  In this fascinating project, the research team developed methods to automatically (algorithmically) identify the top insights that can be gleaned from a data set such as sales data over time for a group of products. Note that this research comes from the Database community, which is obviously quite different from the data visualization research community.

To better understand what the developed algorithm does, suppose we have sales records of five different products over a five-year period. Potential insights from that data might be that a particular product’s sales show an increasing trend over time (i.e., the delta or change from year to year is growing), or that another product’s sales ranking within the group is falling each year.

Amidst all the debate within the visualization research community about what constitutes an insight, I was curious to see how Tang et al would characterize one. They describe an insight as “an interesting observation derived from aggregation in multiple steps.” Furthermore, the researchers explain that such insights have two typical usages in business applications, to “provide informative summaries of the data to non-expert users who not know exactly what they are looking for” and to “guide directions for data exploration.”

The heart of the paper is their algorithm for finding the “Best-k” insights from a data set. Needless to say, it is quite complex and simply beyond me to completely follow it, but ultimately it is about identifying insights and quantifying their “interestingness”. Most insights they find seem to take on one of two flavors: “point” insights where values are remarkably different from others or “shape” insights that show rising or falling trends.

The paper contains a case study on car and computer tablet sales data. Their algorithm identified the following example top insights:

  • When measuring the importance of SUV sales for a certain brand, brand F is outstanding number 1.
  • There is a rising trend of SUV’s market share.
  • In 2014, SUV exhibits most advantage over other categories than ever.
  • The yearly increase of tabular sales is slowing down.
  • 2012/04-07’s yearly increase of tablet sales is remarkably lower than ever.

Finally, the authors conduct a user study in which they have data analysts and managers rate the insights found by their algorithm along usefulness and difficulty dimensions. The algorithm fares well on both measures. Additionally, a comparison study of senior database researchers identifying insights via “traditional” methods uncovers the dramatic result that the time taken (average) using SQL was 29.2 minutes, using Excel pivot tables was 14.2 minutes, and using the Best-k algorithm was 0.17 seconds. The machine triumphs yet again!  :^)

I was fascinated by their characterization of data insights and their descriptions of insight characteristics. But how do those notions compare with other communities’ views of insight?

I believe that a very common impression of an insight, one harbored by many people, is as a kind of “a-ha” moment when a person figures out an answer or a solution to a problem that has been simmering for a while. This perception reminds me of the famous scenario where a light bulb goes on over a person’s head while they’re in the shower, a true “Eureka!” moment.

But I don’t feel that’s how the data visualization community most commonly views insight. Chris North actually defined an insight as being an individual observation about data by a person, a unit of discovery.2  He believes that insights are complex, deep, qualitative, relevant, and unexpected. Would the insights found by Tang et al’s algorithm meet those criteria? I’m not sure.

Personally, I have always resonated with the characterization of insights by Chang, Ziemkiewicz, Green, & Ribarsky.3  Their view contrasts with the spontaneous a-ha perception described above. Instead, they believe that insight is much more about knowledge-building and model-confirmation. It is like a substance that people acquire with the aid of systems.

When I hear someone say that a “visualization gave them insights about a data set”, I tend to be thinking along the lines of Chang’s characterization. In fact, my former GT colleagues Ji Soo Yi, Youn-ah Kang, Julie Jacko, and I reflect on insight in an old BELIV workshop paper.4  In it, we focus on the processes that one undertakes in order to gain insight. This frequently occurs in “sensemaking” scenarios. We found four processes through which people frequently obtain insight using visualizations, including provide an overview, adjust, detect a pattern, and match a mental model.

I have always been struck by the importance of context and existing domain knowledge to insights too. A person’s pre-existing knowledge about a data set and its domain has a big influence on what they will consider a data insight. For a data set about wines of the world, the set of insights a novice uncovers may simply be ho-hum background information to a wine connoisseur. When determining insights about a data set, it’s likely safest to assume the person doing the exploration is unfamiliar with the data and its domain, in order to establish a common baseline.

Looping back to the paper by Tang et al, ultimately I’m not sure that I’d describe the statements that their algorithm produces as “insights”. Maybe they’re interesting data facts or data observations, but insights somehow feel to me like deeper understandings of the characteristics and implications of a data set. This in no way diminishes the remarkable achievement of Tang et al. That they can automatically identify salient and useful observations about a data set is quite remarkable.

As we move forward, it will be interesting to see if the different academic sub-communities (cognitive science, databases, KDD, visualization) can come to some shared understanding of just what insight is and how we can better help people find them. Once we do that, then maybe we can start to develop evaluation methods to determine whether particular visualizations actually do a good job generating insights.  I’m also especially excited by systems that will be able to combine techniques from multiple areas – for example, systems that automatically generate insights about a data set, support those insights through illustrative visualizations, and allow analysts to manually explore the data through visualizations to uncover their own unique insights.

1 B. Tang, S. Han, M.L. Yiu, R. Ding, and D. Zhang. “Extracting Top-K Insights from Multi-dimensional Data.” In Proc. of SIGMOD ’17. May 2017, pp.  1509-1524.

2 C. North. “Toward Measuring Visualization Insight.” IEEE Computer Graphics & Applications  26, 3 (May 2006), pp. 6-9.

3 R. Chang, C. Ziemkiewicz, T.M. Green, and W. Ribarsky. “Defining insight for visual analytics.” IEEE Computer Graphics & Applications  29, 2 (March 2009), pp. 14-17.

4 J.S. Yi, Y. Kang, J. Stasko and J. Jacko, “Understanding and Characterizing Insights: How Do People Gain Insights Using Information Visualization”, In Proc. of BELIV ’08, April 2008, pp. 39-44.

Impressions from VIS ’17

The VIS ’17 Conference was held almost two months ago in downtown Phoenix, AZ. This column is my woefully late recap of the meeting with a few reactions and thoughts about how it went this year. I was so busy upon returning to school after the conference that I just kept putting off writing this. (If I ever consider being an AC for the CHI Conference again, someone please slap me.) Now that final exams are almost here, I’ve finally gotten a little chunk of time to pull this together.

I really enjoyed the location of the conference this year. I’d never been to Phoenix before, so I wasn’t sure what to expect. The conference was held at the big downtown convention center with a lot of hotels nearby. Even though it’s in the middle of the city, it didn’t feel like that. It was relatively quiet in the surrounding area and it certainly was easy to get around. Plenty of restaurants were nearby too. One night the streets were buzzing as the Diamondbacks beat the Rockies in the NL Wild Card game, and all the fans streamed out afterwards. The home stadiums of the Diamondbacks (baseball) and Coyotes (ice hockey) are close by, and add to the atmosphere of this part of the city.

The convention center itself is quite large. As VIS has grown, we now must use facilities like it just to be handle the number of attendees. The rooms for paper presentations also were huge, perhaps even a little too big. Of course, we’re at the mercy of the layout of the venue on this, and no one wants meeting rooms that are too small. However, the sessions often felt a little sterile and impersonal to me this year. It seemed like relatively few questions were asked after talks, and I wonder if the room size and atmosphere somehow contributed to that, even a little.

mirrorThree topics/themes stood out to me this year. The first one was data science. In workshops, tutorials, and papers, the topic was everywhere. It seems like VIS is just mirroring what we’re seeing throughout academia now as more schools create Data Science degrees, programs, and even in some cases, departments. Visualization is only one piece of data science too, and sometimes a piece that is overlooked. Machine learning is clearly a large component of the data science equation, and it was ever present at the conference. It seemed like half of the VAST papers were about interfaces to machine learning algorithms and systems.

The second theme that grabbed my attention, particularly at InfoVis, was the growing presence of evaluation-focused papers. I guess this is to be expected – As our area matures and it becomes tougher and tougher to come up with new visualization techniques and systems, it shouldn’t be surprising to see more evaluation papers show up. InfoVis seems to feel a little more like CHI every year to me. (Not sure how I feel about that.)

The final topic I noticed this year was a simple one, word clouds. I couldn’t believe how many papers were about them! OK, OK, maybe that’s an exaggeration, but there was one paper session that seemed to be all about them. While they can be great for advertisements and fun, I always remember Jacob Harris’ great column and quote: “Every time I see a word cloud presented as insight, I die a little inside.” Anyhow, I did like the EdWordle paper and especially the interactive demo at http://www.edwordle.net/.

While I enjoyed many papers at the conference, a few stood out to me. Sandia National Lab’s work developing a data visualization saliency model is fascinating. The computer vision community has good models that can predict where people will look within a picture, that is, what parts of the picture will first draw a person’s attention. The Sandia team is working on developing a similar model for predicting the parts of an abstract data visualization that will draw focus. This model has some very different heuristics than what one finds with natural, photographic images.  I also really enjoyed Jorge Poco’s talk and demo about extracting color maps form bitmap images of visualizations. It was fantastic how he and his colleagues can identify color legends and ultimately allow a person to change them, which would then be reflected in the image. I also enjoyed Dragicevic and Jansen’s replication study of whether charts persuade people to trust textual arguments more, and Lam, Tory, and Munzner’s paper about the challenges of moving from high-level analysis goals to low-level analysis tasks.

Giorgia Lupi’s closing capstone talk on “Data Humanism” was fantastic as well. Giorgia is one of the two correspondents in the Dear Data series of visual postcards about their lives. She sees data visualization becoming much more personal in the future and she advocates that people explore and draw with data to discover what it holds. Giorgia’s column in Medium is a companion to and highlights the key points from her capstone talk.

I certainly missed one thing from many of the talks this year, demos. Sitting in the few presentations that had one reminded me how well a demo can make the ideas of a paper more concrete and illustrate potential applications of the research. Jarke van Wijk’s papers of the past stand out in this respect to me – So often I remember thinking about them, “Wow, that’s cool.” Here’s hoping that more authors and presentations include demos in the future.


Attendance was down a little at the conference this year. I believe that just over 1000 people attended, while the previous few years were up in the 1200-1300 range. I’m hoping this was a momentary blip, perhaps due to the location (Phoenix) being a little out of the way for many people. I certainly see visualization continuing to grow as a topic, so I fully expect VIS to keep growing as well.

With that said, I do have mixed feelings about one side effect of the growth of the conference. Back quite a few years ago, the InfoVis Symposium was a single track. All attendees who were interested in that topic attended all those sessions together and effectively shared the same experience during the week. (The SciVis Conference, then called just “Visualization”, had multiple tracks due to its larger size, but I always stayed at InfoVis.) With today’s configuration of multiple tracks, panels, journal papers, and the addition of VAST, attendees are torn between and eventually scattered about many possible sessions at any one time, and they tend to gravitate their own existing interests. Papers that are up against other popular topics may receive relatively little traffic. The single track/shared experience model of the past promoted more exposure to papers and topics outside of a person’s comfort zone. I definitely feel that the single track helped our community prosper and grow. Its loss is an inevitable consequence of growth, which also has its benefits, but sometimes I long for the “all in it together” days of the past.

Next year we’re on to Berlin, VIS’s second trip outside of the United States, in what should be an exciting meeting. Before that, the AVI and EuroVis conferences fall in back-to-back weeks late next spring in Italy and the Czech Republic, respectively. Europe clearly will be the hub of academic data visualization research in 2018!

Next column: Some thoughts about insights from visualization.


Impressions from EuroVis ’17

I recently returned from EuroVis ’17 in Barcelona, Spain. The conference was held at the Universitat Politècnica de Catalunya (UPC) which is close to Camp Nou, the home stadium for Barcelona’s famous soccer team, in the suburbs outside the city center. It is a pleasant and relatively quiet area of the city compared to the bustling La Rambla, Gothic quarter, and beachfront. It was my first time ever in Barcelona, and I had heard so many great things about the city, so I was eager to visit.

EuroVis is similar in scope to IEEE VIS, the but the three main research areas of information visualization, scientific visualization, and visual analytics are woven together into one program as opposed to the three conferences you see at VIS.  The conference is much smaller than VIS — this year just over 300 people attended. Typically, at any time of the meeting, there are about three sessions occurring in parallel. Beyond regular papers, EuroVis hosts the STAR (State-of-the-Art Reports) presentations as well. Think of them as in-depth surveys of specific subareas of visualization. These reports now appear as papers in the journal Computer Graphics Forum, as do the full research papers in EuroVis.

The conference received 170 paper submissions this year and 46 (27%) were accepted for presentation. Of the traditional five visualization paper types, “Algorithm” led with 74 submissions, followed by “Design study”-52, “Evaluation”-20, “Theory”-13, and “System”-11.  The Algorithm and Design Study areas also had the highest acceptance percentages at 32% and 27%, respectively.


In addition to full papers, EuroVis takes short paper (four pages of content plus a page of references) submissions, typically for work that is newer and still developing. This year the conference received 64 short paper submissions and accepted 30. Each of these papers is published as an archived conference paper and it receives a 15-minute talk slot at the conference, so researchers definitely should consider this track in the future. The conference also accepted 35 posters for presentation during the week.

If I had to think of one word to describe the conference this year, it would be “Hot”. No, by that I don’t mean that the papers were dynamic and sizzling, although there were many good presentations. I’m simply referring to the temperature!  Every day the high temperature was close to 90º F, and there wasn’t a cloud in the sky the whole week.  Typically, the most valuable commodity at our conferences is good wireless service. Instead, this year it was air conditioning and shade. But hey, I’ll take that anytime over clouds and rain. Just think of it as good practice for VIS this fall in Phoenix.

The conference began with a timely and fascinating keynote talk by Fernanda Viégas and Martin Wattenberg of Google. They discussed many ways that machine learning and visualization are connecting and benefiting each other. Martin and Fernanda showed a number of examples, both from their work and others, of how visualization can help people better understand what is going on (beyond the black box, so to speak) in machine learning. Their talk was complemented by Helwig Hauser‘s closing capstone that examined how visualization is moving onto larger and larger data sets. Up front, he pondered what problems our community has “solved” in the last 25 years. While it may be difficult to think of many, he rightfully also asked when is a problem really ever “solved”? Developing “sufficient” solutions to a bevy of problems simply may be good enough and may be an indicator of good progress. He provided many examples where visualization has done just that.

I saw many nice presentations at the conference and was trying to come up with a theme or two that emerged, but I had a tough time doing so. Perhaps one broad theme I observed was many papers dealing with the HCI aspects of visualization. Topics ranging from evaluation to interaction to storytelling all seemed to have a strong presence this year. Another nice set of papers concerned text and document visualization as well.


EuroVis traditionally hosts a nice conference dinner on Thursday evening.This year it was at a restaurant on Montjuic, a mountain (actually more of a hill) on the southwest side of the city. The restaurant’s deck afforded a beautiful view down onto the city. The conference organizers also graciously sponsored a guided tour of the famous Sagrada Familia basilica in downtown Barcelona on Wednesday evening. The church is simply stunning both inside and out, and has become an iconic landmark for the city.

One of my favorite aspects of EuroVis is that the conference provides lunch for attendees there at the conference site. Not having to trudge offsite to a restaurant simply gives more time to sit and talk with fellow attendees, old friends, and new acquaintances. The smaller size of EuroVis compared to VIS also makes it easier to find colleagues. All these things combine to provide a little more relaxed lunchtime. I think my lunch conversations were my favorite aspect of the conference this year. It was great hearing what so many friends are working on currently.

In a lucky coincidence, my home university, Georgia Tech, participates in a cooperative study-abroad program with UPC that hosted EuroVis.  Our faculty spend the summer there and teach our courses to our own students who also travel there for the term. My fellow Interactive Computing faculty member and good friend Mark Guzdial was literally teaching classes in the same buildings in which EuroVis was occurring. He even was able to drop in and hear my presentation at the conference. IC PhD student Barbara Ericson is teaching the undergraduate infovis class there this summer too. She asked me about giving a guest lecture while there, but I figured that I’d take a break from the teaching.  :^)

If you haven’t submitted a paper to or attended EuroVis yet, I strongly encourage you to do so. I hadn’t attended until about five years ago, but now I try to make it back as often as I can. The paper quality is excellent and it’s usually hosted at a beautiful European city. Next year’s conference is in Brno, the second largest city in the Czech Republic. (With VIS ’18 in Berlin, apparently they didn’t take my suggestion that EuroVis should be in New Orleans, LA.) Just be on the watch out for dragons that look like alligators.