appeal to all of the hype as of late inside knowledge science, however I’d argue they’re each secondary to a extra necessary—and often-ignored—part of the sphere.
When coping with knowledge, there are two important steps:
- Processing and analyzing the info to extract significant insights.
- Conveying these insights to others.
The second level is essential and sometimes ignored. The world’s most superior algorithm or useful perception is ineffective if nobody can perceive it. As a knowledge scientist, you should study to convey your insights to others. There’s multiple purpose for this, with the obvious one being that if the appropriate folks perceive the info, the world at giant will profit. Nevertheless, there’s one other equally necessary purpose: It’s typically in describing our findings to others that we uncover errors, extra profound data, or additional areas for exploration.
On this article, we’ll look at a robust and efficient device which may also help obtain the second step above: knowledge visualization. That is the primary in a collection of articles that can take absolute freshmen deep into the realm of information visualization. This primary article is normal and lightweight, meant as an introduction to the sphere as a complete. In later articles, I’ll get into the extra technical elements, ultimately concluding by educating you the right way to construct your individual knowledge visualizations.
With that data, you’ll be armed to deal with your knowledge in new, thrilling methods.
“The best worth of an image is when it forces us to note what we by no means anticipated to see.” –John Tukey
What Counts as a Information Visualization?
Many individuals view knowledge visualization by means of a restricted lens, solely classifying customary graphs, comparable to bar charts, line charts, and the like, as true knowledge visualizations. Seen from this attitude, knowledge visualization didn’t materialize till the center of the 18th century. (We’ll see some examples under.)
Nevertheless, we’d do properly to broaden our minds. Visible transformations of information are in no way restricted to our conventional concepts. They’ve been round for hundreds of years. For instance, right here is the Imago Mundi [1], the oldest recognized map on the earth, found as a relic of the traditional metropolis of Babylon:

This map locations Babylon on the middle and was probably an especially useful gizmo for visualizing what we now formally name geospatial knowledge. It is without doubt one of the world’s earliest knowledge visualizations.
There are a plethora of comparable figures and pictures from numerous historical civilizations—cave work, calendars, stone carvings, even Egyptian hieroglyphics—these are all successfully visible representations of information that have been obscure of their preliminary kind. Viewing these examples as knowledge visualizations leads us to an necessary precept:
At its core, knowledge visualization is nothing greater than taking some knowledge—be it numerical, textual, or in any other case—and making use of a change to signify it visually.
This foundational precept results in a number of associated matters primarily involving the simplest strategies to conduct these transformations, the place efficient loosely interprets to “sincere, straightforward to grasp, and informative.”
Early Examples of Information Visualizations
Now that we have now broadened our views regarding what constitutes a knowledge visualization, allow us to check out some trendy examples. Under is a chart from 1644 developed by Michael Florent Van Langren [2]. It is without doubt one of the earliest graphical representations of what we take into account to be conventional statistical knowledge, depicting estimates of the distinction in longitude between Rome and Toledo.

Let’s take into account a extra concerned instance subsequent—one which straight highlights Tukey’s quote above.
Under is a map of London’s Soho District in 1854 [3]. It was designed by John Snow to be able to decide if there have been any patterns within the cholera outbreak that was debilitating the city on the time:

Wanting towards the middle of the map, we are able to see an exceptionally giant variety of deaths close to the water pump on Broad Avenue. An investigation decided that this pump was contaminated and was a significant reason behind the unfold of the illness.
This instance highlights precisely the precept from John Tukey we famous above: Among the best makes use of of information visualization is to shortly see insights which are troublesome to seek out within the knowledge’s preliminary kind.
Precision and Flexibility
Information visualization is a broad and deep matter that may be approached in some ways. That mentioned, there are two ideas that it’s best to take into account no matter the precise type of knowledge visualization you interact in: precision and flexibility.
knowledge visualization doesn’t attempt to accomplish ill-defined duties, comparable to displaying the essence of or summarizing every little thing necessary a few knowledge set. Statements like these are subjective and basically unattainable to realize.
Somewhat, a great knowledge visualization highlights a particular and well-defined facet of the related knowledge in a means that makes it simpler to grasp for the consumer. You need to at all times articulate precisely what you need to categorical about your knowledge earlier than you even start designing a visualization.
To internalize this precept, it’s useful to recall what the aim of a knowledge visualization is to start with: to show insights from a knowledge set in a transparent and helpful means. We need to make the info simpler to grasp. Being exact ensures we obtain this objective. A visualization that makes an attempt to do an excessive amount of would possibly find yourself complicated the viewer much more. It’s significantly better to provide a visualization which covers much less knowledge in a clearer means. High quality is extra necessary than amount.
Check out the info desk under, which accommodates details about salaries from completely different cities round the US.
Identify | Metropolis | Earnings | Occupation |
---|---|---|---|
Sarah Mitchell | Denver, CO | $72,500 | Advertising and marketing Supervisor |
Jamal Rodriguez | Houston, TX | $58,300 | Electrician |
Priya Desai | Seattle, WA | $91,200 | Software program Engineer |
Thomas Nguyen | Chicago, IL | $64,800 | Nurse |
Which of the next is the higher visualization alternative for the above knowledge?
- A visualization that makes an attempt to simplify the knowledge within the knowledge desk utilizing a bar chart that has names on one axis and salaries on the opposite axis, makes use of coloration to distinguish amongst cities, and makes use of a texture on the bars (dashed traces, diagonal traces, and so forth.) to differentiate amongst careers.
- The identical visualization as above, however this time excluding the majors. In different phrases, a bar chart of names and salaries which colours the bars based mostly on location.
It’s tempting to decide on the primary one, however the reality is, it tries to do an excessive amount of. Higher to show restricted, focused data than to confuse your viewers.
Along with being exact, sustaining flexibility can also be necessary. There isn’t a such factor as an ideal knowledge visualization. There’s at all times room for enchancment, and knowledge visualizations typically grow to be higher with every revision. After all, sooner or later, a knowledge visualization have to be shared with others and serve its goal.
This results in a quandary—how a lot revision is sufficient revision? There isn’t a definitive reply to this query. The method of revising a visualization have to be undertaken with care. Asking too many individuals for recommendation will probably end in a bunch of half-baked, conflicting opinions. Alternatively, publishing the primary draft of a visualization—i.e., not revising it in any respect—is prone to result in a subpar end result.
Though there is no such thing as a excellent answer, there are a number of tips you possibly can comply with:
- Determine 2-3 folks to offer you suggestions in your visualization.
- Strive to make sure your checklist of individuals encompasses the next:
- A reviewer who’s proficient in designing knowledge visualizations
- A reviewer who has a powerful understanding of the info that’s getting used to develop the visualization (e.g., a political scientist for election knowledge)
- A reviewer who’s a part of the meant viewers for the visualization
- Undergo 2-3 rounds of suggestions and revision with this similar checklist of individuals. It will be certain that enhancements to the visualization are steady and logical.
Last Ideas and Wanting Ahead
In some ways, knowledge visualization is akin to writing. Even probably the most prolific and proficient authors have editors, and their books undergo in depth revision earlier than being accredited for publishing. Why? For the easy purpose that good writing is essentially depending on the viewers, and punctiliously curated revision ensures the perfect expertise for the eventual readers of a e book. The identical concept applies to knowledge visualization.
By following these tips, you possibly can make sure you develop a sturdy knowledge visualization which is grounded in finest practices, appropriately shows the info at hand, and is comprehensible for the meant viewers.
They’re the important thing to efficient knowledge visualization, and the muse for superior visualization strategies that will probably be mentioned in future articles. Till then.
References
[1] https://commons.wikimedia.org/wiki/File:The_Babylonian_map_of_the_world,_from_Sippar,_Mesopotamia..JPG
[2] The Visible Show of Quantitative Info, Edward Tufte
[3] https://picryl.com/media/snow-cholera-map-1-cbadea