Design

A retelling of the book "Visual Display of Quantitative Information": errors, graph formats

A retelling of the book "Visual Display of Quantitative Information": errors, graph formats

Learn: The Profession of UX/UI Designer

Find out more

Errors in Graphs

Since the late 1960s, graphical representation of data has been perceived not only as a design element, but also as an important tool for information analysis. However, even today you can encounter distortions and manipulations with graphs. One example is Comparison of the amounts for six months of 1978 with the annual figures for 1976 and 1977. Such techniques can be misleading and distort the true perception of the data. It is important to critically evaluate the graphs presented and consider the context to avoid drawing incorrect conclusions.

Image: The New York Times, 1978 / The American Statistician, May 1984, Vol. 38, No. 2 / Edward R. Tufte

The graph from the Day Mines report shown here lacks a baseline, making it difficult to assess losses. For example, 1973 begins with a loss of -4,200,000, while 1972 has a loss of -4,400,000. This lack of a clear basis for comparison makes it difficult to analyze financial results and can be misleading when interpreting the data. It is important to consider the need for clarity in the presentation of information to more accurately evaluate and analyze financial performance.

Image: Day Mines, Inc. 1974 Annual Report / Edward R. Tufte

One-dimensional data cannot be adequately compared across two dimensions. This approach is inefficient and can lead to frequent errors in data interpretation. For more accurate data analysis and visualization, it is recommended to use multivariate methods that take into account various aspects and relationships between variables. Thus, the use of more complex models significantly improves the accuracy and reliability of results. Oil prices increased by 454%, driven by an increase in barrel volumes. This is causing significant changes in the market and affecting the economic situation in various countries. Oil prices continue to rise, raising concerns among analysts and energy experts. It's important to monitor these trends to understand how they might impact consumers and businesses overall.

Image: Time, April 9, 1979 / Edward R. Tufte

A common mistake in calculating inflation is to misrepresent the value of money. Although the value of money is a one-dimensional quantity, this graph illustrates it in two dimensions. If the area of ​​a bill truly reflects its purchasing power, then the 1978 dollar should be twice as large as shown in the graph. A proper understanding of inflation requires taking into account many factors, including changes in the purchasing power of a currency and real consumer spending. This is important for forming an adequate picture of economic realities and evaluating financial decisions.

Image: Washington Post, 1978 / Edward R. Tufte

The chart should focus the reader's attention on the data, so it is important to exclude everything unnecessary. Edward Tufte uses the analogy of printing ink—saving on it shouldn't compromise informational content. While he's referring to printed media, this principle is also relevant to digital charts. Effective charts should be simple and clear, allowing users to quickly digest information and draw conclusions based on the data presented. Reducing unnecessary elements and emphasizing key aspects facilitates better understanding and analysis of the information.

In his map of imports and exports between England and North America, William Playfair overemphasized the design, resulting in excessive ink consumption on elements that didn't contribute to the information's conveyance. This negatively impacted the map's functionality, which was intended to clearly depict trade flows between regions. It is important that visual elements work to improve the perception of data, and not distract from the main purpose of presenting information.

Image: William Playfair / The Commercial and Political Atlas, 1786, 3rd edition 1801 / Edward R. Tufte

This graph presents data on Imports and exports of Denmark and Norway. The chart's information content has increased significantly, despite retaining the same data format. This allows for a more accurate analysis of trade flows between countries and the identification of trends in their economic relations.

Image: William Playfair / Edward R. Tufte

This graph shows that most ink is used for various purposes, but not for transmitting data.

Image: Stanley Kelley, Jr., Richard E. Ayres, William G. Bowen, Registration and Voting: Putting First Things First, American Political Science Review, 61 (1967) / Edward R. Tufte / Graphics Press

The next iteration of the graph shows a significant increase in the data-to-ink ratio. This improvement highlights the importance of optimizing information visualization to facilitate better data comprehension and analysis. Effective use of ink in graphics helps highlight key points, making the data clear and accessible.

Image: Stanley Kelley, Jr., Richard E. Ayres, William G. Bowen, Registration and Voting: Putting First Things First, American Political Science Review, 61 (1967) / Edward R. Tufte / Graphics Press

It is impossible to delete data from this electroencephalogram without losing information. Please note the scale in the lower right corner of the block. This scale plays an important role in interpreting the study results and allows for an accurate assessment of the brain's electrical activity. Each element on the graph contains valuable information necessary for diagnosis and analysis of the patient's condition.

Image: Kenneth A. Kooi, Fundamentals of Electroencephalography, Harper and Row, New York, 1971 / Edward R. Tufte / Graphics Press

The graph will not acquire Chartjunk, which Tufte calls "chart junk," can add visual appeal and interest. Elements that are irrelevant to the data are distracting and interfere with the comprehension of the information. Clean and simple charts facilitate better understanding of the data presented, allowing users to focus on the essentials. Effective visualizations should be clear and concise, avoiding unnecessary embellishments that can confuse the viewer. It's important to remember that the purpose of a chart is to convey information, not simply decorate it.

The design of graphic images serves a variety of purposes. It can enhance the scientific accuracy of a visualization, make an exhibition more appealing, and provide a platform for a designer to showcase their artistic vision. These changes often arise from a lack of data or, conversely, an excess of data. Proper design not only improves the perception of information but also effectively communicates key ideas to the audience.

Edward Tufte is a prominent expert in the field of data visualization and computer science. He is known for his work emphasizing the importance of effective information presentation. Tufte is the author of several books, including "The Art of Data Presentation," which details visualization methods and techniques for making complex data more accessible and understandable. His approach is based on the principles of clarity, accuracy, and aesthetics. Edward Tufte also actively conducts seminars and lectures, sharing his experience and knowledge with professionals and students. His ideas and research have significantly influenced the development of data visualization, making it more relevant in today's information-driven world. Jittery graphs filled with excessive clutter pose a serious challenge in data visualization. Incorrect or redundant elements in graphs can obscure information and lead to incorrect conclusions. Effective data presentation requires minimizing noise and focusing on key aspects, ensuring clarity and comprehensibility. Clean graphs play a vital role in analytics, allowing users to quickly and accurately interpret results. Optimizing data visualization with these factors in mind leads to improved analysis and more informed decision-making.

Image: Instituto de Expansão Commercial, Brasil: Graphicos Economicos–Estatisticas, Rio de Janeiro, 1929, p. 15. / Edward R. Tufte / Graphics Press

Bad graphs presented in scientific journals often cause confusion. In such visualizations, background planes can come to the foreground, making it difficult to perceive the information. For example, the bottom plane may be presented without any labels, and the triangles that are to be compared have no clear landmarks. This significantly reduces the effectiveness of the graphs and complicates data analysis. It is important to create clear and understandable graphics that allow readers to easily perceive information and make comparisons.

Graph of pulmonary regurgitation severity assessment in 31 patients. Severity ratings: 0 - none, 0.5 - minor, 1 to 1.5 - mild, 2 - moderate, 3 - severeImage: Nicholas T. Kouchoukos, Replacement of the Aortic Root with a Pulmonary Autograft in Children and Young Adults with Aortic-Valve Disease, The New England Journal of Medicine, 1994 / Edward R. Tufte / Graphics Press

The Paris to Lyon train schedule provides complete departure, travel, and arrival times. Departure and arrival points, as well as stops between stations, are displayed in a horizontal format. Travel times are indicated by vertical lines: the straighter the lines, the faster the train. This user-friendly presentation allows passengers to easily navigate the schedule and plan their trips.

Image: Étienne-Jules Marey, La Méthode Graphique Dans les Sciences Expérimentales et Principalement en Physiologie et en Médecine / Edward R. Tufte / Graphics Press LLC

A dense grid can significantly hinder the perception of information. To improve usability and emphasize the data, it is recommended to loosen the grid. This will create a more comfortable visual environment and make information more accessible to the user.

Image: Étienne-Jules Marey, La Méthode Graphique Dans les Sciences Expérimentales et Principalement en Physiologie et en Médecine / Edward R. Tufte / Graphics Press LLC

Tafte draws parallels between "ducks" in information design and extravagant buildings. Architects and landowners often invest heavily in creating unique forms that can distract from the primary function of a building.

It is time to rethink John Ruskin's assertion that architecture is the ornament of construction. It is important to heed Pugin's warning: while ornamentation of a structure is permissible, decoration that does not serve a functional purpose should be avoided. Architecture should combine aesthetics and practicality, creating a harmonious space where form serves not only beauty but also convenience.

Robert Venturi, Denis Scott Brown, and Stephen Izenour are significant figures in architecture and design. Their work influenced the development of postmodernism in architecture. The Cambridge, revised edition, 1977, is an important source that illuminates their ideas and concepts. This volume explores the core principles that informed their approach to architectural design and examines the social and cultural contexts in which they worked. These authors emphasize the importance of the interaction between architecture and society, as well as the role of historical elements in contemporary design. Studying their works allows us to better understand the evolution of architectural thought and current trends in the field of architecture.

Photo: Edward R. Tufte / Graphics Press LLC

The designer uses graphics exclusively as a decorative element. This approach allows for the creation of visually appealing content that doesn't distract from the main message. Decorative graphics enhance the perception of information, adding aesthetic value and helping to establish an emotional connection with the audience. Effective use of graphic elements can significantly increase interest in material and improve user interaction with the content.

Image: American Education, 1970s / Edward R. Tufte / Graphics Press LLC

A series of unusual 3D graphs published in American Education magazine in In the 1970s, this chart sparked genuine interest among lovers of graphical absurdity. Using five colors to display just five pieces of data created confusion, as the yearly divisions increase by 100%. This chart may be considered one of the most unsuccessful in the history of data visualization.

Edward Tufte is a leading expert in the field of data visualization and information design. He is known for his works that reveal how to effectively present and interpret data. Tufte emphasizes the importance of visual perception of information and offers methods for making complex data more accessible and understandable. His books, such as "The Visual Display of Quantitative Information," have become classics in the field of data design, providing readers with valuable insights on how to improve the quality of visualizations. Edward Tufte is also the founder of an information design consultancy, where he continues to advance the idea of ​​effective communication through visualization. His work inspires designers, scientists, and analysts seeking to create clear and informative graphs and charts.

Data Display Formats

A sentence is effectively used when there are no more than two numbers to present. If the number of numbers increases, the reader will need more time to comprehend and analyze the information. For example:

In Group A, 53% of participants completed certain actions, while in Group B this figure was 46%. In Group C, the percentage was even higher - 57%. These data highlight the differences in activity between the groups and can be useful for further analysis.

You can create a list that is sorted not alphabetically, but by numerical values. This will allow you to compare indicators more effectively and speed up data analysis. This approach greatly simplifies the comparison process and helps to quickly identify key points.

  • Group B 46%.
  • Group A 53%.
  • Group C 57%.

A table can be used to achieve similar goals. However, to justify this format, significantly more data must be presented. For example, a table could display information about the winners and losers in a prediction game. Furthermore, a table can function as a graph, allowing you to visually present the results and compare them. This simplifies the analysis and perception of information, making it more accessible to users.

Image: New York Times, January 2, 1979 / Edward R. Tufte / Graphics Press LLC

Statistical calculations can be more accurate and informative than traditional analysis methods. An example of this is the Anscombe quartet, in which four different data sets share the same linear model despite significant differences in their visual representations. This highlights the importance of not only quantitative analysis but also graphical presentation of data for a deeper understanding and interpretation of statistical results.

Image: Edward R. Tufte / Graphics Press

The graphs have significant differences, which makes it clear that each column is unique. Each of them demonstrates its own individuality, highlighting differences in data and trends. Such contrasting visualizations help to better understand the presented information and make the analysis more visual.

Image: F.J. Anscombe, Graphs in Statistical Analysis, American Statistician, 1973 / Edward R. Tufte / Graphics Press

In this example, you can clearly see that point A deviates significantly from the overall data set. However, this anomaly is not noticeable on the vertical and horizontal axes. Analyzing such deviations is important for a deep understanding of the structure and distribution of the data.

Image: Edward R. Tufte / Graphics Press

If the source data is poor, no amount of visualization can improve the situation. Good information presentation starts with reliable and accurate data. Only with high-quality source data can you create informative graphs and charts that will truly help in analysis and decision-making.

A - New York Stock Exchange stock prices, B - solar radiation, C - London Stock Exchange prices. 1929 (after Garcia-Mata and Schaffner)Image: Edward R. Dewey and Edwin F. Dakin, Cycle: The Science of Prediction, New York, 1947 / Edward R. Tufte / Graphics Press

These tools effectively and naturally draw attention to data analysis, allowing significant amounts of information to be accumulated in a compact format.

These maps display cancer rates among white men and women in the United States. It is important to focus on the analysis of the data, rather than being distracted by the visual elements. Studying these rates can help understand cancer prevalence and identify trends in different populations.

Cancer incidence rates among white women and men in the United States. 1950–1969 Image: Thomas J. Mason, Frank W. McKay, Robert Hoover, William J. Blot, and Joseph F. Fraumeni / Public Health Service, National Institutes of Health, 1975 / Lawrence Fahey and Edward Tufte

Maps can distort perceptions by emphasizing unimportant aspects. It is important to consider the significance of each geographic area, not simply statistics on the number of people or cancer deaths. This phenomenon, according to Tufte, is a “chronic problem” with maps that use color fills to denote different areas. More careful interpretation of map data is needed to avoid false conclusions and emphasize the real significance of the information.

Tufte expresses doubts about the source and reliability of the data presented.

Death certificates list the cause of death, which is an important aspect for understanding health statistics. These reports may reflect the influence of diagnostic trends among physicians and coroners in specific regions and during specific historical periods. This can lead to distorted data, which itself addresses the complex question of the precise location of primary cancers within the body. Such falsifications can complicate the analysis of causes of death and influence the development of medical research, as well as the formation of public opinion about diseases.

Edward Tufte is a renowned expert in the field of data visualization and information design. His work focuses on how to effectively present complex data to make it more accessible and understandable to a wide audience. Tufte emphasizes the importance of aesthetic design and accurate communication of information, emphasizing that design should support, rather than distort, the content. In his books and lectures, he shares methods for creating intuitive graphs and charts. Edward Tufte inspires designers and analysts worldwide, urging them to strive for high precision and clarity in data visualization.

Dr. John Snow's dot map visualizes the locations of cholera deaths in central London. On the map, dots represent cases, and crosses indicate the locations of water pumps where residents drew their water. The highest number of deaths was recorded on Broad Street. Noticing this anomaly, John Snow removed the handle from the pump on that street, helping to end the epidemic in the area. This case became an important stage in the development of epidemiology and demonstrated the importance of high-quality drinking water for public health.

Image: E.W. Gilbert, Pioneer Maps of Health and Disease in England, Geographical Journal, 1958 / Edward R. Tufte / Graphics Press

Charles Minard's 1867 map of French wine exports presents data on the quantity and direction of shipments. This map illustrates important trade routes and wine export volumes, providing insight into the dynamics of the French wine industry at the time. Minard's unique approach to data visualization makes his work a significant contribution to the study of economic geography and the history of wine. By analyzing this map, modern researchers and wine lovers can learn more about the traditions and development of wine production in France.

Image: Charles Minard

The uniform rhythm of time creates a natural order on the timelines, which allows for effective interpretation of the data. This is especially relevant for large, highly volatile data sets, as structure facilitates better analysis and visualization of the information.

This New York City weather chart for 1980 illustrates daily highs and lows, comparing them with long-term averages. Analyzing this trend allows for a better understanding of the weather conditions that residents and visitors can expect in the future. Understanding climate trends helps us plan and adapt to changing conditions.

Image: The New York Times / Edward R. Tufte / Graphics Press

William Playfair's chart shows three parallel time series: wheat price dynamics, wage changes, and the length of British rule monarchs from 1565 to 1821. This chart illustrates the relationship between economic indicators and political events, highlighting how changes in agricultural prices could influence living standards and political stability in the country. Analyzing this data allows for a better understanding of the economic conditions of the era and their impact on social and political processes in Britain.

Image: William Playfair

This category of time series is a visualization that displays not only temporal changes but also spatial aspects. Typically, such graphs display data in three, four, or even five dimensions. The information is presented in such an intuitive manner that the viewer often doesn't appreciate the complexity of the data being analyzed.

Charles-Joseph Minard visualized the fate of Napoleon's army in Russia using his map. The orange line represents the number of troops during the advance on Moscow, and the black line represents the number of soldiers during the retreat to Poland. The map clearly demonstrates the places and methods by which Napoleon lost his soldiers, highlighting the consequences of his military actions in Russia. This graphical method allows for a better understanding of the scale of losses and strategic mistakes made during the campaign.

Image: Charles Minard

The life cycle of the Japanese beetle is an interesting process consisting of several stages. During the first stage, the female lays eggs in the soil, usually in acidic and moist conditions. After a few weeks, the eggs hatch into larvae, which begin actively feeding on plant roots. This process can last up to several years, depending on environmental conditions.

After the larval stage, the beetle transitions to a pupa, where it undergoes changes in preparation for the next stage. The pupa can remain in the soil for several months. Eventually, the adult Japanese beetle emerges from the pupa, which has a distinctive appearance. Adult beetles actively search for food, primarily plants, and can cause significant damage to agriculture. Understanding the life cycle of the Japanese beetle is important for developing effective methods of controlling this pest and protecting cultivated plants.

Image: Leonard Hugh Newman / Man and Insects, Aldus Books, 1965 / Edward R. Tufte / Graphics Press

Graph by Antonio Gaballo illustrates the history of Italian post offices. He demonstrates the growth dynamics of savings boxes over time. The univariate data representation and the same number of variables and dimensions ensure the graph's high reliability. This contrasts with obscure charts related to oil and money, which can be misleading.

Image: Antonio Gabaglio, Teoria Generale Della Statistica, Milan, 1888 / Edward R. Tufte / Graphics Press

With the development Infographics have created a need for more abstract methods of presenting data that have no direct analogs in the physical world. In a context where different quantities can be compared with each other if measured in the same units, such approaches are particularly relevant. Modern technologies make it possible to visualize complex data, simplifying its perception and analysis. This opens new horizons for researchers and specialists in various fields, as it allows information to be presented in a more accessible and visual form.

In this graph, William Playfair presented the relationship between population and tax revenue in various countries, where the areas of the circles symbolize population size. The lines connecting tax and population data are intended to visualize this relationship, but they are not always informative, as they depend not only on the data presented but also on the size of the circles. This can be misleading when interpreting the graph.

Image: William Playfair, Statistical Breviary, 1801 / Edward R. Tufte / Graphics Press

A comparison of studies on the relationship between temperature and thermal conductivity of copper reveals significant differences in the results. These differences may be due to the presence of impurities in the original copper samples. Impurities affect the physical properties of the metal, which in turn affects its thermal conductivity. The thermal conductivity of copper, one of the best heat conductors, can vary depending on the temperature and composition of the material. Therefore, it is important to consider the purity of the samples during testing to achieve more accurate and reproducible results. Analyzing data on the effect of temperature on the thermal conductivity of copper can contribute to a deeper understanding of its thermal properties and improve technologies using copper in various industries.

Image: C.Y. Ho, R.W. Powell, P.E. Liley / Journal of Physical and Chemical Reference Data 3, American Chemical Society, 1974 / Edward R. Tufte / Graphics Press

White pine seedlings were planted for one season in sand with varying calcium levels. These experiments allowed us to evaluate the effect of calcium levels on plant growth and development. White pine, known for its resilience and longevity, can significantly benefit from optimal nutrient conditions. Studying the seedlings' response to varying soil calcium concentrations helps us better understand how this element influences their viability and adaptation to the environment. The results of these studies may be useful for forestry and horticulture, as they will allow the selection of the most suitable conditions for the successful cultivation of white pine.

Image: H. L. Mitchell, The Growth and Nutrition of White Pine Seedlings in Cultures with Varying Nitrogen, Phosphorus, Potassium and Calcium, The Black Rock Forest Bulletin No. 9 / Maria Moors Cabot Foundation for Botanical Research / Harvard University, 1939 / Edward R. Tufte / Graphics Press

Tufte emphasizes that there are no universal rules for presenting data. However, he offers several guidelines that can help in finding the optimal solution. Pay attention to the context in which the data will be used, consider the target audience, and choose the most appropriate visual formats. Experiment with different ways of presenting information to achieve the best perception and comprehension. Remember the importance of clean and clear design to ensure data is easy to read and understand.

  • Choose the right format;
  • Combine words, numbers, and illustrations;
  • Don't forget about balance, proportion, and a sense of appropriate scale;
  • Show accessible complexity of detail;
  • Use data to tell a story;
  • Avoid content-free designs, including chartjunk.

UX/UI Designer Profession

You will learn how to develop user-friendly websites and applications and adapt them to different devices. You will understand how to create powerful products. Master a sought-after specialty and increase your income.

Find out more