OldVisOnline: Curating a Dataset of Historical Visualizations

08/30/2023
by   Yu Zhang, et al.
0

With the increasing adoption of digitization, more and more historical visualizations created hundreds of years ago are accessible in digital libraries online. It provides a unique opportunity for visualization and history research. Meanwhile, there is no large-scale digital collection dedicated to historical visualizations. The visualizations are scattered in various collections, which hinders retrieval. In this study, we curate the first large-scale dataset dedicated to historical visualizations. Our dataset comprises 13K historical visualization images with corresponding processed metadata from seven digital libraries. In curating the dataset, we propose a workflow to scrape and process heterogeneous metadata. We develop a semi-automatic labeling approach to distinguish visualizations from other artifacts. Our dataset can be accessed with OldVisOnline, a system we have built to browse and label historical visualizations. We discuss our vision of usage scenarios and research opportunities with our dataset, such as textual criticism for historical visualizations. Drawing upon our experience, we summarize recommendations for future efforts to improve our dataset.

READ FULL TEXT

page 3

page 4

page 6

page 7

research
11/18/2018

Ethical Dimensions of Visualization Research

Visualizations have a potentially enormous influence on how data are use...
research
09/04/2020

Visualizing a Large Spatiotemporal Collection of Historic Photography with a Generous Interface

Museums, libraries, and other cultural institutions continue to prioriti...
research
02/12/2022

Structure-aware Visualization Retrieval

With the wide usage of data visualizations, a huge number of Scalable Ve...
research
07/27/2022

Beyond Visuals : Examining the Experiences of Geoscience Professionals With Vision Disabilities in Accessing Data Visualizations

Data visualizations are ubiquitous in all disciplines and have become th...
research
04/02/2018

mQAPViz: A divide-and-conquer multi-objective optimization algorithm to compute large data visualizations

Modern digital products and services are instrumental in understanding u...
research
08/30/2021

Making the Invisible Visible: Risks and Benefits of Disclosing Metadata in Visualization

Accompanying a data visualization with metadata may benefit readers by f...
research
07/29/2020

Advancing Visual Specification of Code Requirements for Graphs

Researchers in the humanities are among the many who are now exploring t...

Please sign up or login with your details

Forgot password? Click here to reset