Moving Historical Geodata to the Web
The New York Public Library
Supported by the Alfred P. Sloan Foundation
November 5-7, 2014
meeting notes by Lex Berman
PDF: agenda, participants
Hosted by Matt Knutzen, Geospatial Librarian for NYPL, this meeting built upon the successful digitization of the NYPL map collection and established common ground with other similar institutional projects.
The meeting also connected experts from across the U.S. and Europe who are currently working on the same types of historical geographic information in digital form. The workshop participants spent two days brainstorming, sharing experiences, and proposing practical steps for collaboration.
In his opening remarks, Knutzen described the transformation of more than 20,000 historical map sheets in the NYPL collection into digital images that were subsequently made available to the general public. The modular approach taken by Knutzen involved several pieces of software and the crowd-sourcing of some tasks.
Each piece of the NYPL workflow for turning paper maps into a variety of searchable web resources yielded important findings. For example, the Map Warper Tool was very successful in creating rough-and-ready georectified images of maps. However, a crowd-sourced feature extraction tool, which enabled volunteers to digitize buildings and other features from the historical maps, proved far too time-consuming. This discovery led to the development of the automated map vectorizer and feature extraction tool.
The automation of the “first pass” in creating building footprints saved an enormous amount of time, and enabled the creation of the crowd-sourced quality control and data entry tool called Building Inspector. Building Inspector users are asked to submit only a few very simple pieces of information, enabling the rapid collection of a huge number of data enhancements with minimal opportunity for error.
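The notes do not say how Building Inspector reconciles answers from different volunteers. One common way to combine many simple answers is a majority vote with an agreement threshold. The sketch below is only an illustration under that assumption: the function name, the answer labels ("yes", "no", "fix"), the footprint IDs, and both thresholds are hypothetical and are not taken from the NYPL software.

```python
from collections import Counter


def consensus(votes, min_votes=3, min_agreement=0.75):
    """Return the agreed answer for one footprint, or None if undecided.

    min_votes and min_agreement are assumed values for illustration,
    not settings taken from Building Inspector.
    """
    if len(votes) < min_votes:
        return None  # too few answers to decide yet
    answer, count = Counter(votes).most_common(1)[0]
    if count / len(votes) >= min_agreement:
        return answer
    return None  # volunteers disagree; needs more votes or expert review


# Hypothetical volunteer answers, keyed by a made-up footprint ID.
footprint_votes = {
    "footprint-1": ["yes", "yes", "yes", "fix"],
    "footprint-2": ["yes", "no", "fix"],
    "footprint-3": ["no", "no"],
}

results = {fid: consensus(votes) for fid, votes in footprint_votes.items()}
for fid, decision in results.items():
    print(fid, decision)
```

Keeping each question to a small, fixed set of answers is what makes this kind of aggregation simple: footprints that do not reach agreement can be shown to more volunteers instead of being accepted or rejected on a single opinion.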
Overall, the important lessons learned by developing the NYPL historical map tools were that some tasks are better suited to automation, and some tasks, though automated, benefit from human inspection and correction. In addition, the working software for Map Warper, Map Vectorizer, and Building Inspector is now open source and available for reuse in other contexts.
By bringing together a large group of domain experts, programmers, and researchers in the fields of libraries, museums and digital humanities, this workshop advanced the set of common tasks and inspired multi-party collaborations. In Matt Knutzen’s words: “making historical spatial data actionable is the part of the Venn diagram where all of our interests overlap.”
During the meeting a metadata exchange repository was established:
http://github.com/opengeometadata
Some of the important questions raised at the meeting were:
Some highlights of the topics discussed were:
DATA LIFE CYCLE
FIVE STAR HISTORICAL MAP DATA
USE CASES
TOOLS - SURVEY OF TYPES
THE BIG VISION
A collection of links from the meeting:
Annotorious Image Annotation - http://annotorious.github.io/
Boston Public Library Leventhal Map Center - http://maps.bpl.org/view_collection
Cooper Hewitt Labs - http://labs.cooperhewitt.org/
Data Life Cycle Model - http://libraries.mit.edu/data-management/plan/checklist/data-lifecycle/
Data in Project Life Cycle - https://wiki.responsibledata.io/Data_in_the_project_lifecycle
Five Star Open Data - http://5stardata.info/
GeoBlacklight Design Document - http://goo.gl/V0lkz4
GeoHumanities Special Interest Group - http://geohumanities.org
GeoNames - http://www.geonames.org/
Getty Thesaurus of Geographic Names as GeoJSON - https://github.com/straup/tgn-geojson
LOC Gazetteer - http://loc.gazetteer.us/
Metadata Schema for Resource Discovery Use Cases - http://journal.code4lib.org/articles/9710
Neatline, Plot your course in space and time - http://neatline.org/
NYPL Building Inspector - http://buildinginspector.nypl.org/
NYPL Map Vectorizer - https://github.com/NYPL/map-vectorizer
NYPL Map Warper - http://maps.nypl.org/warper/
OGC Cat Interop - https://github.com/OSGeo/Cat-Interop
Old Maps Online - http://www.oldmapsonline.org/
OpenGeoMetadata repository - http://github.com/opengeometadata
OpenGeoPortal - http://opengeoportal.org/
OpenHistoricalMap - http://www.openhistoricalmap.org
OpenStreetMap Map Warper - https://github.com/timwaters/mapwarper
Orbis Geospatial Network Model - http://orbis.stanford.edu/
Past Place API - http://www.pastplace.org/
Pelagios - http://pelagios-project.blogspot.com/
PeriodO assertions for linking data - http://perio.do/
Simple Open Data - http://simpleopendata.com/
Temporal Gazetteer API [TGAZ] - http://chgis.hmdc.harvard.edu/tgaz/api/
Temporal Gazetteers Resource Page - http://www.fas.harvard.edu/~chgis/gazetteer/
TopoTime Qualitative Reasoning for Historical Time - http://kgeographer.com/wp/topotime/
UCSB Spatial Search - http://spatial.ucsb.edu/2014/spatial-search
UVA Map Scholar - http://mapscholar.org/
WikiData - http://www.wikidata.org/