Notes from Meeting With Farris
- OpenRefine – maybe just go with that if easier
- Farris’ notes from provenance conference
- What other data sets
- Goals: Combine as many datasets as possible.
- How to visualize – data in action
- Yearly accession data
- Timeline
- External resources
- Incorporating external data most important at this point
- Create a timeline?
- What can be ingested into Whitney server and maintained?
Timeline for the Rest of the Semester
Since Karma is a pain and Gephi is probably beyond the scope of my technical abilities, I will focus on OpenRefine for now.
A nice deliverable would be a master Founding Collection dataset with URIs from as many other institutional repositories as possible.
Basically, just reconcile everything.
Reconciliation
https://github.com/OpenRefine/OpenRefine/wiki/Reconciliation
https://www.wikidata.org/wiki/Wikidata:List_of_properties
https://www.wikidata.org/wiki/Property:P1566
Goal – a Founding Collection Constituents Name Directory! Currently testing to see whether OpenRefine can auto-generate columns based on Wikidata properties. Some references:
- https://tools.wmflabs.org/openrefine-wikidata/
- https://github.com/OpenRefine/OpenRefine/wiki/Reconciliation-Service-API#examples
- https://www.bountysource.com/issues/42907094-add-column-from-wikidata
- http://www.mnylc.org/fellows/2017/03/17/using-openrefine-to-reconcile-name-entities/ (Karen’s tutorial mentioned earlier).
- https://data-lessons.github.io/library-openrefine/05-advance-functions/
- https://github.com/OpenRefine/OpenRefine/issues/1179
- https://www.mediawiki.org/wiki/Wikibase/API
- https://www.youtube.com/watch?v=5tsyz3ibYzk&feature=youtu.be
Using OpenRefine to search for Wikidata properties is kind of time-consuming. Is it any less so than Python?
And…searching for Wikidata properties with OpenRefine was a failure. Python is probably fine.
Attempting to get GeoNames property from Wikidata via the Wikidata entity page for a place:
https://www.wikidata.org/w/api.php?action=wbgetclaims&entity=Q200078&property=P1566
"https://www.wikidata.org/w/api.php?action=wbgetclaims&entity="+cell.recon.match.id+"&property=P1566"
"https://tools.wmflabs.org/openrefine-wikidata/en/fetch_values?item="+cell.recon.match.id+"&prop=P1566"
Yes! Once names are reconciled to Wikidata, OpenRefine can create a column based on any property! I used GeoNames to test, since Joshua had already queried it for constituent birth/death places.
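For comparison, here is a minimal Python sketch of the same wbgetclaims lookup. The function names and the User-Agent string are my own placeholders, and the sample response is hand-written to match the shape the API returns (the GeoNames ID in it is made up), so treat this as an illustration rather than a tested pipeline.

```python
import json
import urllib.parse
import urllib.request

API = "https://www.wikidata.org/w/api.php"


def build_claims_url(qid, prop):
    """Build a wbgetclaims URL for one entity and one property."""
    query = urllib.parse.urlencode({
        "action": "wbgetclaims",
        "entity": qid,
        "property": prop,
        "format": "json",
    })
    return f"{API}?{query}"


def extract_claim_values(response, prop):
    """Pull the plain values for `prop` out of a parsed wbgetclaims response."""
    values = []
    for claim in response.get("claims", {}).get(prop, []):
        snak = claim.get("mainsnak", {})
        # "novalue" and "somevalue" snaks carry no datavalue, so skip them.
        if snak.get("snaktype") == "value":
            values.append(snak["datavalue"]["value"])
    return values


def fetch_claim_values(qid, prop):
    """Call the live API (needs network access; not run below)."""
    request = urllib.request.Request(
        build_claims_url(qid, prop),
        headers={"User-Agent": "founding-collection-notes/0.1 (example)"},
    )
    with urllib.request.urlopen(request) as reply:
        return extract_claim_values(json.load(reply), prop)


# Hand-written sample shaped like a wbgetclaims reply; the ID is a placeholder.
sample = {
    "claims": {
        "P1566": [
            {
                "mainsnak": {
                    "snaktype": "value",
                    "property": "P1566",
                    "datavalue": {"value": "0000000", "type": "string"},
                },
                "type": "statement",
                "rank": "normal",
            }
        ]
    }
}

print(build_claims_url("Q200078", "P1566"))
print(extract_claim_values(sample, "P1566"))
```

In practice, fetch_claim_values would be called once per reconciled Wikidata ID, which is roughly what OpenRefine does when it adds a column by fetching URLs.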
Extract property from resulting JSON dictionary:
value.parseJson().values.replace('[','').replace(']','').replace('"','')
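A rough Python equivalent of that expression, assuming the fetched JSON has a top-level "values" key holding a list (which is what the expression above implies). The function name and the sample string are hypothetical.

```python
import json


def extract_values(raw_json, separator=", "):
    """Return the entries under "values" as one plain string.

    Parsing the JSON and joining the list avoids stripping brackets
    and quotes by hand, as the GREL expression does.
    """
    parsed = json.loads(raw_json)
    values = parsed.get("values", [])
    return separator.join(str(v) for v in values)


# Made-up sample; a real response would carry the actual GeoNames ID.
sample = '{"values": ["0000000"]}'
print(extract_values(sample))
```

The separator is a free choice; a place with more than one GeoNames ID would come out as a single delimited cell.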
A name directory with the URIs of Whitney Constituents from various other institutional repositories seems like it could be pretty useful.
More VIAF reconciliation details:
External URI Wikidata Properties
- VIAF ID – https://www.wikidata.org/wiki/Property:P214
- LCAuth ID – https://www.wikidata.org/wiki/Property:P244
- FAST-ID (WorldCat Linked Data) – https://www.wikidata.org/wiki/Property:P2163
- Social Networks and Archival Context (SNAC) ID – https://www.wikidata.org/wiki/Property:P3430
- ULAN ID – https://www.wikidata.org/wiki/Property:P245
- RKDartists (Rijksbureau voor Kunsthistorische Documentatie) ID – https://www.wikidata.org/wiki/Property:P650
- Art UK artist ID – https://www.wikidata.org/wiki/Property:P1367
- British Museum person-institution – https://www.wikidata.org/wiki/Property:P1711
- Musée d’Orsay artist ID – https://www.wikidata.org/wiki/Property:P2268
- Photographers’ Identities Catalog ID – https://www.wikidata.org/wiki/Property:P2750
- NGA (National Gallery) artist id – https://www.wikidata.org/wiki/Property:P2252
- Artsy artist ID – https://www.wikidata.org/wiki/Property:P2042
- Smithsonian American Art Museum: person/institution thesaurus id – https://www.wikidata.org/wiki/Property:P1795
- Web Gallery of Art ID – https://www.wikidata.org/wiki/Property:P1882
- Kunstindeks Danmark Artist ID – https://www.wikidata.org/wiki/Property:P1138
- Tate artist identifier – https://www.wikidata.org/wiki/Property:P2741
- Dictionary of Art Historians ID – https://www.wikidata.org/wiki/Property:P2332
- Te Papa artist ID – https://www.wikidata.org/wiki/Property:P3544
- Sikart – https://www.wikidata.org/wiki/Property:P781
- Auckland Art Gallery artist ID – https://www.wikidata.org/wiki/Property:P3372
- Belvedere artist ID – https://www.wikidata.org/wiki/Property:P3421
- MoMA artist id – https://www.wikidata.org/wiki/Property:P2174
- KulturNav-id – https://www.wikidata.org/wiki/Property:P1248
- Nationalmuseum Sweden artist ID – https://www.wikidata.org/wiki/Property:P2538
- J. Paul Getty Museum artist id – https://www.wikidata.org/wiki/Property:P2432
- Cooper-Hewitt Person ID – https://www.wikidata.org/wiki/Property:P2011
- Thyssen-Bornemisza artist ID – https://www.wikidata.org/wiki/Property:P2431
- National Gallery of Victoria artist ID – https://www.wikidata.org/wiki/Property:P2041
- Information Center for Israeli Art artist ID – https://www.wikidata.org/wiki/Property:P1736
- Artnet Artist ID – https://www.wikidata.org/wiki/Property:P3782
- CLARA-ID (women visual artists) – https://www.wikidata.org/wiki/Property:P1615
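As a sketch of how the property list above could feed the name directory: map a few of the properties to column names and pull the first value of each from an entity's claims. Only five properties are shown to keep it short; the column names, the function names, and every ID in the sample are placeholders, and the claims structure is assumed to be the one the Wikibase API returns.

```python
# A handful of properties from the list above, keyed by a column name of my choosing.
EXTERNAL_ID_PROPS = {
    "viaf": "P214",
    "lcauth": "P244",
    "ulan": "P245",
    "snac": "P3430",
    "moma": "P2174",
}


def first_value(claims, prop):
    """Return the first real value recorded for `prop`, or None if there is none."""
    for claim in claims.get(prop, []):
        snak = claim.get("mainsnak", {})
        if snak.get("snaktype") == "value":
            return snak["datavalue"]["value"]
    return None


def directory_row(name, qid, claims):
    """Build one name-directory row: the name, the Wikidata page, one column per ID."""
    row = {"name": name, "wikidata": f"https://www.wikidata.org/wiki/{qid}"}
    for column, prop in EXTERNAL_ID_PROPS.items():
        row[column] = first_value(claims, prop)
    return row


def make_claim(value):
    """Helper for the sample below: wrap a value the way the API nests it."""
    return {"mainsnak": {"snaktype": "value", "datavalue": {"value": value}}}


# Made-up constituent with made-up identifiers.
sample_claims = {
    "P214": [make_claim("000000000")],
    "P245": [make_claim("500000000")],
}

print(directory_row("Example Artist", "Q0", sample_claims))
```

Constituents missing a given identifier simply get an empty cell, so the same row layout works for everyone.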
Export
I guess JSON is the default export format for OpenRefine?
https://github.com/OpenRefine/OpenRefine/wiki/Export-As-YAML
But is it LD….?
You can export JSON from OpenRefine using the Templating function:
More on that: http://stackoverflow.com/questions/31328001/openrefine-working-with-templating-to-export-json-as-records
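If the Templating export gets fiddly, a CSV export can be turned into similar JSON records with a few lines of Python. This is a stand-in sketch, not OpenRefine's own code: the "rows" wrapper mirrors what I understand the default template to produce, and the column names and values in the sample are made up.

```python
import csv
import io
import json


def rows_to_json(csv_text):
    """Convert CSV text (header row first) into a JSON document of row records."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return json.dumps({"rows": list(reader)}, indent=2)


# Made-up export with placeholder identifiers.
sample_csv = "name,wikidata,viaf\nExample Artist,Q0,0000000\n"
print(rows_to_json(sample_csv))
```

This gives plain JSON only; turning it into linked data would still need a context or vocabulary on top.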