Skip to content

Perl scripts for merging datasets contributed to Jo's HMRC Tax Exempt Art project

Notifications You must be signed in to change notification settings

milh0use/HMRC-Data

 
 

Repository files navigation

HMRC database of tax exempt art

HMRC-data*.tsv is the most recent file, containing all artworks, post some rudimentary data cleaning and a first attempt to extract artist names from the full text descriptions.

HMRC objects by county (also tab separated) contains limited location data for the objects. This data is stored separately by HMRC. Only the descripton field is common to both tables.

te-art.txt (hash separated) is the original scraped data from the HMRC website

The database contains just over 33,000 works of art in total.

About

Perl scripts for merging datasets contributed to Jo's HMRC Tax Exempt Art project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Perl 100.0%