Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

routes for searching for datasets/recordsets #14

Open
sckott opened this issue Mar 16, 2016 · 2 comments
Open

routes for searching for datasets/recordsets #14

sckott opened this issue Mar 16, 2016 · 2 comments
Labels

Comments

@sckott
Copy link

sckott commented Mar 16, 2016

Am I missing something? Seems there's no route for searching for datasets, sort of like those in GBIF http://www.gbif.org/developer/registry

@godfoder
Copy link
Collaborator

There is, but it is currently undocumented. One of the reasons we haven't worked on it at all is that the metadata we collect is very bare, and often not what you really expect.

The endpoint is at http://search.idigbio.org/v2/search/recordsets if you want to poke around. The search parameter is rsq (equivalent to rq and mq on the record and media endpoints). Whats in there is a minimum set of identifying information scrapped from the RSS feeds and EML files. Deeper information, like the links to hand curated institution and collection metadata like GBIF has, is spotty since we don't have a registration process we make all of our providers go through.

There is also a publishers endpoint: http://search.idigbio.org/v2/search/publishers but that data is basically so bare its useless to anyone outside iDigBio. The search parameter on that is pq.

The Meta fields endpoints work for both types as well:

http://search.idigbio.org/v2/meta/fields/recordsets
http://search.idigbio.org/v2/meta/fields/publishers

If you have a specific use case your working towards, we can try and augment the available data if possible to move those types closer to production usability.

@sckott
Copy link
Author

sckott commented Mar 17, 2016

Thanks for the quick response.

Use case: searching for datasets by a specific institution, collection within an institution, etc.

Working on a collaboration with CalAcademy right now https://github.com/ropenscilabs/spplit - and its not a hugely important part, but would be nice to programmatically let users search for datasets - then we use that UUID to search for specimen records, etc. Right now since this is just a CalAcademy thing, I'm just caching the UUIDs associated with CalAcademy collections in the library itself

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants