Web form: https://osf.io/search/. Example with .mdp files: https://osf.io/search/?q=mdp&filter=file&page=1
Query follows Lucene Search Query Help
Note
OSF does not allow to search by file extension
API to search for files: https://api.osf.io/v2/search/files/. Example with .mdp files: https://api.osf.io/v2/search/files/?q=mdp&page=1
API documentation.
A token is required to use the API programmatically. Create one from your user settings. Select the osf.full_read
scope. Save this token in a .env
file:
OSF_TOKEN=<YOUR OSF TOKEN HERE>
- Search for relevant files. Loop on file extensions with keywords. Results are paginated. Extract a set of unique datasets.
- For each dataset, retrieve informations and files list.
- Retrieve files informations. Results are paginated.
Dataset with folders:
- ULK1 kinase domain MD simulations. Gromacs files with Charmm forcefield.
Dataset with components:
- Voltage-sensing. Gromacs files.
Dataset with zip files:
- Molecular dynamics simulation of floating sphere...
- LN17351-MD model-NAMD conf.. Overview of the content of
AllModel.zip
- Supplemental materials for preprint: Energetics of Interfacial Interactions...
Note: we cannot easily catch the content of zip files as displayed by OSF since the overview is Javascript based. See for instance the source of the page with the content of AllModel.zip
. More advanced solutions such as selenium might be useful.