ScienceDirect: Object Retrieval #360

nils-herrmann · 2024-10-17T09:13:30Z

It seems that the only consistent way of identifying objects is by its eid which has following structure:
<file_eid>-<object_ref>.<object_suffix>
An example is 1-s2.0-S0893608024005562-si15.svg'

Therefore the most reliable strategy is to retrieve objects by passing the document identifier and object file name:

ObjectRetrieval('10.1016/j.neunet.2024.106632', filename='gr3.jpg')

To get the file names, users can use the ObjectMetadata class:

o_md = ObjectMetadata('10.1016/j.neunet.2024.106632')
filenames = [f['filename'] for f in o_md.results]

The text was updated successfully, but these errors were encountered:

Michael-E-Rose · 2024-10-18T09:26:27Z

How would users know the filename beforehand?

nils-herrmann · 2024-10-18T09:36:31Z

There is a naming convention. All items are enumerated with a prefix/suffix depending on its type (figure, math formula, pdf):

Standard Figures are: gr<nr>.jpg
Formula: si<nr>.svg

Manually, there are two options:

Use the ObjectMetadata class and get the filenames of all objects:

o_md = ObjectMetadata('10.1016/j.neunet.2024.106632')
filenames = [f['filename'] for f in o_md.results]

Check the paper online and inspect the download link: https://ars.els-cdn.com/content/image/1-s2.0-S1566253524004342-gr2_lrg.jpg

Michael-E-Rose · 2024-10-24T10:03:13Z

Alright, then let's make the class work with the filename. I will include your hints in the documentation.

nils-herrmann self-assigned this Oct 17, 2024

nils-herrmann mentioned this issue Oct 17, 2024

(Draft) ScienceDirect: Object Retrieval API #353

Closed

nils-herrmann added Effort: High Backend labels Oct 17, 2024

nils-herrmann changed the title ~~Science Direct: Object Retrieval~~ ScienceDirect: Object Retrieval Oct 17, 2024

nils-herrmann added a commit to nils-herrmann/pybliometrics that referenced this issue Oct 17, 2024

pybliometrics-dev#360 ScienceDirect Object Retrieval with eid

af361f1

nils-herrmann added a commit to nils-herrmann/pybliometrics that referenced this issue Oct 17, 2024

pybliometrics-dev#360 Minor changes

08adcc4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ScienceDirect: Object Retrieval #360

ScienceDirect: Object Retrieval #360

nils-herrmann commented Oct 17, 2024

Michael-E-Rose commented Oct 18, 2024

nils-herrmann commented Oct 18, 2024 •

edited

Loading

Michael-E-Rose commented Oct 24, 2024

ScienceDirect: Object Retrieval #360

ScienceDirect: Object Retrieval #360

Comments

nils-herrmann commented Oct 17, 2024

Michael-E-Rose commented Oct 18, 2024

nils-herrmann commented Oct 18, 2024 • edited Loading

Michael-E-Rose commented Oct 24, 2024

nils-herrmann commented Oct 18, 2024 •

edited

Loading