Spec.qc.ca not loading events #82

saumier · 2024-10-24T21:53:53Z

When running the workflow for spec.qc.ca the system exits with an error:
Max retries reached. Unable to fetch the content for page .

The text was updated successfully, but these errors were encountered:

dev-aravind · 2024-10-25T12:22:01Z

Notes: The crawling works fine in a local machine, but fails when it is running in a github runner.

Task for @dev-aravind - Add the user-agent header in all steps of the crawling process, which includes fetching entity URLs, fetching entity details ( both headless and headful mode ).

@saumier will try and contact the Spec.qc.ca developer team to allow our user-agent to crawl their website.

saumier · 2024-10-25T14:55:00Z

@troughc I sent you an email for Isabelle to ask her tech team to allow the Artsdata crawler User Agent "artsdata-crawler/3.3.0"

Additional note: Artsdata crawler agent is "artsdata-crawler/3.3.0" however the tech teams have been informed to only match to "artsdata-crawler", because the version number (currently 3.3.0) changes with each update.

troughc · 2024-10-25T21:59:27Z

email was sent

dev-aravind · 2024-10-29T11:49:28Z

@saumier The user-agent is now added to every step.

troughc · 2024-10-29T12:44:55Z

@dev-aravind the tech teams have been informed to only match to "artsdata-crawler", because the version number (currently 3.3.0) changes with each update.

saumier · 2024-10-29T17:12:17Z

@fjjulien Please let me know if you hear anything from Isabelle at Spec regarding our crawler being allowed in. Once the Artsdata crawler is allowed in I will run another crawl of their event JSON-LD.

saumier assigned dev-aravind Oct 24, 2024

saumier added this to Artsdata Oct 24, 2024

saumier moved this to Todo in Artsdata Oct 24, 2024

dev-aravind assigned saumier and unassigned dev-aravind Oct 29, 2024

dev-aravind moved this from In Progress to In Review in Artsdata Oct 29, 2024

saumier assigned fjjulien and unassigned saumier Oct 29, 2024

saumier removed the status in Artsdata Dec 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spec.qc.ca not loading events #82

Spec.qc.ca not loading events #82

saumier commented Oct 24, 2024

dev-aravind commented Oct 25, 2024

saumier commented Oct 25, 2024 •

edited by troughc

Loading

troughc commented Oct 25, 2024

dev-aravind commented Oct 29, 2024

troughc commented Oct 29, 2024

saumier commented Oct 29, 2024 •

edited

Loading

Spec.qc.ca not loading events #82

Spec.qc.ca not loading events #82

Comments

saumier commented Oct 24, 2024

dev-aravind commented Oct 25, 2024

saumier commented Oct 25, 2024 • edited by troughc Loading

troughc commented Oct 25, 2024

dev-aravind commented Oct 29, 2024

troughc commented Oct 29, 2024

saumier commented Oct 29, 2024 • edited Loading

saumier commented Oct 25, 2024 •

edited by troughc

Loading

saumier commented Oct 29, 2024 •

edited

Loading