HTTPError should be caught. A SiteChecksum that raises the error should be skipped so it does not block the others behind it.
@app.task
def check_new_release():
    scheduled_jobs = []
    site_checksums: list[Type[SiteChecksum]] = [
        AlterChecksum,
        GSCChecksum,
        NativeChecksum,
    ]
    with pgsql_session():
        scrapy_util = ScrapydUtil(
            os.getenv("SCRAPYD_URL", "http://127.0.0.1:6800"), "product_crawler"
        )
        for site_checksum in site_checksums:
            checksum = site_checksum(scrapyd_util=scrapy_util)
            if checksum.is_changed:
                spider_jobs = checksum.trigger_crawler()
                scheduled_jobs.extend(spider_jobs)
                checksum.update()
    return scheduled_jobs
hook_tasks/hook_tasks/periodic/tasks.py
Lines 35 to 53 in d1826a3
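A minimal sketch of the suggested change, assuming the error is `requests.exceptions.HTTPError`. The stub classes (`OKChecksum`, `FailingChecksum`) are hypothetical stand-ins for the real checksum classes; the point is only the try/except-and-continue shape of the loop:

```python
import logging
from requests.exceptions import HTTPError  # assumption: requests is the HTTP client in use

logger = logging.getLogger(__name__)


class OKChecksum:
    """Hypothetical stand-in for a healthy SiteChecksum."""
    is_changed = True

    def trigger_crawler(self):
        return ["job-1"]

    def update(self):
        pass


class FailingChecksum:
    """Hypothetical stand-in for a site whose checksum fetch fails."""
    @property
    def is_changed(self):
        raise HTTPError("503 Server Error")  # simulated upstream failure


def check_new_release(checksums):
    scheduled_jobs = []
    for checksum in checksums:
        try:
            if checksum.is_changed:
                scheduled_jobs.extend(checksum.trigger_crawler())
                checksum.update()
        except HTTPError:
            # Skip this site so the remaining checksums still run.
            logger.exception("checksum check failed for %r, skipping", checksum)
            continue
    return scheduled_jobs
```

With this shape, a failure in one site's checksum only logs and moves on, e.g. `check_new_release([FailingChecksum(), OKChecksum()])` still returns the healthy site's jobs instead of raising.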