Add --until and --sort sweep flags to analyze/label by ashleyzhang01 · Pull Request #40 · withmartian/code-review-benchmark

ashleyzhang01 · 2026-05-21T23:46:19Z

Add --until flag to analyze/label for backfilling a specific date range (e.g. a missed day). Add --sort sweep mode that orders by assembled_at (analyze) or analyzed_at (label) DESC instead of bot_reviewed_at. Since we only analyze merged PRs, ones that merge long after discovery fall outside the --since window of the primary run; the sweep pass catches these stragglers in addition to the freshness-first processing.

Allows targeting a specific date range when re-running analyze/label, which is needed for incident-recovery backfills (e.g. analyzing only 2026-04-18 without first burning budget on newer days that the DESC- ordered query would walk first). - New BOUNDED query variants in queries.py with nullable since/until bounds; existing SINCE / no-bound queries kept as fast paths. - repository.get_assembled_not_analyzed and get_analyzed_not_labeled gain an `until` param and route to the bounded queries when set. - pipeline.analyze.analyze_prs and pipeline.label.label_prs thread `until` through to the repository. - main.py exposes --until on analyze + label, parsed identically to --since (relative "Nd" or absolute date). Refactored the parsing into a shared _parse_time_bound helper. - connection._translate_params strips PG-only ::type casts when running against SQLite, so the bounded queries work in both backends without duplication. - Tests cover since-only, until-only, since+until, and the all-chatbots variant. Semantics: --since is inclusive, --until is exclusive, so --since 2026-04-18 --until 2026-04-19 -> just 2026-04-18. Made-with: Cursor

asyncpg requires datetime objects for timestamptz parameters, and _coerce_args only converts strings that match the full ISO regex (YYYY-MM-DDThh:mm:ss). Bare dates like "2026-04-18" passed straight through and triggered: asyncpg.exceptions.DataError: invalid input for query argument $2: '2026-04-18' (expected a datetime.date or datetime.datetime instance, got 'str') Fix in _parse_time_bound: detect a bare YYYY-MM-DD and expand to midnight UTC. Affects both --since and --until on analyze and label. This also resolves a latent bug: the original --since handler had the same problem, but it was only ever invoked with the relative "Nd" form (which produces a full ISO timestamp), so nobody hit it. Made-with: Cursor

default is `--sort reviewed` by when bot reviewed at desc. new `--sort sweep` sorts by assembled at desc for analyze and analyzed at desc for label. this catches straggler PRs that were discovered/processed late

Copilot

Pull request overview

This PR extends the DB-backed analyze and label CLI subcommands to support bounded backfills and an alternate “sweep” prioritization order, aimed at catching PRs that become eligible long after their original discovery/review timestamp.

Changes:

Added --until (exclusive upper bound) to analyze/label, including a shared _parse_time_bound helper to normalize CLI time inputs (relative days, bare dates).
Added --sort {reviewed,sweep} to analyze/label, and introduced new SQL query variants to support sweep ordering.
Added/extended SQLite integration tests covering until bounds and sweep ordering behavior.

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
online/etl/tests/test_repository.py	Adds integration tests for `until` bounds and analyze sweep ordering.
online/etl/tests/test_main.py	Adds unit tests for the new `_parse_time_bound` CLI helper.
online/etl/pipeline/label.py	Plumbs `until`/`sort_by` into the labeling pipeline call to the repository.
online/etl/pipeline/analyze.py	Plumbs `until`/`sort_by` into the analysis pipeline call to the repository.
online/etl/main.py	Adds `--until`/`--sort` flags to CLI and centralizes time-bound parsing.
online/etl/db/repository.py	Extends repository query APIs with `until` + `sort_by` branching.
online/etl/db/queries.py	Adds bounded and sweep-ordered SQL query variants for analyze/label selection.
online/etl/db/connection.py	Updates SQLite param translation to strip Postgres `::type` casts used in new bounded queries.

Comments suppressed due to low confidence (1)

online/etl/db/repository.py:334

sort_by == "sweep" bypasses the since/until bounds here too, meaning label --since/--until --sort sweep will label PRs outside the requested reviewed-at window. Align the sweep branch with the bounded logic (e.g., add bounded+sorted queries ordered by p.analyzed_at DESC while still filtering on p.bot_reviewed_at).

        if sort_by == "sweep":
            if chatbot_id is not None:
                return await self.db.fetchall(
                    q.GET_ANALYZED_NOT_LABELED_BY_ASSEMBLED, (chatbot_id, limit)
                )
            return await self.db.fetchall(
                q.GET_ALL_ANALYZED_NOT_LABELED_BY_ASSEMBLED, (limit,)
            )
        if until is not None:

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

…an/code-review-benchmark into improve-analyze-flags

christama

Config

ashleyzhang01 added 9 commits April 21, 2026 21:30

allow sort unanalyzed PRs by assembled time

42dfe4d

add sort by flag to analyze pipeline

516f87b

add tests

4e599a8

allow sort label by assembled time

e531b88

label sorts by analyzed for sweep

da1982d

change sort key from assembled to sweep

37aefcd

default is `--sort reviewed` by when bot reviewed at desc. new `--sort sweep` sorts by assembled at desc for analyze and analyzed at desc for label. this catches straggler PRs that were discovered/processed late

change new sort key from assembled to sweep

7546b9e

default is `--sort reviewed` by when bot reviewed at desc. new `--sort sweep` sorts by assembled at desc for analyze and analyzed at desc for label. this catches straggler PRs that were discovered/processed late

ashleyzhang01 requested a review from Copilot May 21, 2026 23:46

Copilot started reviewing on behalf of ashleyzhang01 May 21, 2026 23:46 View session

Copilot AI reviewed May 21, 2026

View reviewed changes

Comment thread online/etl/db/repository.py

Comment thread online/etl/db/queries.py Outdated

Comment thread online/etl/db/queries.py

ashleyzhang01 added 4 commits May 21, 2026 17:03

rename comments for clarity

e40ee5e

Merge branch 'main' into add-analyze-until-flag

6195095

fix ruff errors

742b23e

Merge branch 'add-analyze-until-flag' of https://github.com/withmarti…

5d19f49

…an/code-review-benchmark into improve-analyze-flags

zverianskii approved these changes May 22, 2026

View reviewed changes

ashleyzhang01 merged commit 279f279 into main May 23, 2026
2 checks passed

christama approved these changes Jun 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add --until and --sort sweep flags to analyze/label#40

Add --until and --sort sweep flags to analyze/label#40
ashleyzhang01 merged 13 commits into
mainfrom
add-analyze-until-flag

ashleyzhang01 commented May 21, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

christama left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ashleyzhang01 commented May 21, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

christama left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants