feat(upsampling) - Support upsampled error count with performance optimizations by akshayutture-augment · Pull Request #13 · ai-code-review-evaluations/augment-sentry

akshayutture-augment · 2025-11-17T17:52:28Z

No description provided.

…(#94376) Part of the Error Upsampling project: https://www.notion.so/sentry/Tech-Spec-Error-Up-Sampling-1e58b10e4b5d80af855cf3b992f75894?source=copy_link Events-stats API will now check if all projects in the query are allowlisted for upsampling, and convert the count query to a sum over `sample_weight` in Snuba, this is done by defining a new SnQL function `upsampled_count()`. I noticed there are also eps() and epm() functions in use in this endpoint. I considered (and even worked on) also supporting swapping eps() and epm() which for correctness should probably also not count naively and use `sample_weight`, but this caused some complications and since they are only in use by specific dashboard widgets and not available in discover I decided to defer changing them until we realize it is needed.

- Add 60-second cache for upsampling eligibility checks to improve performance - Separate upsampling eligibility check from query transformation for better optimization - Remove unnecessary null checks in upsampled_count() function per schema requirements - Add cache invalidation utilities for configuration management This improves performance during high-traffic periods by avoiding repeated expensive allowlist lookups while maintaining data consistency.

augmentcode

Review completed. 1 suggestions posted.

Comment augment review to trigger a new review at any time.

augmentcode · 2025-11-17T17:56:23Z

+    expensive repeated option lookups during high-traffic periods. This is safe
+    because allowlist changes are infrequent and eventual consistency is acceptable.
+    """
+    cache_key = f"error_upsampling_eligible:{organization.id}:{hash(tuple(sorted(snuba_params.project_ids)))}"


Using Python’s built-in hash(...) in cache keys is non-deterministic across processes, which can cause cache misses and ineffective invalidation in multi-process deployments. Consider a stable, deterministic key (e.g., sorted join of IDs or a deterministic hash) to ensure consistency across workers (also applies to the invalidation key below).

_{🤖 Was this useful? React with 👍 or 👎}

yuvmen and others added 2 commits July 25, 2025 09:48

augmentcode Bot reviewed Nov 17, 2025

View reviewed changes

adamsaimi mentioned this pull request Feb 12, 2026

feat(upsampling) - Support upsampled error count with performance optimizations adamsaimi/sentry-20260213-0023#3

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(upsampling) - Support upsampled error count with performance optimizations#13

feat(upsampling) - Support upsampled error count with performance optimizations#13
akshayutture-augment wants to merge 2 commits into
masterfrom
error-upsampling-race-condition

akshayutture-augment commented Nov 17, 2025 •

edited

Loading

Uh oh!

augmentcode Bot left a comment

Uh oh!

augmentcode Bot Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

akshayutture-augment commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

augmentcode Bot left a comment

Choose a reason for hiding this comment

Uh oh!

augmentcode Bot Nov 17, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

akshayutture-augment commented Nov 17, 2025 •

edited

Loading