feat: use arrow for datetimes and standardize output block by qibinlei · Pull Request #56 · usatlas/af-benchmarking

qibinlei · 2026-05-03T22:07:47Z

Created a new file, benchmark_utils.sh, with a function that appends a benchmark block to every output log
Updated the parser to use the Python arrow package
Updated the parser to scrape the benchmark block (reducing the number of handlers needed)

kratsg · 2026-05-04T22:54:04Z

-        "runTime": run_time,
-        "status": status,
+        "submitTime": start_dt.int_timestamp * 1000,  # milliseconds
+        "queueTime": 0,


Suggested change

"queueTime": 0,

"queueTime": queueTime,

kratsg · 2026-05-04T22:56:03Z

-        "status": status,
+        "submitTime": start_dt.int_timestamp * 1000,  # milliseconds
+        "queueTime": 0,
+        "runTime": wall_time,


why is this wall_time and not end_time_utc - start_time_utc? The logic should be inside the parsing.
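
A minimal sketch of what deriving runTime inside the parser could look like. It uses the standard-library `datetime` module rather than arrow so that it runs without extra installs, and it assumes the timestamps are ISO 8601 strings. The helper names `parse_utc` and `run_time_seconds` are hypothetical and not taken from the PR.

```python
from datetime import datetime, timezone


def parse_utc(stamp: str) -> datetime:
    """Parse an ISO 8601 timestamp and normalise it to UTC."""
    # fromisoformat() only accepts a trailing "Z" from Python 3.11 on,
    # so rewrite it as an explicit offset first.
    parsed = datetime.fromisoformat(stamp.replace("Z", "+00:00"))
    if parsed.tzinfo is None:
        # Assumption: a timestamp without an offset is already in UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


def run_time_seconds(start_time_utc: str, end_time_utc: str) -> int:
    """Return end - start in whole seconds."""
    delta = parse_utc(end_time_utc) - parse_utc(start_time_utc)
    return int(delta.total_seconds())


# Example timestamps chosen for illustration only.
print(run_time_seconds("2026-05-04T22:54:04Z", "2026-05-04T22:56:03Z"))  # 119
```

With the subtraction done in the parser, the run scripts only need to record the two timestamps.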

kratsg · 2026-05-04T22:58:30Z

 # shellcheck disable=SC1091
 # shellcheck disable=SC2115
-  source "${ATLAS_LOCAL_ROOT_BASE}"/user/atlasLocalSetup.sh -c el9 -m "${2}" -r "export RUCIO_ACCOUNT=jroblesg && \
+  source "${ATLAS_LOCAL_ROOT_BASE}"/user/atlasLocalSetup.sh -c el9 -m "${2}" -r "export RUCIO_ACCOUNT=qlei && \


Need @qibinlei's VOMS credentials added as secrets to af-benchmarking.

- Replace wall_time_sec with end_time_utc - start_time_utc for runTime
- Use queue_time variable instead of hardcoded 0 for queueTime
- Add benchmark_utils.sh shared shell utilities for benchmark scripts

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

append_benchmark now takes (log_file, start_time, end_time, mode) — wall_time_sec is no longer written to the BENCHMARK block since parsers derive run time from end_time_utc - start_time_utc directly. Removes start_epoch, end_epoch, and wall_time variables from all 21 run scripts and drops the wall_time_sec field from benchmark_utils.sh. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
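
For illustration, scraping such a block on the parser side might look roughly like the sketch below. The on-disk layout of the BENCHMARK block is not shown in this conversation, so the `=== BENCHMARK ===` markers and the `key=value` lines are assumptions, and `parse_block_sketch` is a hypothetical name, not the function in base_parser.py.

```python
import re
from typing import Dict

# Hypothetical log layout: the real block written by append_benchmark may differ.
SAMPLE_LOG = """\
payload output line
=== BENCHMARK ===
start_time_utc=2026-05-04T22:54:04Z
end_time_utc=2026-05-04T22:56:03Z
mode=native
=== END BENCHMARK ===
"""

# Capture everything between the (assumed) opening and closing marker lines.
BLOCK_RE = re.compile(
    r"^=== BENCHMARK ===\n(.*?)^=== END BENCHMARK ===",
    re.DOTALL | re.MULTILINE,
)


def parse_block_sketch(log_text: str) -> Dict[str, str]:
    """Return the key/value pairs of the first BENCHMARK block in a log."""
    match = BLOCK_RE.search(log_text)
    if match is None:
        raise ValueError("no BENCHMARK block found")
    fields: Dict[str, str] = {}
    for line in match.group(1).splitlines():
        key, sep, value = line.partition("=")
        if sep:  # skip lines that are not key=value pairs
            fields[key.strip()] = value.strip()
    return fields


print(parse_block_sketch(SAMPLE_LOG))
```

Because every job type writes the same block, one parser of this shape can stand in for the per-type handlers.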

All job types now use the BENCHMARK block approach via base_parser.py. Remove handlers/ subdirectory and all per-type parsers (evnt, truth3, rucio, coffea, eventloop, fastframes), ParsingClass from base/, and text_utils.py which had no remaining callers. Move base_parser.py to parsing/ root and update ci_parse.py import. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Loads coffea, eventloop, and fastframes photon pT histograms and prints an integral/ratio summary and saves an overlay plot with a ratio panel. Highlights the key difference: coffea/eventloop apply an event-level tightID cut while fastframes fills underflow for events with no tightID photon via sorted ph1_pt1_NOSYS. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Replace uproot + numpy + matplotlib with PyROOT — available on all ATLAS analysis facilities without extra installs. Output default changed to PDF (vector, better for publication). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- benchmark_utils.sh: reorder append_benchmark params so mode is last (new signature: log_file start end [setup_start] [setup_end] [mode]) and read SUBMIT_TIME from env for queue-time tracking via HTCondor
- base_parser.py: parse submit_time_utc (→ submitTime ms, queueTime s) and setup_start/end_time_utc (→ setupTime s) using arrow
- payload.schema.json: add optional setupTime field
- All 21 run scripts (EVNT, TRUTH3, NTuple_Hist, Rucio):
  - Container scripts: capture setup_start before export ATLAS_LOCAL_ROOT_BASE, write SETUP_COMPLETE marker inside -r string after asetup/lsetup, grep it after container exits for setup_end
  - Native scripts: capture setup_start/setup_end around atlasLocalSetup + asetup/lsetup before payload command
  - Rucio: records setupTime=0 (start_time used for both bounds)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
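
A rough sketch of how the timing fields listed above could be derived from the parsed block. It uses standard-library `datetime` instead of arrow so it is dependency-free. Several details are assumptions: that the timestamps are ISO 8601 strings, that queueTime is measured from submit_time_utc to start_time_utc, and that a missing submit time falls back to the start time (giving queueTime 0, as in the earlier hardcoded value). `timing_fields` and its helpers are hypothetical names.

```python
from datetime import datetime, timezone
from typing import Dict


def to_utc(stamp: str) -> datetime:
    """Parse an ISO 8601 timestamp; naive values are assumed to be UTC."""
    parsed = datetime.fromisoformat(stamp.replace("Z", "+00:00"))
    if parsed.tzinfo is None:
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


def seconds_between(earlier: str, later: str) -> int:
    """Whole seconds from `earlier` to `later`."""
    return int((to_utc(later) - to_utc(earlier)).total_seconds())


def timing_fields(block: Dict[str, str]) -> Dict[str, int]:
    """Build submitTime (ms), queueTime (s) and optional setupTime (s)."""
    start = block["start_time_utc"]
    # Assumption: without a recorded submit time, treat the start as the submit.
    submit = block.get("submit_time_utc", start)
    fields = {
        "submitTime": int(to_utc(submit).timestamp()) * 1000,  # milliseconds
        "queueTime": seconds_between(submit, start),
    }
    # setupTime is optional in the schema, so only emit it when both bounds exist.
    if "setup_start_time_utc" in block and "setup_end_time_utc" in block:
        fields["setupTime"] = seconds_between(
            block["setup_start_time_utc"], block["setup_end_time_utc"]
        )
    return fields


example = {
    "submit_time_utc": "2026-05-04T22:50:04Z",
    "setup_start_time_utc": "2026-05-04T22:52:04Z",
    "setup_end_time_utc": "2026-05-04T22:53:34Z",
    "start_time_utc": "2026-05-04T22:54:04Z",
}
print(timing_fields(example))
```

When the same timestamp is used for both setup bounds, as described for the Rucio scripts, this yields setupTime 0.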

Slight clock differences between submit host and worker node can produce negative values from timestamp arithmetic, violating the schema's minimum: 0 constraint. Added max(0, ...) guards for both. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
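
The guard can be sketched as a small helper. This is an illustration of the max(0, ...) idea only; the function name `non_negative_seconds` and the epoch-second inputs are assumptions, not the parser's actual code.

```python
def non_negative_seconds(earlier_epoch: float, later_epoch: float) -> int:
    """Difference in whole seconds, clamped so clock skew never goes below 0."""
    return max(0, int(later_epoch - earlier_epoch))


# Worker clock 2 s behind the submit host: the raw difference would be -2.
print(non_negative_seconds(earlier_epoch=1002.0, later_epoch=1000.0))  # 0
# Normal case: the job started 30 s after submission.
print(non_negative_seconds(earlier_epoch=1000.0, later_epoch=1030.0))  # 30
```

Clamping hides the skew rather than correcting it, which is acceptable here because the goal is only to satisfy the schema's minimum: 0 constraint.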

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The sourced benchmark_utils.sh uses an absolute cluster path that doesn't exist locally, causing false SC1091 failures in shellcheck. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The 11 HTCondor .sub files pointed Executable at /usatlas/u/qlei/dev/af-benchmarking/... while every BNL shell script sourced and referenced /usatlas/u/qlei/AF-Benchmarking/. This mismatch meant HTCondor launched executables from a different checkout than the one the scripts internally relied on. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
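
A mismatch like this can be caught with a small check. The sketch below is hypothetical and not part of the PR: it reads the `Executable` line of an HTCondor submit file and tests whether it lies under an expected checkout root. The paths in the example are made up for illustration.

```python
import re
from pathlib import PurePosixPath

# HTCondor submit commands are case-insensitive "key = value" lines.
EXEC_RE = re.compile(r"^\s*executable\s*=\s*(\S+)", re.IGNORECASE | re.MULTILINE)


def executable_under(sub_text: str, checkout_root: str) -> bool:
    """True if the submit file's Executable lives under checkout_root."""
    match = EXEC_RE.search(sub_text)
    if match is None:
        raise ValueError("no Executable line found")
    try:
        PurePosixPath(match.group(1)).relative_to(checkout_root)
    except ValueError:
        return False
    return True


# Hypothetical submit file and checkout roots.
SUB_TEXT = """\
Universe = vanilla
Executable = /home/user/dev/af-benchmarking/job.sh
Queue
"""
print(executable_under(SUB_TEXT, "/home/user/AF-Benchmarking"))      # False
print(executable_under(SUB_TEXT, "/home/user/dev/af-benchmarking"))  # True
```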

This reverts commit 0e12152.

qibinlei changed the title ~~Feat/use arrow for datetimes~~ feat: use arrow for datetimes and standardize output block May 3, 2026

kratsg reviewed May 4, 2026

Comment thread EVNT/BNL/CentOS7/centos_cron.sh

kratsg reviewed May 4, 2026

Qi Bin Lei and others added 2 commits May 4, 2026 16:01

Parser + Arrow Updates

8acc35d

style: pre-commit fixes

35b246d

kratsg force-pushed the feat/useArrowForDatetimes branch from b543db4 to 35b246d Compare May 4, 2026 23:01

kratsg changed the base branch from feat/useArrowForDatetimes to main May 4, 2026 23:03

kratsg and others added 13 commits May 4, 2026 16:04

bump

be4554d

test: add tests for parse_benchmark_block and parse_atlas_log

ec31ae8

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

style: pre-commit fixes

5ff4539

fix: update BNL benchmark path to AF-Benchmarking

7321344

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

style: pre-commit fixes

bf36ca9

fix: suppress SC1091 shellcheck warnings in BNL batch scripts

242285b

qibinlei requested a review from kratsg May 21, 2026 01:34

Qi Bin Lei and others added 3 commits May 21, 2026 15:28

test: verify PreToolUse check-typos hook fires on commit

0e12152

Revert "test: verify PreToolUse check-typos hook fires on commit"

de4ca61

qibinlei wants to merge 18 commits into usatlas:main from qibinlei:feat/useArrowForDatetimes
