joesposito8

Follow

Joey Esposito joesposito8

Follow

AI @ LinkedIn | Safety Evaluation Engineer | Inspect Contributor

2 followers · 7 following

San Francisco, USA
https://www.linkedin.com/in/joseph-esposito8
in/joseph-esposito8

Achievements

Achievements

joesposito8/README.md

Joey Esposito

Trying to bring good engineering practices to AI Safety Evaluations :)

Current focus

Measuring Prefill Awareness in transcript-based evals. Building an Inspect-based eval to audit existing benchmarks for Prefill Awareness as a confounding factor: prefill-awareness-audit.
Eval tooling and methodology contributions to UK AISI's Inspect AI and Inspect Evals.

Selected upstream contributions

Inspect AI Ecosystem:

inspect_ai#3709 — Add vLLM chat template controls for base-model evals.
inspect_ai#3969 - Add pass_k reducer metric for reliability in agentic evals.
inspect_evals#1503 — Fix mean_of on_missing="skip" to also skip None-valued samples.
inspect_evals#1501 — cyberseceval_4: tolerate fenced and prose-wrapped judge JSON.
inspect_evals#1429 — Fix CodeIPI exfiltration scorer to check tool result messages.
inspect_scout#455 - Fix events_data parser to fit new Petri eval schema.

Contact

Email: joesposito8@gmail.com
LinkedIn: joseph-esposito8

Pinned Loading

prefill-awareness-audit prefill-awareness-audit Public

Reusable audit scaffold for detecting prefill awareness confounds in transcript-based AI evals

Python