Skip to content
View 2betforyou's full-sized avatar

Block or report 2betforyou

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
2betforyou/README.md

Junyoung Park

Typing SVG

LLM Safety Evaluation Failure Observability Applied AI Systems

I study LLM safety and evaluation, with a focus on process-level diagnosis of instruction-following failures and refusal dynamics.

I am an undergraduate student at Chung-Ang University, pursuing Art and Technology with Cyber Security as a convergence major. My work asks when, where, and how LLM failures form before they appear in final outputs. I like projects that turn research questions into reproducible evaluation pipelines, diagnostic signals, and working systems.

Focus Areas

  • LLM safety and evaluation
  • Logit dynamics, refusal/compliance margins, and early-token behavior
  • Jailbreak and instruction-following failure analysis
  • Benchmark design, LLM-as-Judge evaluation, and automated experiment pipelines
  • Applied AI systems, privacy-aware ML, and retrieval-based reasoning

Selected Projects

Project Description
Logit-Margin_Score Temporal analysis of LLM safety activation via logit-margin scores.
Persona_Attack Incremental memory injection jailbreak attack experiments against LLMs.
GraphRAG Graph-based retrieval framework for financial-security regulation interpretation.
FinSec-LLM-PostTraining RAG and QLoRA post-training pipeline for Korean financial-security and regulatory QA.
SafeAI_FInal Machine unlearning experiment on privacy, fairness, and eye-coordinate regression.
AutoValetParking Centralized autonomous valet parking simulation with reservation-based path planning.

Working Style

  • I prefer benchmark design and reproducible workflows over one-off results.
  • I keep model weights, private datasets, generated artifacts, and credentials out of public repositories.
  • I document assumptions around data, evaluation conditions, metrics, and compute constraints.

GitHub Snapshot

GitHub stats Top languages

Links

Popular repositories Loading

  1. Logit-Margin_Score Logit-Margin_Score Public

    Temporal analysis of LLM safety activation via logit-margin scores.

    Python 1 1

  2. 2betforyou.github.io 2betforyou.github.io Public

    Personal academic portfolio website built with GitHub Pages.

    SCSS

  3. SafeAI_Proposal SafeAI_Proposal Public

    Eye-detection pipeline for privacy-preserving face anonymization

    Jupyter Notebook

  4. SafeAI_FInal SafeAI_FInal Public

    Machine unlearning experiment on privacy, fairness, and eye-coordinate regression.

    Jupyter Notebook

  5. Persona_Attack Persona_Attack Public

    Incremental memory injection jailbreak attack experiments against LLMs

    Python

  6. AutoValetParking AutoValetParking Public

    Centralized autonomous valet parking simulation with reservation-based path planning

    Python