Skip to content

feat(skillopt 0.4.0): background train + harder example dataset (SKILLOPT.easy §8)#419

Merged
dancinlife merged 1 commit into
mainfrom
feat/skillopt-bg-and-harder
Jun 6, 2026
Merged

feat(skillopt 0.4.0): background train + harder example dataset (SKILLOPT.easy §8)#419
dancinlife merged 1 commit into
mainfrom
feat/skillopt-bg-and-harder

Conversation

@dancinlife

Copy link
Copy Markdown
Contributor

Implements the two "더 다듬을 거리" from domains/SKILLOPT.easy.md §8.

  • 🌙 background train — /skillopt train --bg detaches via nohup (log under
    ~/.sidecar/skillopt/train-.log) and returns immediately; /skillopt status
    shows running-state + score/step progress, /skillopt log tails the latest log.
  • 🎯 harder example — examples/toyqa swapped to 6 format-sensitive QA items
    (chemical symbols, ISO codes, rounding, past-tense) that an EMPTY skill answers
    in a full sentence → STRICT exact-match fails → a real learning gradient (the
    optimizer learns a "reply with only the value" rule, then the held-out gate
    rises). train_size 6 · batch 3 · sel 5 for stronger signal.

Validated: skill.sh bash -n ✅ · dataloader/config parse ✅ · status/help smoke ✅
· new dataset loads (6 train / 5 eval). g22 lockstep 0.4.0 + CHANGELOG.

…LOPT.easy §8)

Implements the two "더 다듬을 거리" from domains/SKILLOPT.easy.md §8.

- 🌙 background train — `/skillopt train --bg` detaches via nohup (log under
  ~/.sidecar/skillopt/train-<ts>.log) and returns immediately; `/skillopt status`
  shows running-state + score/step progress, `/skillopt log` tails the latest log.
- 🎯 harder example — examples/toyqa swapped to 6 format-sensitive QA items
  (chemical symbols, ISO codes, rounding, past-tense) that an EMPTY skill answers
  in a full sentence → STRICT exact-match fails → a real learning gradient (the
  optimizer learns a "reply with only the value" rule, then the held-out gate
  rises). train_size 6 · batch 3 · sel 5 for stronger signal.

Validated: skill.sh `bash -n` ✅ · dataloader/config parse ✅ · status/help smoke ✅
· new dataset loads (6 train / 5 eval). g22 lockstep 0.4.0 + CHANGELOG.
@dancinlife dancinlife merged commit 1bd8bd7 into main Jun 6, 2026
@dancinlife dancinlife deleted the feat/skillopt-bg-and-harder branch June 6, 2026 20:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant