Review: Condensed matter generation task (Jeffrey)#44
Review: Condensed matter generation task (Jeffrey)#44doncamilom wants to merge 44 commits intomainfrom
Conversation
Paper-to-Code Mapping UpdateThis task is described in the paper as CMG (Conditional Material Generation). Paper claims for CMG:
What this branch provides:
What is MISSING to back the paper's claims:
Priority for reviewer reproducibility: HIGHThis task is a headline result in the paper (Table 4). Without the dataset, evaluation script, and a working task implementation in main, the CMG results cannot be reproduced. |
Final ClassificationPaper task: CMG (Conditional Material Generation) VerdictThis PR provides one of two competing CMG implementations (the other is in PR #33). It must be determined which implementation produced the paper results. Once resolved, the chosen implementation needs: bug fixes (global stop token, global print rate change), dataset release, evaluation harness, tests, and documentation. The other implementation should be closed. Work needed for peer review
|
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
0fbadd7 to
9d6efd2
Compare
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Reproducibility Assessment —
condmatgen-jeffreyAuthor: Jeffrey | Commits: 41 | Files changed: 11 | +395 / -10 lines
What this contributes
A new GRPO task — Conditional Material Generation (
condmatgen) — where the model proposes a novel crystalline compound (element list + space group) given a set of chemical elements.New files:
src/open_r1/tasks/condmatgen/condmatgen.py(325 lines) —ConditionalMaterialGenerationtask classsrc/open_r1/tasks/condmatgen/comps_used_in_sft.json— placeholder for seen-compositions dedup (currently empty[])recipes/condmatgen.yaml— GRPO training recipeReward logic (multi-signal, ~170 lines):
<think>/<answer>tag presence and ordering<think>content < 500 charssmact.screening.smact_validity()viapymatgenself.seen_comps_set)Breaks / Blockers
"stop": ["</answer>"]added toutils.pySamplingParams — affects ALL tasks, not just condmatgen. Could truncate other tasks' generations.base.pyrandom_printrate changed from 0.01 → 0.1 — 10x more debug output for ALL taskslaunch_CSCS.slurmoverwritten with Jeffrey-specific paths (a131account, personal dir)model_paths.txtadds entries pointing to personal CSCS storageReproducibility Gaps
condmatgen.pyline 57:/capstor/store/cscs/swissai/a131/jmeng/sink/...; recipe:/capstor/.../a131/jmeng/sink/src/open_r1/dataset/; SLURM script: 3 occurrencesNatureLM_conditional_v2.jsonis not in repo, not on HuggingFace, no download script or instructionssmactandpymatgenimported but not insetup.pyor any requirements file__init__.pyincondmatgen/directorycomps_used_in_sft.jsonis empty.rstfile indocs/source/tasks/requests,Optional,rdkit.Chem,pd(pandas)What's needed for reviewer reproducibility
/capstor/...paths with${MIST_DATA_DIR}oros.path.dirname(__file__)"stop"token change inutils.py— make it task-specificrandom_printrate change inbase.pylaunch_CSCS.slurmandmodel_paths.txtsmactandpymatgento project dependenciesdocs/source/tasks/condmatgen.rstaccuracy_reward🤖 Generated with Claude Code