refactor: harden generative compilation pipeline by nxank4 · Pull Request #71 · codepawl/loclean

nxank4 · 2026-02-27T18:04:05Z

Summary

Hardens the LLM code-generation pipeline with better sandboxing, error handling, and developer experience.

Changes

| File | Change |
| --- | --- |
| `source_sanitizer.py` [NEW] | Strips markdown fences, prose, and unicode operators from LLM output (17 tests) |
| `sandbox.py` | Restricted `__import__`: only explicitly allowed modules can be imported |
| `feature_discovery.py`, `shredder.py` | Hardened retry loops: catch `ValueError` from compile, log each retry, and raise actionable error messages with model suggestions |
| `model_manager.py` | In-process cache for verified models (skips redundant Ollama API calls); fixes dict/object API response handling |
| `LIBRARY_SUMMARY.md` [DELETE] | Superseded by module docstrings and `examples/README.md` |
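
To make the sanitizer row concrete, here is a minimal sketch of that kind of cleanup. It is not the code in `source_sanitizer.py`: the function name `sanitize_source`, the operator table, and the "keep only the first fenced block" rule are assumptions made for illustration.

```python
import re

# Built indirectly so this example can itself live inside a fenced block.
FENCE = "`" * 3
_FENCE_RE = re.compile(FENCE + r"[\w+-]*[ \t]*\n(.*?)" + FENCE, re.DOTALL)

# Hypothetical mapping; the real module may cover a different set of characters.
_UNICODE_OPERATORS = {
    "\u2212": "-",   # minus sign
    "\u00d7": "*",   # multiplication sign
    "\u00f7": "/",   # division sign
    "\u2264": "<=",  # less-than or equal to
    "\u2265": ">=",  # greater-than or equal to
    "\u2260": "!=",  # not equal to
}


def sanitize_source(raw: str) -> str:
    """Return the code from the first fenced block (or the raw text) with ASCII operators."""
    match = _FENCE_RE.search(raw)
    # Anything outside the first fenced block is treated as prose and dropped.
    code = match.group(1) if match else raw
    for bad, good in _UNICODE_OPERATORS.items():
        code = code.replace(bad, good)
    return code.strip()
```

A plain `str.replace` also rewrites these characters inside string literals, which a production sanitizer may want to avoid.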

Design Decisions

- Sandbox import restriction prevents LLM-generated code from importing arbitrary modules (e.g. `subprocess`, `os`). Only modules in the `allowed_modules` list are permitted.
- Actionable error messages guide users towards solutions (larger model, more retries) instead of opaque "code generation failed" errors.
- Model verification cache eliminates repeated `ollama list` calls within the same process, which is useful for batch pipelines.
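
The sandbox import restriction can be sketched roughly as follows. This is an illustration only, not the code in `sandbox.py`: `make_restricted_import` and `compile_restricted` are hypothetical names, the builtins whitelist is arbitrary, and `allowed_modules` is assumed to hold top-level module names.

```python
import builtins
import importlib
from typing import Any, Dict, Iterable


def make_restricted_import(allowed_modules: Iterable[str]):
    """Build an __import__ replacement that only admits whitelisted modules."""
    allowed = frozenset(allowed_modules)

    def restricted_import(name, globals=None, locals=None, fromlist=(), level=0):
        # Check the top-level package so "os.path" is judged by "os".
        root = name.split(".")[0]
        if level != 0 or root not in allowed:
            raise ImportError(f"import of {name!r} is not allowed in the sandbox")
        return builtins.__import__(name, globals, locals, fromlist, level)

    return restricted_import


def compile_restricted(source: str, allowed_modules: Iterable[str]) -> Dict[str, Any]:
    """Execute source with a reduced builtins dict and return its namespace."""
    allowed = list(allowed_modules)
    safe_builtins = {
        "__import__": make_restricted_import(allowed),
        "abs": abs,
        "len": len,
        "range": range,
    }
    safe_globals: Dict[str, Any] = {"__builtins__": safe_builtins}
    # Preload allowed modules so generated code can use them without importing.
    for mod in allowed:
        safe_globals[mod] = importlib.import_module(mod)
    exec(compile(source, "<sandbox>", "exec"), safe_globals)
    return safe_globals
```

With `allowed_modules=["math"]`, `import math` succeeds while `import subprocess` raises `ImportError`. Restricting `__import__` narrows what generated code can reach, but it is not a complete security boundary on its own.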

Tests

42 tests across test_source_sanitizer.py, test_sandbox.py, test_model_manager.py.

uv run pytest tests/unit/utils/ tests/unit/inference/ -v --no-cov

…eAuditor
- TrapPruner: statistical profiling + LLM verification of Gaussian noise columns
- MissingnessRecognizer: MNAR pattern detection with sandbox-compiled encoders
- TargetLeakageAuditor: semantic timeline evaluation for target leakage

- Wrap initial compile in try/except to catch ValueErrors
- Add per-retry logging with attempt counter
- Replace vague failure messages with actionable guidance (model suggestions, max_retries hint)
- Add concrete code examples to LLM prompts for better output
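
The retry hardening described in this commit might look something like the sketch below. The function `generate_with_retries` and its `generate` / `compile_fn` callables are hypothetical stand-ins, not the actual code in `feature_discovery.py` or `shredder.py`; only the `ValueError` handling, the per-attempt logging, and the `max_retries` hint come from the description above.

```python
import logging
from typing import Any, Callable, Optional

logger = logging.getLogger(__name__)


def generate_with_retries(
    generate: Callable[[Optional[Exception]], str],  # hypothetical: asks the LLM for source
    compile_fn: Callable[[str], Any],                # hypothetical: raises ValueError on bad source
    max_retries: int = 3,
    model: str = "<model>",
) -> Any:
    """Try to generate and compile code, feeding the last error back to the generator."""
    last_error: Optional[Exception] = None
    for attempt in range(1, max_retries + 1):
        source = generate(last_error)
        try:
            return compile_fn(source)
        except ValueError as exc:
            last_error = exc
            logger.warning(
                "Attempt %d/%d failed to compile: %s", attempt, max_retries, exc
            )
    # Tell the user what to try next instead of a bare "failed" message.
    raise RuntimeError(
        f"Code generation failed after {max_retries} attempts with model {model!r} "
        f"(last error: {last_error}). Try a larger model or increase max_retries."
    )
```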

devactivity-app · 2026-03-03T09:08:16Z

Pull Request Summary by devActivity

Metrics

Achievements

@nxank4
Earned XP: 15⭐

nxank4 added 8 commits February 27, 2026 17:43

feat(api): wire prune_traps, recognize_missingness, audit_leakage int…

f5bd6f9

…o public API
- Add all three to extraction/__init__.py lazy imports
- Add Loclean class methods + module-level convenience functions
- Update __all__ in loclean/__init__.py

test(extraction): add unit tests for TrapPruner, MissingnessRecognize…

67c5e49

…r, TargetLeakageAuditor
- 13 tests each (39 total) covering profiling, prompt construction, verdict parsing, verification, and mock-LLM integration

feat(utils): add source_sanitizer for LLM output cleanup

8c235f3

- Strip markdown fences, prose, and backticks
- Fix unicode operators and invalid numeric literals
- 17 unit tests covering all transformation stages

refactor(sandbox): add restricted __import__ to compile_sandboxed

6cad8a7

- LLM-generated import statements now only work for explicitly allowed modules; all others raise ImportError
- Preload modules into safe_globals for direct namespace access
- Updated docstring to document the restriction

perf(inference): cache verified models to skip redundant Ollama checks

d44cf11

- Add module-level _verified_models set for deduplication
- Fix model_exists to handle both dict and object API responses
- Use model attribute (not name) for correct Ollama registry matching

chore: remove LIBRARY_SUMMARY.md

888298a

Superseded by examples/README.md and module docstrings.

nxank4 merged commit ef10a26 into main Feb 27, 2026
21 checks passed

nxank4 deleted the refactor/generative-compilation-hardening branch February 27, 2026 18:31
