added RFC on how to create a living knowledge base of owasp things#734
added RFC on how to create a living knowledge base of owasp things#734northdpole wants to merge 1 commit intomainfrom
Conversation
|
@northdpole I've gone through the RFC and it gives a clear architectural and experimental framework to build the proposal around. I'll spend some time digesting it in detail and start aligning my work proposal with this design and the pre-code experiments outlined here. |
|
Thanks for putting this together Sir, the experimental framework is really clear. I’m particularly interested in Module C (The Librarian) and want to start with the suggested pre-code experiments before proposing any concrete design or implementation. The negation problem stands out — I’ve worked on gap analysis features before (#716) and have seen how basic similarity metrics can struggle with logical inversions in requirements (e.g., “Use X” vs “Do NOT use X”). Plan:
If the experiment is successful, I’m also interested in exploring hybrid search (vector + BM25), especially for cases like CVE identifiers where pure vector search often underperforms. I'll take this up step by step . I’ll share experiment results and observations before proposing any implementation. I’m using AI tools (similar to Cursor/Windsurf) and have read Section 3. Thank you . |
|
Hi @northdpole , Thanks for putting together this RFC — the structure, pre-code experiments, and CI-first mindset are exactly the kind of system I enjoy working on. I’d like to formally express my interest in owning Module B: Noise / Relevance Filter as my primary contribution, and I’m also happy to assist with adjacent modules where needed. So Why Module B The framing of Module B as a cheap, high-signal gate before expensive downstream processing resonates strongly with me. Getting this layer right feels critical to the quality, cost, and trustworthiness of the entire pipeline, especially given the planned regression dataset and CI enforcement. Proposed Plan of Action (Aligned with the RFC)
And Cross-Module Contributions While Module B would be my ownership area, I can also help with: I’ve read and understood Section 3 (Agent-Ready CI & AI-generated PR constraints) and I’m comfortable working within those boundaries. Looking forward to collaborating — this project feels like a rare opportunity to build something both technically rigorous and genuinely useful. Best, |
No description provided.