SchoolMarm

Production-grade GBNF grammar-constrained decoding for LLMs.

Zero dependencies. No unsafe code. Pure Rust.

What It Does

Given a GBNF grammar string and a vocabulary of token strings, this crate produces bitmasks of allowed tokens at each autoregressive generation step. This constrains a language model to only generate output that matches the grammar — valid JSON, valid code, structured data, or any other formally-defined format.

This implementation is derived from the battle-tested grammar engine in llama.cpp, ensuring full GBNF compatibility while providing a safe, idiomatic Rust API.

Usage

use schoolmarm::{Grammar, GrammarState};

// Parse a grammar
let grammar = Grammar::new(r#"root ::= "hello" | "world""#).unwrap();

// Create runtime state
let mut state = GrammarState::new(grammar).unwrap();

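// Get allowed tokens for your vocabulary
let vocab = vec!["hello", "world", "foo", "hel"];
let allowed = state.allowed_tokens(&vocab);
// allowed = [true, true, false, true]
// "hel" is allowed because it is a valid prefix of "hello";
// "foo" cannot start any match, so it is masked out.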

// Accept a token and advance state
state.accept_token("hello").unwrap();
assert!(state.is_accepting());
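
To make the mask in the example above concrete: a token is allowed when all of its characters can be consumed from the current grammar position, whether or not that completes a match. The following is a toy model of that rule for a grammar that is nothing more than an alternation of literals. `allowed_for_literals` is a hypothetical helper written for this illustration, not part of the schoolmarm API, and the real engine handles full GBNF rather than plain string prefixes.

```rust
/// Toy model: the grammar is a set of literal alternatives and `consumed`
/// is the text accepted so far. A token is allowed if appending it keeps
/// the output a prefix of at least one alternative.
fn allowed_for_literals(alternatives: &[&str], consumed: &str, vocab: &[&str]) -> Vec<bool> {
    vocab
        .iter()
        .map(|token| {
            // An empty token would never advance the grammar, so reject it.
            if token.is_empty() {
                return false;
            }
            // Text that would have been generated if this token were chosen.
            let candidate = format!("{}{}", consumed, token);
            alternatives
                .iter()
                .any(|alt| alt.starts_with(candidate.as_str()))
        })
        .collect()
}
```

Called with the alternatives `["hello", "world"]`, an empty `consumed` string and the vocabulary from the example, this yields the same mask as shown above. After "hel" has been consumed, "lo" and "l" remain allowed while "hello" does not.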

Integration With Inference Engines

The typical integration loop:

use schoolmarm::{Grammar, GrammarState};

fn generate(grammar_str: &str, vocab: &[&str]) -> Vec<usize> {
    let grammar = Grammar::new(grammar_str).unwrap();
    let mut state = GrammarState::new(grammar).unwrap();
    let mut tokens = Vec::new();

    loop {
        // 1. Get allowed token mask
        let allowed = state.allowed_tokens(vocab);

        // 2. Apply mask to logits (set disallowed to -inf)
        // ... your inference engine's logit masking here ...

        // 3. Sample from masked logits
        let token_id = 0; // placeholder: your sampling logic
        if !allowed[token_id] { break; }

        // 4. Accept the token and advance grammar state
        state.accept_token(vocab[token_id]).unwrap();
        tokens.push(token_id);

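        // 5. Check if grammar is complete.
        // Note: this stops at the first accepting state. A grammar such as
        // [a-z]+ can be accepting and still continue, so a real engine would
        // typically let the model choose its end-of-sequence token here.
        if state.is_accepting() {
            break;
        }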
    }
    tokens
}
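
Steps 2 and 3 of the loop above are left as placeholders. A minimal sketch of how they might look, assuming the logits arrive as a slice of `f32` and the mask as a slice of `bool` of the same length: `apply_mask` and `argmax` are hypothetical helpers, not part of this crate, and greedy selection stands in for whatever sampler your engine uses.

```rust
/// Set the logit of every disallowed token to negative infinity so that it
/// can never be selected. `logits` and `allowed` must have the same length.
fn apply_mask(logits: &mut [f32], allowed: &[bool]) {
    assert_eq!(logits.len(), allowed.len(), "mask and logits must align");
    for (logit, &ok) in logits.iter_mut().zip(allowed) {
        if !ok {
            *logit = f32::NEG_INFINITY;
        }
    }
}

/// Greedy selection: return the index of the largest finite logit, or
/// `None` if every token was masked out (the grammar is stuck).
fn argmax(logits: &[f32]) -> Option<usize> {
    logits
        .iter()
        .enumerate()
        .filter(|(_, l)| l.is_finite())
        .max_by(|a, b| a.1.total_cmp(b.1))
        .map(|(i, _)| i)
}
```

Returning `Option<usize>` makes the "no token is allowed" case explicit, so the caller can stop generation instead of indexing into the vocabulary with a meaningless id.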

GBNF Format

GBNF (GGML BNF) supports:

Literals: "hello"
Character ranges: [a-z], [^0-9], [abc]
Any character: .
Rule references: rulename
Alternation: |
Grouping: ( ... )
Repetition: *, +, ?, {n}, {n,m}, {n,}
Hex and Unicode escapes: \xNN, \uNNNN, \UNNNNNNNN
Comments: # ...

Example JSON grammar:

root   ::= object
value  ::= object | array | string | number | ("true" | "false" | "null") ws
object ::= "{" ws (string ":" ws value ("," ws string ":" ws value)*)? "}" ws
array  ::= "[" ws (value ("," ws value)*)? "]" ws
string ::= "\"" ([^"\\\x7F\x00-\x1F] | "\\" (["\\bfnrt] | "u" [0-9a-fA-F]{4}))* "\"" ws
number ::= ("-"? ([0-9] | [1-9] [0-9]*)) ("." [0-9]+)? ([eE] [-+]? [0-9]+)? ws
ws     ::= | " " | "\n" [ \t]*

Origin

Ported from llama.cpp's src/llama-grammar.cpp and src/llama-grammar.h (MIT License). This is a Rust port that follows the same algorithms: a recursive-descent GBNF parser, a nondeterministic pushdown-stack state machine, and character-level token matching.

Fixes applied over the C++ original:

Recursion depth limit in advance_stack (prevents stack overflow on pathological grammars)
Bounds-checked rule index validation (prevents buffer overflows)
All errors returned as Result<T, GrammarError> (no panics, no asserts)
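
The fixes listed above can be illustrated with a toy version of rule expansion. This is a sketch under stated assumptions, not the crate's code: `Elem`, `ToyError`, `Rules`, `MAX_DEPTH` and the signature of this `advance_stack` are invented for the example, and the limit of 64 is arbitrary.

```rust
/// One grammar element: a literal character or a reference to another rule.
#[derive(Debug, Clone, PartialEq)]
enum Elem {
    Char(char),
    RuleRef(usize),
}

/// Hypothetical error type for this sketch.
#[derive(Debug, PartialEq)]
enum ToyError {
    RuleOutOfBounds(usize),
    DepthExceeded,
}

/// Arbitrary recursion limit chosen for the illustration.
const MAX_DEPTH: usize = 64;

/// `rules[r]` holds the alternatives of rule `r`; each alternative is a
/// sequence of elements.
type Rules = Vec<Vec<Vec<Elem>>>;

/// Expand rule references on top of `stack` (last element = top) until each
/// resulting stack is empty or has a character on top. Results go to `out`.
fn advance_stack(
    rules: &Rules,
    stack: Vec<Elem>,
    depth: usize,
    out: &mut Vec<Vec<Elem>>,
) -> Result<(), ToyError> {
    // Depth limit: left-recursive grammars would otherwise recurse forever.
    if depth > MAX_DEPTH {
        return Err(ToyError::DepthExceeded);
    }
    let top = stack.last().cloned();
    match top {
        // Nothing left to expand: this stack is ready to match a character.
        None | Some(Elem::Char(_)) => {
            out.push(stack);
            Ok(())
        }
        Some(Elem::RuleRef(id)) => {
            // Bounds-checked lookup instead of unchecked indexing.
            let alternatives = rules.get(id).ok_or(ToyError::RuleOutOfBounds(id))?;
            for alt in alternatives {
                // Replace the reference with the alternative's elements,
                // pushed in reverse so the first element ends up on top.
                let mut next = stack[..stack.len() - 1].to_vec();
                next.extend(alt.iter().rev().cloned());
                advance_stack(rules, next, depth + 1, out)?;
            }
            Ok(())
        }
    }
}
```

With this shape, a left-recursive rule such as `root ::= root "a"` yields `Err(ToyError::DepthExceeded)` and a reference to a missing rule yields `Err(ToyError::RuleOutOfBounds(..))`, so both failures reach the caller as values rather than as a crash.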

License

MIT
