Blue3 port by bathan1 · Pull Request #12 · JHU-PL-Lab/caprice-lang

bathan1 · 2026-04-23T21:05:20Z

Features added

Dedicated solver loop for handling easy formulas:

solve.ml

Integer Difference Logic solver:

integer.ml

Boolean text parser

boolean.ml

Overview

I want to merge in my toy SMT solver ("Blue3") into the concolic evaluator so it can attempt to fast-solve certain formulas. The IDL solver and Boolean text parser are more-or-less ported over 1-to-1. The primary change from the port is writing out a basic DPLL (T) solver loop.

Benchmarks

Here are the benchmark results you can find by inputting analysis.sql into an SQLite database
after running the benchmark script:

What were the average runtimes from both solvers?

avg_blue3	avg_z3
292.0μs	304.0μs

Rows with the MIN times from both solvers

trial_num	formula_id	formula	time_us_blue3	time_us_z3	which_min
0	64	(0 < a) ^ (a < 0)	3.814697	112.056732	blue3_min
2	30	(6 <= a) ^ (a < 0)	3.814697	122.070312	blue3_min
3	36	(2 < a) ^ (a < 0)	3.814697	133.037567	blue3_min
3	89	(1 <= a) ^ (0 < a) ^ (not ((a % 1) = 0))	113.010406	69.856644	z3_min
4	30	(6 <= a) ^ (a < 0)	3.814697	111.103058	blue3_min

Rows with the MAX times from both solvers

trial_num	formula_id	formula	time_us_blue3	time_us_z3	which_min
0	108	(c <= (b % a)) ^ (c <= a) ^ (0 < c) ^ (0 < a) ^ (0 < b) ^ (n	8136.034012	3366.947174	blue3_min
		ot ((b % a) = 0)) ^ (not (c = 0)) ^ (not (a = 0)) ^ (((b * a
		) / c) < b)

How much faster were the fast cases on average?

num_fast_cases	avg_faster_by	avg_percent_faster
345	110.83μs	49.6%

How much slower were the slow cases on average?

num_slow_cases	avg_slower_by	avg_percent_slower
548	50.25μs	9.81%

What was the max time difference blue3 beat z3 by?

max_diff
836.0μs

What was the max time difference z3 beat blue3 by?

max_diff
4769.0μs

bathan1 · 2026-05-11T22:27:12Z

Bellman Ford

The Bellman Ford algorithm finds the shortest distance paths from a particular node to all others in a directed graph.

Suppose we have a graph with the following edges:

let edges = 
  [ ('a', '0', 3)
  ; ('0', 'a', -1)
  ; ('z', '0', 5)
  ; ('z', 'a', 5)
  ]

It looks something like this:

graph LR
  na["a"] -->|"3"| n0["0"]
  n0 -->|"-1"| na["a"]
  nz["z"] -->|"5"| n0["0"]
  nz -->|"5"| na["a"]

If we wanted to find the shortest distance paths from z to every other node,wWe can run bellman ford against the edge list and it will tell us:

print_bellman_ford ~label:"OK cycle" ~src:'z' edges

No negative cycle found.
dist(z) = 0
dist(a) = 4
dist(0) = 5

The shortest distance path from z to 0 is just the direct edge z -> 0 with weight 5.

The shortest distance path from z to a is 4, because we can go from z to 0 for cost 5, then from 0 to a with cost -1 to give us a shortest distance of 4.

If we jumped from a back to 0 for a cost of 3, our distance would go from 4 to 7. So even though we can go back and forth from a to 0, it will just add cost to our path to a to cycle back to 0. This means there is no negative cycle.

But if we changed the edge from a -> 0 to have cost -6...

let edges =
  [ ('a', '0', -6)
  ; ('0', 'a', -1)
  ; ('z', '0', 5)
  ; ('z', 'a', 5)
  ]
in
print_bellman_ford ~label:"Negative Cycle" ~src:'z' edges;

graph LR
  na["a"] -->|"-6"| n0["0"]
  n0 -->|"-1"| na["a"]
  nz["z"] -->|"5"| n0["0"]
  nz -->|"5"| na["a"]

...then Bellman Ford will tell us:

Example: [Negative Cycle]
Negative cycle found:
- 0 -> a (-1)
- a -> 0 (-6)

After going from z to 0 for cost 5, going to 0 from a costs us -1, which leads us to total cost of 4. And now going from a back to 0 would cost us -6 weight for a total sum of -2, which is less than our previous path to a. We can do this as many times as we want and will end up with lower and lower weights.

So there is no shortest path from z to a, because for any shortest-path P we find for it, we can find a shorter path P' by circling over a and 0. And as a consequence, we have no shortest path from z to any other node, because we can loop over the a and 0 edges once more for any other claimed shortest path and get a lower distance path. Thus, we have a negative cycle.

Core Relaxation Loop

Bellman Ford is able to tell us all this about our graphs rather elegantly. The main idea is to iterate over the edges and "relax" the distance state at each iteration. At each iteration at some edge (from, to, cost), we lower the distance state of the to node based on our current distance to the from node with our current distance to the to node:

let relax_edge (dist : tbl) (was_updated : bool) (edge : Node.t edge) : bool =
  let from_, to_, cost = edge in
  match Hashtbl.find dist from_, Hashtbl.find dist to_ with
  ...

If our current distance to from plus cost is less than our current shortest distance to the to node (dist(from) + cost < dist(to)), then we update our shortest distance to to with that sum.

Our distance table implementation uses an option to represent the distance state, with None being the initial "Infinity" state and Some dist being a concrete distance sum from the graph. In either case, we are effectively finding that dist(from) + cost < dist(to), with the Some case being the case where we explicitly compare the sum

let relax_edge (dist : tbl) (was_updated : bool) (edge : Node.t edge) : bool =
  let from_, to_, cost = edge in
  match Hashtbl.find tbl from_, Hashtbl.find tbl to_ with
  | (Some du, _), (None, _) -> ...
  | (Some du, _), (Some dv, _) when du + cost < dv -> ...

For a graph with NUM_NODES nodes, we run the above edges iteration a max of NUM_NODES - 1 times:

let relax_edges (edges : Node.t edge list) (dist : tbl) (i : int)
  : [ `Continue of tbl | `Stop of tbl ] =
  let num_nodes = Hashtbl.length dist in
  if i >= num_nodes - 1 then `Stop dist
  ...

A common optimization is to early return when no distances are updated in some iteration. We can implement this using a fold_until loop where we only Continue the next relaxation iteration when at least one distance has been updated. We are able to track this state by having our relax_edge function return whether it updated the distance table.

If we relax at least one distance at any point during the edges iteration, we return true by or-ing the previous boolean value with a true:

let relax_edge (dist : tbl) (was_updated : bool) (edge : Node.t edge) : bool =
  ...
  match Hashtbl.find dist from_, Hashtbl.find dist to_ with
  | (Some du, _), (None, _) ->
    ...
    was_updated || true
  | (Some du, _), (Some dv, _) when du + cost < dv ->
    ...
    was_updated || true

Otherwise we just OR the boolean state value with a false:

  let relax_edge (dist : tbl) (was_updated : bool) (edge : Node.t edge) : bool =
    let from_, to_, cost = edge in
    match Hashtbl.find dist from_, Hashtbl.find dist to_ with
    ...
    | _ -> was_updated || false

Notice how once we return a true flag, subsequent calls to relax_distance will always return true because even if a subsequent call doesn't relax a distance, or-ing a false with a true is still true:

So the parent relax_edges just checks that boolean flag at the end of each iteration to decide whether it should continue or stop.

let relax_edges (edges : Node.t edge list) (dist : tbl) (i : int)
  ...
  else
    let is_dist_updated = List.fold_left (relax_edge dist) false edges
    in
    if is_dist_updated then `Continue dist
    else `Stop dist

Then building the final distance table state is just a matter of initializing the table and then iterating over the nodes:

let find_shortest_paths ~(src : Node.t) (edges : Node.t edge list) =
  let dist = create_tbl ~src edges in
  let num_nodes = Hashtbl.length dist in
  let vertices = List.init num_nodes Fun.id in
  let final_tbl = List_utils.fold_until
    (relax_edges edges)
    Fun.id
    dist
    vertices
  in
  final_tbl

Where create_tbl explicitly initializes each node table entry and sets the distance from src to itself to 0 so we can advance the distance table state in the initial iteration:

let create_tbl ~(src : Node.t) (edges : Node.t edge list) =
  let bindings =
    edges
    |> to_node_list
    |> List.map (fun node -> node, (None, None))
    |> List.to_seq
  in
  let tbl = Hashtbl.of_seq bindings in
  let () =
    Hashtbl.replace tbl src (Some 0, None)
  in
  tbl

Predecessors and the Minimum Distance Path

You can think of find_shortest_paths as the "raw" Bellman Ford implementation that returns the final distance table regardless of whether a negative cycle exists. When there is no negative cycle, then the table is effectively our return value of the bellman_ford implementation:

let bellman_ford
  (type node)
  (module Node : Baby.OrderedType with type t = node)
 ~(src : node)
  (edges : node edge list)
  : [ `No_negative_cycle of (node * int) list
    | `Negative_cycle of node edge list
    ] =
  let open Make (Node) in
  let tbl = find_shortest_paths ~src edges in
  match find_cycle_entry_opt edges tbl with
  | None -> `No_negative_cycle (
    tbl
    |> Hashtbl.to_seq_keys
    |> Seq.map (fun node -> node, find_distance node tbl)
    |> List.of_seq
  )

The only change we make to the bindings is picking out the first tuple element dist from the values with find_distance:

let find_distance (node : Node.t) (dist : tbl) : int =
  match fst @@ Hashtbl.find dist node with
  | None -> Int.max_int
  | Some v -> v

We do this because our distance table state encodes a second "predecessor edge" for the second element in the values.

The predecessor edge is the edge that connects the previous node in the shortest-distance path to the key-ed node. In other words, it is the edge that caused the last update to the distance state for that particular node.

For relax_edge, this is just the edge argument. Whenever the relaxation condition is met, we append the 2-tuple of du + cost, edge rather than du + cost alone:

let relax_edge (dist : tbl) (was_updated : bool) (edge : Node.t edge) : bool =
  let from_, to_, cost = edge in
  match Hashtbl.find dist from_, Hashtbl.find dist to_ with
  | (Some du, _), (None, _) ->
    Hashtbl.replace dist to_ (Some (du + cost), Some edge);
    ...
  | (Some du, _), (Some dv, _) when du + cost < dv ->
    Hashtbl.replace dist to_ (Some (du + cost), Some edge);
    ...

You can think of the (distance, predecessor) as separate derivations of the same state. distance tells us the shortest distance, while predecessor tells us the corresponding shortest-distance path from src.

We can lookup the predecessor edge with find_predecessor_edge:

let find_predecessor_edge (node : Node.t) (dist : tbl)
  : Node.t edge option =
  snd @@ Hashtbl.find dist node

And we can lookup the predecessor node with find_predecessor by reading the from node element:

let find_predecessor (node : Node.t) (tbl : tbl) : Node.t option =
  Option.map (fun (from_, _, _) -> from_) (find_predecessor_edge node tbl)

The typical implementation of Bellman Ford keeps two separate tables for both distance and the predecessor node. This is usualy the better option, but I felt that in a functional language like OCaml, it would be more idiomatic to merge them into 1 table because they are dependent on the same state (not unlike the grouping related state pattern from React).

Revisiting our OK cycle example of a non-negative cyclic graph where we set src to z:

let edges =
  [ ('a', '0', 3)
  ; ('0', 'a', -1)
  ; ('z', '0', 5)
  ; ('z', 'a', 5)
  ]

graph LR
  na["a"] -->|"3"| n0["0"]
  n0 -->|"-1"| na["a"]
  nz["z"] -->|"5"| n0["0"]
  nz -->|"5"| na["a"]

The shortest path from z to a has distance 4, which we can immediately read from our distance table state:

let module BellmanFord = Bellman_ford.Make (Char) in
let dist = BellmanFord.find_shortest_paths ~src:'z' edges in
Printf.printf "Minimum distance to 'a' = %d\n" (fst @@ Hashtbl.find dist 'a');

Minimum distance to 'a' = 4

The shortest distance path is z -> 0 -> a, which we can derive from the predecessor edge:

let predecessor_edge_of_a = BellmanFord.find_predecessor_edge 'a' dist in
Printf.printf "Predecessor edge is: %s\n" (pp_edge_opt predecessor_edge_of_a);

Predecessor edge is: 0 -> a (-1)

predecessor_edge_of_a says 0 is the last node in the shortest-distance path before we hit a. Then finding the predecessor edge of 0...

let predecessor_edge_of_0 = BellmanFord.find_predecessor_edge '0' dist in
Printf.printf "Predecessor edge is: %s\n" (pp_edge_opt predecessor_edge_of_0);

Predecessor edge is: z -> 0 (5)

Leads us back to our source node z. So the main takeaway is that 1 edge is enough to trace back the minimum distance path.

Detecting Negative Cycles

For the purposes of the Blue3 solver, we don't care about the shortest distance paths and only care about the distance values themselves when bellman ford is able to return a meaningful distance table. This is why we filter out the predecessor in the No_negative_cycle case:

let bellman_ford
  ...
  match find_cycle_entry_opt edges tbl with
  | None -> `No_negative_cycle (
    tbl
    |> Hashtbl.to_seq
    |> Seq.map (fun node -> node, find_distance node tbl)

The predecessor becomes useful when find_cycle_entry_opt returns the other Negative_cycle case:

let bellman_ford
  ...
  match find_cycle_entry_opt edges tbl with
  | None -> ...
  | Some entry -> (* we need predecessor to handle this case *)

find_cycle_entry_opt finds the first node from the edges list that is within the negative cycle. It does this by first running one more relaxation pass over the edges with a call to find_relaxed_node_opt:

let find_cycle_entry_opt (edges : Node.t edge list) (dist : tbl)
  : Node.t option =
  ...
  let relaxed_predecessor = find_relaxed_node_opt edges dist in

If find_relaxed_node_opt is able to update at least one more entry, it returns the to node of that entry:

let find_relaxed_node_opt (edges : Node.t edge list) (dist : tbl) : Node.t option =
  List.find_map (fun ((_, to_, _) as edge) ->
    if relax_edge dist false edge then
      Some to_
    else None)
  edges

The basic idea is that the minimum distance paths take at most NUM_NODES - 1 iterations over the edges to find. If we can relax any edge after those NUM_NODES - 1 iterations, then there is a negative cycle, because the subsequent NUM_NODES + 1, NUM_NODES + 2, ... iterations of relax_edge will also return true infinitely.

This returned node by find_relaxed_node_opt may not necessarily be in the cycle, however.

Consider the graph where we have the negative cycle of a -> b -> c -> a:

graph LR
  c["c"] -->|"0"| d["d"]
  s["s"] -->|"0"| a["a"]
  a["a"] -->|"1"| b["b"]
  b["b"] -->|"-4"| c["c"]
  c["c"] -->|"1"| a["a"]

If we just used the node from find_relaxed_node_opt instead of a real node in the cycle, then we'd incorrectly include c -> d in the reconstructed cycle:

let edges =
  [ ('c', 'd', 0)   (* outgoing edge from cycle to non-cycle node *)
  ; ('s', 'a', 0)
  ; ('a', 'b', 1)
  ; ('b', 'c', -4)
  ; ('c', 'a', 1)
  ]
in
let cycle_entry = BellmanFord.find_relaxed_node edges (fst dist) in
Printf.printf "First relaxed node found: %c\n" cycle_entry;
let cycle_from_entry = BellmanFord.collect_cycle cycle_entry dist in
List.iter (fun edge ->
  Printf.printf "- %s\n" (pp_edge edge))
  cycle_from_entry

First relaxed node found: d
- b -> c (-4)
- c -> a (1)
- a -> b (1)
- b -> c (-4)
- c -> d (0)

To find a node that is actually in the cycle, we have to back track from to until we've backtracked NUM_NODES parents (because after following NUM_NODES predecessor links, the pigeonhole principle guarantees we have skipped any non-cycle tail and landed on a node inside the cycle.) or until we hit our start node again:

let find_cycle_entry_opt (edges : Node.t edge list) (tbl, num_nodes : t)
  : Node.t option =
  let relaxed_predecessor = find_relaxed_node_opt edges tbl in
  match relaxed_predecessor with
  | None -> None
  | Some entry ->
    let rec move_back node n =
      if n = 0 then node
      else if n < num_nodes && node = entry then node
      else
        match find_predecessor node tbl with
        | None -> node
        | Some from_ -> move_back from_ (n - 1)
    in
    Some (move_back entry num_nodes)

Then with this entry node found, we can terminate the algorithm by building the negative cycle. We build it by backtracking one more time along the predecessor edges starting from our entry node:

let collect_cycle (start : Node.t) (tbl, num_nodes : t) : Node.t edge list =
  let rec loop curr n acc =
    if n = 0 then
      acc
    else
      match find_predecessor_edge curr tbl with
      | None -> acc
      | Some ((from_, _, _) as pred_edge) ->
        let acc = pred_edge :: acc in
        if Node.compare from_ start = 0 then acc
        else loop from_ (n - 1) acc
  in
  loop start num_nodes []

And that's all Bellman Ford needs to return the negative cycle edges:

let bellman_ford
  (type node)
  (module Node : Baby.OrderedType with type t = node)
 ~(src : node)
  (edges : node edge list)
  : [ `No_negative_cycle of (node * int) list
    | `Negative_cycle of node edge list
    ] =
  let open Make (Node) in
  let tbl = find_shortest_paths ~src edges in
  match find_cycle_entry_opt edges tbl with
  | None -> ...
  | Some entry -> `Negative_cycle (collect_cycle entry tbl)

let edges =
  [ ('s', 'a', 2)
  ; ('a', 'b', 1)
  ; ('b', 'c', -4)
  ; ('c', 'a', 1)
  ; ('c', 'd', 3)
  ]
in
let dist = BellmanFord.find_shortest_paths ~src:'s' edges in
let cycle_entry = BellmanFord.find_cycle_entry edges dist in
let cycle_from_entry = BellmanFord.collect_cycle cycle_entry dist in
Printf.printf "Negative cycle found:\n";
List.iter (fun edge ->
  Printf.printf "- %s\n" (pp_edge edge))
  cycle_from_entry;

Negative cycle found:
- b -> c (-4)
- c -> a (1)
- a -> b (1)

bathan1 and others added 25 commits April 18, 2026 13:30

Initial commit for port

bfc0632

Placeholder simplifier

799f1c1

Stubs

e17e5ba

Port over rewrite

3a01e2d

Finish rewrite_int_bounds

cf356b2

Add bellman ford solver

c53938d

Rebase

0b32699

Add boolean module

c3d4401

Almost finish solve loop

f58930b

Finish main solve loop from jaylang

699f1f2

Fix bellman_ford

c642a3b

Fix int rewriter

90923ea

Almost finish solve loop

f3dd3fc

Turn dpll into solver

253863c

Finish dpll theory solver

1a55847

Get workings solver

8043188

Finish port draft

4393197

add yojson as dependency

5a0aa61

configure test directory from makefile

bfd9a94

bump ocaml version

5dacfd8

add types to binary operators that occur in formulas

29460bb

Log inconsistent SATs

4f90943

Add clause checking

37160c2

Fix pipeline to not use rewrite double

99dd0a0

Add pipeline changes

cf9d9ab

bathan1 added the draft label Apr 23, 2026

Add analysis

500822a

bathan1 assigned bathan1 and brandonzstride and unassigned bathan1 Apr 23, 2026

bathan1 added 4 commits May 11, 2026 06:10

Finish bellman_ford

9293617

Simplify bellman_ford

f141f6b

Finish bellman_ford docs

4878391

Change names for clarity

a0e3aef

bathan1 added 25 commits May 11, 2026 18:59

Update idl calls to bellman_ford

98bca55

Fix some typos

b265013

More typo fixes and touch rest of report sections

8816269

Almost finish intro

d2b39e9

Add intro

d715d1e

Finish intro

b2ba6cf

Update titles

5c9777d

Finish idl

2613ea4

Add SAT and UNSAT solution types instead of options for sat module

23dbcdc

Merge connector with blue3 and add sat writeup

e929a65

Finish report

3a9e318

Fix some typos

975d134

Add pandoc script

a8cad80

Finish programming_blue3.md

3b92a06

Finish writeup

84940fc

Update benchmarks

607e641

Generate pdf

fbaa651

Add code snippets in bellman_ford

b4f690e

Update code snippets

8ee863b

Add code examples

3a77cb1

Add sql queries

e6f30bb

Fix implementation snippets

42fce40

Fix implementation snippets

b1fbb9a

Add sandbox scripts as their own executables

05bcbec

Add link highlighting

0823f46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Blue3 port#12

Blue3 port#12
bathan1 wants to merge 132 commits into
mainfrom
blue3-port

bathan1 commented Apr 23, 2026

Uh oh!

bathan1 commented May 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bathan1 commented Apr 23, 2026

Features added

Overview

Benchmarks

What were the average runtimes from both solvers?

Rows with the MIN times from both solvers

Rows with the MAX times from both solvers

How much faster were the fast cases on average?

How much slower were the slow cases on average?

What was the max time difference blue3 beat z3 by?

What was the max time difference z3 beat blue3 by?

Uh oh!

bathan1 commented May 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bellman Ford

Core Relaxation Loop

Predecessors and the Minimum Distance Path

Detecting Negative Cycles

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bathan1 commented May 11, 2026 •

edited

Loading