gawee

gawee는 ONNX 모델을 받아 그래프 rewrite를 수행하고, MLIR 기반 파이프라인 (Gawee Dialect -> Linalg -> SCF/LLVM)으로 lowering해서 AOT 실행 파일까지 만드는 딥러닝 컴파일러 프로젝트입니다.

현재 우선순위는 "작은 vision 모델용 우회 rewrite"보다, resnet / bert_tiny / tinyllama 같은 실제 benchmark 모델이 middle-end에서 직접 지원되는 op 집합을 넓히는 것입니다.

현재 상태

모델	ONNX Emission	Gawee -> Linalg	Full LLVM/AOT	Correctness	비고
ResNet-18	pass	pass	pass	pass (5.25e-06)	vision baseline
bert_tiny	pass	pass	pass	pass (1.79e-07)	transformer encoder
tinyllama_15m	pass	pass	pass	pass (1.62e-05)	RoPE 포함 decoder LLM

Extended Benchmarks (확장 대상)

모델	타입	노드 수	비고
yolo26_nano	vision/detection	397	Conv/Sigmoid 기반, TopK/ReduceMax 미지원
smollm_135m	nlp/decoder	2844	30-layer, Trilu/ScatterND 미지원

이번 단계에서 늘린 지원 범위

MatMul을 Gemm/Linear와 분리
semantic op를 gawee dialect에 명시적으로 추가
- gawee.gather
- gawee.gather_elements
- gawee.range
- gawee.resize
- gawee.split
- gawee.tile
trivial decomposition은 emitter에서 직접 lowering
- Pow, Neg, Sin, Cos, And, LessOrEqual, IsNaN, Mod
- Constant, ConstantOfShape

핵심 의도는 front에서 unsupported op를 과하게 rewrite해서 숨기지 않고, middle-end가 benchmark에 실제로 등장하는 op semantics를 직접 받도록 만드는 것입니다.

파이프라인

ONNX Model
  -> Rewrite / Optimize (Python)
  -> MLIR Gawee Dialect
  -> Linalg
  -> Bufferization / SCF / Math / LLVM
  -> Native Binary

Frontend (Python)

ONNX graph rewrite 및 정규화
constant folding, spec-driven rewrite
모델별 unsupported op audit

Middle-end (C++ / MLIR)

gawee-onnx-translate
- ONNX protobuf를 직접 읽어서 gawee.* 또는 direct tensor/linalg/math op 생성
gawee-opt
- --convert-gawee-to-linalg
- --gawee-to-loops
- --gawee-to-llvm

Backend (C++)

AOT 실행 파일 생성
ONNX Runtime 기준 결과 비교
NLP 모델은 static shape binding 경로를 사용

지원 전략

모든 연산을 새 dialect op로 만들지는 않습니다.

1. semantic op는 `gawee`에 남긴다

다음 op는 shape/axis/lookup 의미가 크고, 나중에 fallback 여부를 판단할 가치가 있습니다.

MatMul
Gather
GatherElements
Range
Resize
Split
Tile

이런 op는:

GaweeOps.td에 정의
ONNX emitter에서 gawee.* 생성
GaweeToLinalg.cpp에서 lowering

순서로 구현합니다.

2. trivial op는 direct decomposition 한다

다음 op는 별도 dialect op 없이 emitter에서 바로 푸는 편이 낫습니다.

Pow
Neg
Sin
Cos
And
LessOrEqual
IsNaN
Mod
Constant
ConstantOfShape

이 경우 tensor.generate, linalg.generic, arith, math 조합으로 직접 낮춥니다.

현재 middle-end에서 중요한 op

현재 Gawee dialect와 ONNX emission 경로에서 중요한 축은 다음과 같습니다.

CNN 경로
- conv, relu, add, max_pool, average_pool, global_average_pool
Transformer / LLM 경로
- matmul, reshape, transpose, expand, slice, softmax
- gather, gather_elements, range, split, tile, resize
공통 연산
- mul, div, sub, reduce_mean, reduce_sum, where, cast

검증 메모

3개 priority 모델 모두 원본 ONNX에서 end-to-end correctness 통과:

resnet18: max_abs_diff = 5.25e-06 (atol=1e-4)
bert_tiny: max_abs_diff = 1.79e-07 (atol=5e-4)
tinyllama_15m: max_abs_diff = 1.62e-05 (atol=5e-4)

원본 ONNX를 직접 MLIR 파이프라인에 태운다. Frontend rewrite된 ONNX는 노드 수 증가로 FP 오차가 누적되므로 사용하지 않는다.

distilbert_base_uncased는 모델이 커서 dev 머신 행을 유발하므로 영구 제외. qwen3_0_6b은 확장 benchmark로 두고, semantic op illegal 여부만 우선 확인한다.

학습 문서

이번 단계와 직접 연결되는 문서:

프로젝트 구조

gawee/
├── front/onnx_rewrite/        # Python frontend / ONNX rewrite
├── middle/mlir/               # MLIR middle-end
│   ├── include/Gawee/         # TableGen dialect/op definitions
│   ├── include/Emit/          # ONNX emitter headers
│   ├── lib/Emit/              # ONNX -> MLIR
│   ├── lib/Conversion/        # Gawee -> Linalg / lowering pipeline
│   └── tools/                 # gawee-opt, gawee-onnx-translate
├── back/                      # AOT builder / evaluation
└── docs/                      # notes and reports

참고

ONNX specification
MLIR documentation
LLVM documentation

Name		Name	Last commit message	Last commit date
Latest commit History 181 Commits
back		back
benchmarks		benchmarks
docs		docs
front		front
gawee_ir		gawee_ir
middle		middle
models		models
scripts		scripts
tests		tests
.clangd		.clangd
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.en.md		README.en.md
README.ko.md		README.ko.md
README.md		README.md
future.md		future.md
how.md		how.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gawee

현재 상태

Extended Benchmarks (확장 대상)

이번 단계에서 늘린 지원 범위

파이프라인

Frontend (Python)

Middle-end (C++ / MLIR)

Backend (C++)

지원 전략

1. semantic op는 `gawee`에 남긴다

2. trivial op는 direct decomposition 한다

현재 middle-end에서 중요한 op

검증 메모

학습 문서

프로젝트 구조

참고

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

gawee

현재 상태

Extended Benchmarks (확장 대상)

이번 단계에서 늘린 지원 범위

파이프라인

Frontend (Python)

Middle-end (C++ / MLIR)

Backend (C++)

지원 전략

1. semantic op는 gawee에 남긴다

2. trivial op는 direct decomposition 한다

현재 middle-end에서 중요한 op

검증 메모

학습 문서

프로젝트 구조

참고

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. semantic op는 `gawee`에 남긴다

Packages