Bonacci MoE 9B
The first purpose-built mixture-of-experts model for Data Engineering & Data Science. Expert-level SQL generation, PySpark pipeline authoring, data modeling, and autonomous multi-step orchestration — all in a single 9B sparse model that activates only 2.6B parameters per token.
8 experts per layer, top-2 routing, GQA attention. Only 2.6B of 9B parameters fire per token — frontier-class quality at small-model cost.
Model Parameters
| Model type | Sparse MoE |
| Total parameters | 9B |
| Active parameters | 2.6B / token |
| Experts per layer | 8 |
| Expert routing | Top-2 |
| Context window | 32,768 |
| Vocabulary | 49,152 |
| License | Commercial |
Transformer Block
| Hidden dimension | 2048 |
| Layers | 32 |
| Attention heads | 16 |
| KV heads (GQA) | 4 |
| MLP dim / expert | 5504 |
| Positional encoding | RoPE |
| Normalization | RMSNorm |
| Activation | SwiGLU |
Not a general-purpose code model bolted onto data tasks. Every capability is post-trained against execution-grounded rewards.
- 01
SQL Generation & Optimization
Production SQL from natural language. Optimize slow queries, explain execution behavior.
- 02
SQL Dialect Translation
BigQuery, Snowflake, Spark SQL, PostgreSQL, Trino — full semantic preservation.
- 03
PySpark Pipeline Authoring
Optimized batch & streaming pipelines with best-practice patterns built-in.
- 04
ThinkingLanguage Native
First-class understanding of the Bonacci platform's declarative pipeline DSL.
- 05
Pipeline Autopilot
Autonomous multi-step reasoning. Decompose, plan, iterate with execution feedback.
- 06
Execution Plan Generation
Structured JSON plans with cost estimates & optimization recommendations.
- 07
Data Security
PII detection, encryption strategy, RLS policies, GDPR/CCPA/HIPAA gap analysis.
- 08
Data Modeling
Star schema, Data Vault 2.0, Medallion, SCD Types 1/2/3/4.
- 09
Lakehouse Integration
Native knowledge of Apache Iceberg, Delta Lake, and Apache Hudi internals.
- 10
Tool Use & MCP
10+ tools via Model Context Protocol — DBs, storage, orchestration, catalogs.
- 11
RAG-Aware Generation
Consumes schema docs, table samples, and domain glossaries via the embedding model.
- 12
Extended Thinking
Surfaces multi-step reasoning for complex query & schema design decisions.
- 13
DQ & Observability
Data quality rules, freshness monitors, anomaly detection logic.
- 14
Streaming & CDC
Kafka, Flink, and CDC pipeline design with backward-compat checks.
Snowflake → BigQuery
Round-trip translation with full semantic preservation across five dialects: BigQuery, Snowflake, Spark SQL, PostgreSQL, Trino. Handles QUALIFY, OBJECT_CONSTRUCT, LISTAGG, window functions, and dialect-specific types.
- Function-level dialect mapping
- Window function rewrites
- Type coercion safety
Autonomous multi-step reasoning
Give the model a high-level goal — "monitor Postgres CDC, detect schema drift, alert on anomalies, sync to Snowflake" — and it decomposes the task, generates a plan, and emits the full pipeline.
- Decomposition & planning
- Execution-feedback iteration
- End-to-end YAML emission
Declarative Bonacci pipelines
The model has first-class understanding of ThinkingLanguage — the Bonacci platform's declarative pipeline DSL. It composes sources, transforms, and sinks with awareness of the execution model.
- Source / transform / sink composition
- Dependency & schedule awareness
- Multi-sink fan-out (warehouse + feature store)
Two-stage pre-training on TPU v6e-16 with MaxText/JAX, followed by SFT and GRPO with code-execution rewards.
-
01
pre-train · stage 1 · 32B tokens
General Corpus
Broad language understanding, code, and reasoning foundations.
-
02
pre-train · stage 2 · 90B tokens
FineWeb-Edu
High-quality educational and technical web content.
-
03
post-train · sft · 15K+ pairs
Supervised Fine-Tuning
Curated DE/DS pairs — SQL, PySpark, ThinkingLanguage, pipeline blueprints, schemas.
-
04
post-train · rl · 230 test cases
GRPO + Code Execution
Reward-shaped on SQL correctness, query plan quality, and PySpark execution — evaluated across 230 test cases across 6 benchmarks.
Safetensors, GGUF quantizations, OpenAI-compatible vLLM, Ollama, llama.cpp, and Docker.
OpenAI-compatible API
Python (bfloat16)
Bonacci MoE 9B is the intelligence layer powering Bonacci Studio and Bonacci Flow.
Bonacci Studio
Conversational data engineering. Chat with the model, get production SQL, PySpark, and ThinkingLanguage pipelines back.
▸ explore studio →Bonacci Flow
Pipeline Autopilot at platform scale — autonomous orchestration, scheduling, and self-healing for your data infrastructure.
▸ explore flow →Bonacci MoE 9B is coming soon
The model is currently under active development. Leave us a note and we'll let you know the moment weights and APIs are available — on HuggingFace, vLLM, and embedded inside the Bonacci platform.
— first DE/DS-native MoE ✎