~/pitchdeck/01-cover

Seed Round · 2026

$ Intelligence layer for
Data Engineering & Data Science

ThinkingDBx is building the Bonacci family of agentic platforms, a vertical AI model from scratch, our own pipeline language, and memory infrastructure for AI agents.

Company ThinkingDBx

Founder Mallesh Madapathi

Location Hyderabad, India

Stage Seed Round

ThinkingDBx

01 / 14

~/pitchdeck/02-problem

the problem

The data stack was
built for humans, not agents.

Modern data teams stitch together five to ten point products to move data, transform it, observe it, and prepare it for AI. The result is brittle, expensive, and incompatible with how autonomous systems actually work.

sprawl

5 to 10 tools

Fivetran, dbt, Airflow, Monte Carlo, plus a vector DB bolted on for AI. Each one a contract, a console, and a different mental model.

cost

6-figure spend

Mid-sized data teams spend $200K to $2M annually on pipeline tooling alone. Per-row, per-connector, per-seat. It compounds.

friction

Built for clicking

Every tool assumes a human in the loop. An agent cannot reason across the seams. Memory is missing. Lineage is opaque.

ThinkingDBx

02 / 14

~/pitchdeck/03-why-now

why now

Three shifts are
converging at once.

LLMs crossed the reasoning threshold. Models can now plan over real production code, schemas, and execution traces, not just snippets.
Vertical training is finally accessible. TPU and GPU economics make it possible for a focused team to train a specialized model from scratch, not just fine-tune.
Enterprise AI budgets are committed but stuck. Every CTO has approved AI spend. Almost none can deploy because the data layer cannot keep up with the agent layer.
Memory has emerged as the missing primitive. Stateless agents collapse past 30 minutes. The agent era needs purpose-built memory infrastructure.

The window to define this category is the next 18 months. Incumbents are architected for the pre-agent era and cannot retrofit.

ThinkingDBx

03 / 14

~/pitchdeck/04-solution

our solution

Five layers.
One coherent stack.

We collapse the modern data stack into a single agentic platform, powered by a vertical model we are training ourselves, expressed in a language we designed for data pipelines, with memory built natively for agents.

layer 1

Bonacci Studio

Agentic data engineering platform. Visual + AI chat.

Live

layer 2

Bonacci Flow

Pure agentic DE/DS. No canvas, just intent.

Under Build

layer 3

Bonacci MoE 9B

Vertical model, built from scratch for DE/DS.

In Training

layer 4

ThinkingLanguage

Compiled language for pipelines. Apache DataFusion.

Live · OSS

layer 5

ThinkingMemory

Agent-native memory. Four-layer architecture.

Live · OSS

the loop

Bonacci Studio runs on ThinkingLanguage. Bonacci Flow reasons via the Bonacci model. Both products use ThinkingMemory. No incumbent has all five. No single-product AI startup has the surface area.

ThinkingDBx

04 / 14

~/pitchdeck/05-bonacci-studio

product · live

Bonacci Studio

Agentic data engineering platform. Pipelines built visually or through AI chat across JDBC, Kafka, MCP, PySpark, and SQL, orchestrated by autonomous agents.

Replaces Fivetran, dbt, Airflow, Monte Carlo, plus the vector DB bolted on for AI.
85% cheaper than Fivetran, Informatica, and Talend on equivalent enterprise workloads.
Multi-modal interface visual drag-and-drop, natural language chat, and code, all in one canvas.
Production-grade JDBC, Kafka, MCP, schema registry, CDC, scheduling, alerting.

status

Shipped and live

deploymentProduction-ready, onboarding first design partners.

architectureBuilt on ThinkingLanguage + Apache Spark + Camel + Kafka.

urlstudio.bonacci.thinkingdbx.com

ThinkingDBx

05 / 14

~/pitchdeck/06-bonacci-flow

product · under build

Bonacci Flow

A pure agentic data engineering and data science platform. No drag-and-drop. No canvas. Just intent. Agents plan, build, train, and ship pipelines and models end-to-end.

Intent in, production out. The user describes the outcome. Agents handle planning, building, and deployment.
Native reasoning over Spark, SQL, PySpark, and ML frameworks via the Bonacci model.
Full DE + DS loop. Pipelines, training, evaluation, and shipping in one runtime.
Closes the agentic stack alongside Studio's visual + chat interface.

status

In active build

The Flow runtime depends on Bonacci MoE 9B for deep reasoning over data engineering primitives. We are building them in lockstep, with first GA targeted within the funded runway.

target

First GA within 12 months

ThinkingDBx

06 / 14

~/pitchdeck/07-bonacci-moe-9b

model · in training

Bonacci MoE 9B

Our own model, built from scratch for data engineering and data science. Not a fine-tune. Not a wrapper. A vertical foundation model with first-class understanding of ThinkingLanguage, Spark, SQL, and pipeline execution semantics.

architecture

MoE 9B

Mixture-of-experts. Built for vertical depth in DE/DS reasoning.

pre-training

122B+ tokens

General corpus + FineWeb-Edu. TPU v6e-16.

post-training

SFT + GRPO

15K curated DE/DS pairs. Underway.

distribution

HF + vLLM

Released on HuggingFace, embedded in Bonacci.

Why a vertical model? Horizontal LLMs cannot reason over Spark execution plans, schema lineage, or pipeline failure modes. A model trained specifically on this domain produces deterministic, shippable code from intent.

ThinkingDBx

07 / 14

~/pitchdeck/08-infrastructure

infrastructure · live · open source

Two primitives.
Both live. Both open source.

language layer

ThinkingLanguage

The first compiled language where data pipelines, ML, and streaming are first-class primitives. Built on Apache DataFusion. Lets agents and humans express end-to-end pipelines with deterministic execution semantics.

▸Compiled, not interpreted
▸First-class pipelines, ML, streaming
▸Apache DataFusion execution
▸Deterministic. Shippable.

source

github.com/mplusm/thinkinglanguage →

memory layer

ThinkingMemory

Agent-agnostic memory infrastructure. Working, Episodic, Semantic, and Procedural memory layers, built for AI agents rather than retrofitted from vector search. Auto-compresses, forgets, and consolidates.

▸Four-layer memory architecture
▸Built for agents, not retrieval
▸Self-compressing, self-forgetting
▸No cold starts

source

github.com/mplusm/thinkingmemory →

ThinkingDBx

08 / 14

~/pitchdeck/09-market

market

A $80B+ market
rebuilding for agents.

Data infrastructure spend is large, growing, and entering a generational transition. The incumbents won the last era. They cannot win this one.

$80B+

Global Data Infra TAM

20%+

Annual growth rate

$50B

India SaaS exports by 2030

85%

Cost vs incumbents

The shift is not optional. Every enterprise CTO has approved AI budget. Almost none can deploy because their data layer is the bottleneck. We sell the answer.

ThinkingDBx

09 / 14

~/pitchdeck/10-competition

competition & moat

No one has the
full stack.

Incumbents lack the agent layer. AI-startup bolt-ons lack the data infrastructure. Vertical model players lack the platform. We ship all five layers in one coherent stack.

	Platform	Vertical Model	Pipeline Language	Agent Memory	Cost Position
Fivetran / Informatica / Talend	Partial	No	No	No	Expensive
Databricks / Snowflake	Partial	No	No	No	Expensive
AI startup bolt-ons	No	No	No	Partial	Mid
Horizontal LLM providers	No	No	No	No	Variable
ThinkingDBx	Yes	Yes	Yes	Yes	85% cheaper

ThinkingDBx

10 / 14

~/pitchdeck/11-traction

traction

Three products live.
One in training. One in build.

Pre-revenue and bootstrapped to date. Every layer is real and shipping; we are entering the GTM and capital-raise phase to scale what is already working.

bonacci studio

Shipped & Live

In production. Onboarding first enterprise design partners.

thinkingmemory

Open Source

Released. Developer community building. Signal for global developer reach.

thinkinglanguage

Open Source

Live, compiled. Powers Bonacci Studio in production.

bonacci moe 9b

122B+ tokens

Pre-training complete on TPU v6e-16. SFT + GRPO post-training underway.

What seed capital unlocks: Bonacci Flow GA, Bonacci MoE 9B weights release, first 20 enterprise design partners, hardened enterprise Studio, and the GTM motion to scale them.

ThinkingDBx

11 / 14

~/pitchdeck/12-team

team

Built by practitioners,
not theorists.

founder & ceo

Mallesh Madapathi

Senior Data Engineer and Data Scientist with 7+ years of experience across different domains of IT, and an MSc in Computer Science, AI & ML.

based: Hyderabad, India

email: mallesh@thinkingdbx.com

Why us

Full-stack technical depth. One team shipping platform, language, model, memory, and agentic runtime.
Builders, not pitchers. Three products live before raising. ThinkingMemory and ThinkingLanguage shipped open source.
India + global ambition. Hyderabad-built. Globally-targeted. India SaaS playbook (Freshworks, Postman, BrowserStack) plus AI-infra positioning.
Capital-efficient. Bootstrapped to date. Trained 122B+ tokens on TPU without prior funding.

ThinkingDBx

12 / 14

~/pitchdeck/13-use-of-funds

use of funds

Where the
capital goes.

Targeting Bonacci Flow GA, Bonacci MoE 9B weights release, hardened enterprise Studio, and the first cohort of enterprise design partners.

40%

25%

10%

40%

Engineering

Ship Bonacci Flow GA. Harden Studio for enterprise.

25%

Model R&D

Complete Bonacci MoE 9B training. Release weights.

25%

GTM

Sales + DevRel. First 20 enterprise design partners.

10%

Infra

Compute, TPU/GPU credits, production SRE.

ThinkingDBx

13 / 14

~/pitchdeck/14-the-ask

Now Raising · Seed

Help us build the
intelligence layer.

We are raising our seed round. Co-leads welcome. Strategic angels with data infrastructure, AI/ML, or developer-tools expertise especially valued.

▸ Book a 20-min intro See the products →

Mallesh Madapathi · Founder & CEO

mallesh@thinkingdbx.com

+91-9987941993

Hyderabad, India

— let's build ✎

14 / 14

$ Intelligence layer forData Engineering & Data Science

The data stack wasbuilt for humans, not agents.

5 to 10 tools

6-figure spend

Built for clicking

Three shifts areconverging at once.

Five layers.One coherent stack.

Bonacci Studio

Bonacci Flow

Bonacci MoE 9B

ThinkingLanguage

ThinkingMemory

Bonacci Studio

Shipped and live

Bonacci Flow

In active build

Bonacci MoE 9B

Two primitives.Both live. Both open source.

ThinkingLanguage

ThinkingMemory

A $80B+ marketrebuilding for agents.

No one has thefull stack.

Three products live.One in training. One in build.

Shipped & Live

Open Source

Open Source

122B+ tokens

Built by practitioners,not theorists.

Mallesh Madapathi

Why us

Where thecapital goes.

Help us build theintelligence layer.

$ Intelligence layer for
Data Engineering & Data Science

The data stack was
built for humans, not agents.

Three shifts are
converging at once.

Five layers.
One coherent stack.

Two primitives.
Both live. Both open source.

A $80B+ market
rebuilding for agents.

No one has the
full stack.

Three products live.
One in training. One in build.

Built by practitioners,
not theorists.

Where the
capital goes.

Help us build the
intelligence layer.