Rebellions Takes On Nvidia in AI Inference Chips

South Korean chipmaker Rebellions, valued above $2.3B, is challenging Nvidia in AI inference with a chiplet design built for agents and multimodal models.

4 min readEAEvgenii ArsentevEvgenii Arsentev · PhD

South Korean chipmaker Rebellions is positioning itself as a direct challenger to Nvidia in one specific arena: AI inference, the stage where a trained model actually runs and answers a request. Founded in 2020, the startup has raised $850 million, is now valued at more than $2.3 billion, and is expanding internationally. Its name is the pitch — 'here we are to lead one of the rebellions against the big incumbent in this AI industry, Nvidia,' said chief business officer Marshall Choy on the company's Targeting AI podcast appearance.

Crucially, Rebellions isn't trying to beat Nvidia at everything. From the start it focused only on inference, leaving training — the part Nvidia's GPUs dominate — alone. 'We thought that was by focusing on the inferencing side of things, to complement the existing GPUs that people have today doing training,' Choy said. That bet looks smart now that agentic AI is taking off: autonomous agents lean heavily on inference to decide their next move, so the cost and efficiency of running models in production is becoming the number that matters.

A different chip design

Technically, Rebellions builds 'chiplets' rather than one big monolithic processor: a large AI chip is fragmented into smaller, specialized silicon units that combine compute and memory into a compact, customizable package, working like a neural processing unit tuned for specific jobs — from call centers to healthcare systems. The company also tunes for mixture-of-experts and multimodal models rather than plain large language models, and it's closely tied to the open-source PyTorch community. It's a key member of 'K-Nvidia,' a multi-billion-dollar South Korean push to build sovereignty in AI and semiconductors.

Why this matters for you

You never see the chip, but you pay for it. Almost every time a chatbot answers you or an AI agent does a task, that work runs as inference, and today that overwhelmingly means Nvidia hardware — which keeps prices high and the whole industry dependent on one supplier. A credible alternative that is cheaper and more efficient at inference is exactly the kind of competition that pushes API prices down and reduces the single-vendor risk hanging over every AI product. My read is that the real fight in AI is quietly moving from 'who trains the biggest model' to 'who can run it cheaply enough to be worth using,' and that's the fight Rebellions picked. Whether a $2.3 billion startup can dent a company many times its size is an open question — but even one serious competitor in inference is good for anyone whose bill scales with usage.

What I'd actually do

Don't hard-wire your project to one model or provider. Keep your prompts and logic in a layer you control, so swapping the underlying model — or the hardware it runs on — is a config change, not a rewrite. When inference gets cheaper (and competition like this is how it does), you want to be able to take the discount immediately.

#ai#chips#rebellions#nvidia#inference

Related guides

EAEvgenii Arsentev

Author

Evgenii Arsentev

PhD · Chief Product Officer at a tech company

Want to actually build this?

Guides explain. The free course transforms — personalized, gamified, and built to get you shipping fast.

◉ Start the free course

Source: aibusiness.com