All posts

MiniMax M3 Crashed the Price of Frontier AI

Chinese startup MiniMax released M3 over the weekend, beating GPT-5.5 and Gemini 3.1 Pro on coding and agentic tasks for a fraction of the cost. Open weights are coming, with plans starting near twenty dollars

June 2, 20263 min read
Heavy black zine-style illustration of a split price tag and a thick arrow smashing through tall AI model blocks like dominoes, led by a blocky mechanical creature, symbolizing a低价

The Price of Intelligence Just Crashed

Chinese AI startup MiniMax released its M3 model on Sunday. The benchmarks are sharp. It tops GPT-5.5 and Gemini 3.1 Pro on coding and agentic tasks. The context window stretches to one million tokens. It handles text, images, and audio natively. The real headline is the bill. MiniMax priced its subscription token plans at roughly five to ten percent of what OpenAI and Google charge. Some reports put the starting point near twenty dollars a month. That is not a discount. It is a rewrite of the market.

And the company is not keeping it in a vault. MiniMax plans to release M3 under an open source license with open weights. That means teams can download the model, inspect it, fine-tune it on private data, and run it locally or inside their own stack. For founders who have been renting intelligence by the token from a closed API, this is a genuine exit ramp.

Why Open Weights Matter More Than the Benchmark

Benchmarks are useful until they are not. Every lab cherry-picks the metric that flatters its latest release. But open weights change the economics of building. When you own the model file, you control the latency, the privacy posture, and the unit cost. You can quantize it, merge it with LoRA adapters, or strip out the vision layer entirely if that fits your product. Pinterest already proved this playbook works when it gutted Qwen3-VL and rebuilt the vision stack in-house, cutting AI costs by ninety percent while improving accuracy. M3 gives more teams the raw material to pull off the same trick.

There is also the question of data. Most interesting startups sit on a messy, proprietary dataset that no frontier lab has seen. Fine-tuning a closed model through an API is expensive and legally vague. Fine-tuning an open model on your own hardware is just engineering. With M3 arriving at twenty dollars a month, the threshold for that experiment drops to almost zero.

What This Means for What You Ship Next

At Botflow, we see this pattern accelerate every month. Builders start with an API call because it is easy. Then the bill shows up. Then they move to smaller open models, custom embeddings, or their own quantized weights. Botflow is built for that second phase. The backend runs on Convex, which handles real-time queries and vector search out of the box. You are not duct-taping a database to a model. You are building a product where the AI layer and the data layer share the same reactive backbone.

That matters when you want to move fast. If you are building a shopping aggregator, a customer support agent, or an internal operations tool, you need the model to read from your data in real time, not through a slow RAG pipeline that breaks every other Tuesday. You also need to swap models without rewiring your entire stack. An open-weights strategy gives you that flexibility.

MiniMax is not the first startup to challenge the pricing cartel, and it will not be the last. The trend is clear. Frontier capability is becoming a commodity faster than most investors expected. For founders, this is the best kind of deflation. It means a twenty-two-year-old in a garage can ship an AI feature set that would have required a Series C budget two years ago.

The builders who win the next wave will be the ones who treat models as replaceable engines, not as mystical partners. They will pick a stack that lets them upgrade the engine without rebuilding the car. If you have been waiting for the moment to add real AI to your product, the excuse just got thinner. Download the weights, point your prompt at your own data, and ship.