Sovereign ATX × Alpha School | Private AI Infrastructure

Section 01

The One-Paragraph Version

Alpha is spending ~$10,000 per student per year on cloud AI — and it's the right call. You're building the best AI-powered school in the world and that takes real investment. But cloud AI has a ceiling: every token is rationed, every query goes through OpenAI or Anthropic servers, and the frontier models Joe actually cares about — the ones beating Claude Opus 4.6 on published benchmarks — aren't available via any API at any price. This proposal is about removing that ceiling entirely. For $2,500/month (month-to-month, hardware included, cancel anytime), Alpha gets a private AI cluster on campus: unlimited capacity, student data that never leaves the building, and access to frontier open-source models that outperform cloud AI on the benchmarks that matter.

∞

Token limit per student

Bytes of student data leaving campus

2TB

Unified memory — runs models no cloud offers

M‑to‑M

Month-to-month. Cancel anytime.

Section 02

Cloud AI Is Rationed. Sovereign Removes the Ceiling.

The $10,000/student/year you're investing in AI tools is impressive — and it should be. Alpha's bet on AI-powered education is the right one. But cloud AI infrastructure has a fundamental design constraint: it's shared, metered, and rationed.

Token limits throttle the best use cases. Deep research, multi-step reasoning, long-context projects — these are exactly what Alpha students should be doing. Cloud API limits make that expensive to scale.
Peak hours mean degraded performance. When 40 students are working simultaneously, cloud providers slow responses. You pay premium pricing and get shared infrastructure.
The frontier models aren't available. Kimi K2.5 Thinking — which outperforms Claude Opus 4.6 on AIME 2025 — isn't on any API. DeepSeek R1 is available in limited form but not at the quality or capacity of running it locally. The best models don't have an "enterprise API" option.
Student data leaves campus. Every query goes through OpenAI or Anthropic infrastructure. For a school with FERPA obligations, that's a real exposure — and a reputational one.

Sovereign installs a private cluster on campus. Your students get uncapped access to frontier models. Nothing leaves the building. The ceiling is gone.

Section 03

Frontier Models. Not Available on Any Cloud.

The open-source model ecosystem caught up to — and in key areas, surpassed — cloud AI in 2025. These are the models running on the cluster:

88.4%

Kimi K2.5 Thinking
AIME 2025

86.7%

Claude Opus 4.6
AIME 2025

Kimi K2.5 Thinking beats Claude Opus 4.6 on AIME 2025

Advanced math olympiad benchmark. Published results. Kimi K2.5 Thinking is not available via any cloud API — only on hardware you own. This is what "removing the ceiling" means.

Model	Size	Best For	vs. Cloud	Available via API?
Kimi K2.5 Thinking	1T MoE	Math, reasoning, coding + multimodal vision	Beats Claude Opus 4.6 on AIME 2025 (88.4% vs 86.7%)	No
Llama 4 Maverick	128B	Fast general use, multimodal, daily student driver	Frontier-class. Meta's flagship open model.	No (local only at full quality)
DeepSeek R1	671B	Reasoning, math, science, deep analysis	Beats Claude Opus 4.6 on MATH-500, GPQA	Not at this quality level
Qwen3.5-397B	397B	Research, writing, multilingual, humanities	Competitive with Claude Opus 4.6	No
Claude Opus 4.6 / ChatGPT 5.4	Unknown	Cloud reference	Outperformed by models above	Yes — at $X/token, rationed

The cluster holds 2TB of unified memory across 4× Apple Mac Studio M5 Ultra. That's enough to run Kimi K2.5 Thinking at full FP16 quality — or hold multiple frontier models simultaneously so students can switch instantly. No other infrastructure option offers this at any price point.

Section 04

Why Apple Silicon, Not NVIDIA

NVIDIA GPUs are the right choice for hyperscale cloud training. They are not the right choice for a 40-student campus cluster. Here's why the M5 Ultra is the correct hardware for this deployment:

Unified memory architecture. A single M5 Ultra holds 512GB of memory on the same die as the compute — meaning a 400B parameter model loads in seconds and runs without the bandwidth bottlenecks that plague GPU server arrays.
2TB across 4 machines. Four M5 Ultras in a distributed inference cluster via the exo framework pool into 2TB. That number doesn't exist in the GPU world at this price point or power envelope.
Power and heat are manageable. NVIDIA data center GPUs require dedicated power circuits and specialized cooling. The M5 Ultra cluster draws a fraction of that — it fits in a standard server rack in any room with A/C.
The models that matter run on Apple Silicon. The largest frontier open-source models — 400B to 1T parameters — were designed to run in high-memory environments. The M5 Ultra unified memory architecture is ideal for exactly this class of model.

Section 05

What Gets Installed

One rack. Turnkey. Sovereign handles everything from site survey to go-live.

🖥️

4× Mac Studio M5 Ultra

512GB unified memory each. 2TB pooled. Runs the world's largest open-source models at full quality via exo distributed inference.

💻

15× Mac Mini Workstations

Pre-configured and pointed at the cluster on day one. Open WebUI interface — looks like ChatGPT, zero learning curve for students.

🌐

Enterprise Networking

Ubiquiti UniFi Dream Machine SE + USW-Pro-24-PoE. AI cluster on isolated VLAN. No inbound ports. Tailscale for remote management.

🤖

Models Loaded & Live

Kimi K2.5 Thinking, Llama 4 Maverick, DeepSeek R1, Qwen3.5-397B — all loaded, tested, and validated before handoff.

🎓

Student & Faculty Onboarding

Live training session. Students access from any device on campus. Faculty get the same system — custom agents for teacher workflows available on request.

🛠️

Remote Managed Service

Kavin monitors via Tailscale. New models pushed monthly. 4-hour response SLA. The hardware runs autonomously — no on-site IT required.

Student Experience

Students open a browser tab. They see an interface that looks identical to ChatGPT — it's Open WebUI, an open-source frontend. They type. They get a response from a model that beats Claude Opus 4.6 on the math benchmark. They never know (or care) that the AI is running on a rack down the hall instead of in a data center in Virginia. Zero learning curve. Access from any device on campus.

Faculty & Admin

Same system. Teachers get the same access. Custom agents for faculty workflows — lesson planning, grading assistance, curriculum research — can be built and deployed to specific user groups. Admin can audit usage, set access levels, and export logs at any time without involving a vendor.

Section 06

The Questions Joe Will Ask

What happens if Kavin can't service this?

The hardware runs autonomously. Kavin connects remotely via Tailscale for monitoring and updates — there are no credentials or keys held by Sovereign that are required for the cluster to keep running. If Sovereign ATX disappeared tomorrow, the cluster would keep serving students indefinitely. The only thing that would change is you'd stop getting model updates pushed automatically. The hardware is yours (on the rental plan, it transfers to you; on the purchase plan, it's yours from day one). Sovereign connects to the cluster — the cluster doesn't depend on Sovereign to operate.

Why now instead of waiting for the M5 Ultra?

The M5 Ultra Mac Studio is what's in this proposal — it was announced in 2025 and is available now. If you're thinking about the next generation after M5 Ultra: the rental plan is designed for exactly this. When new hardware drops, Sovereign swaps it in. No reinstall, no disruption — students don't notice. The software stack, models, and access URLs stay the same. Starting now means 40 students get unlimited AI access starting now, not in 12 months.

Is the student experience actually good?

Yes. Open WebUI is polished, actively developed, and looks like what students expect — think ChatGPT with model selection. Students access it from any device on the school network via browser. No app to install. Model selection is handled in the interface — Kavin can set a default so students don't have to choose. The system handles 40 simultaneous users without throttling.

What's the Alpha story here?

An Alpha student built this. Kavin Lingham — currently enrolled at Alpha — founded Sovereign ATX, deployed a live production installation at Black Sheep Coffee in March 2026, and is offering Alpha School the same stack as a founding client. Alpha students built the infrastructure Alpha runs on. That's the story — and it's real.

What if we want to upgrade or change models?

New frontier models are pushed to the cluster within days of release. Old models are archived. The school decides what's available — Kavin doesn't hold a veto on the model roster. If a new open-source model drops and beats everything else on a specific task, it gets added. This is a major advantage over cloud AI: you're not waiting for OpenAI or Anthropic to release a new version. The open-source ecosystem ships continuously.

Section 07

Pricing — Two Options

Both options include the same hardware, the same models, the same installation, and the same ongoing service. The difference is capital structure.

Recommended

Option A — Rental

$2,500

/ month · month-to-month · hardware included

Full cluster hardware included — no capital outlay
All 4 Mac Studio M5 Ultras + 15 Mac Minis
Enterprise networking hardware included
Installation, setup, and onboarding included
Ongoing managed service included
Model updates included
Cancel anytime — no contract, no lock-in
Hardware upgrade path: swap in next-gen seamlessly

Option B — Purchase

$65K

upfront + $2,500 / month managed service

Full hardware ownership from day one
4× Mac Studio M5 Ultra + 15× Mac Mini
Enterprise networking (Ubiquiti UDM-SE + Pro switch)
Full installation, setup, and onboarding
90-day support included in upfront price
$2,500/month ongoing for monitoring, updates, support
Hardware on Alpha's balance sheet (~$40K asset value)
Best for: schools that want permanent ownership

The rental option is the right starting point. $2,500/month for unlimited AI capacity across 40 students — no token limits, no data leaving campus, frontier models not available via any cloud. Cancel anytime. If this doesn't change how Alpha students work within 90 days, cancel it. We're betting it does.

Section 08

From Agreement to Live in 4–5 Weeks

Day 0 — Agreement Signed

Hardware order placed same day. Site survey scheduled within 48 hours.

Week 2 — Networking Deployed

Ubiquiti network installed. AI cluster VLAN configured. Rack location prepped. Cable runs completed.

Week 3 — Hardware Arrives

Mac Studios and Mac Minis arrive. Kavin personally racks, cables, and powers all hardware.

Week 4 — Software & Models

exo framework configured, models loaded and benchmarked locally, Open WebUI deployed, all 15 workstations configured.

Week 5 — Go Live

Student and faculty training session. Documentation delivered. System is live. Kavin begins remote monitoring.

Ongoing — Remote Service

Kavin monitors via Tailscale. Monthly model updates. 4-hour response SLA. New frontier models added as they release.

Section 09

Built by One of Your Students

My name is Kavin Lingham. I'm enrolled at Alpha School. I've spent the last year building Sovereign ATX — a private AI infrastructure company. In March 2026, I deployed the first external production installation at Black Sheep Coffee, a multi-location Austin café group. That deployment is running live.

I wrote the infrastructure configs. I rack the hardware personally. I manage deployments remotely via Tailscale. I wrote the service agreements and BAA templates. The whole operation runs out of my home lab on a Mac Studio M1 Ultra that I use for R&D.

Alpha School is where I learned to build things that actually work. This proposal is me offering Alpha School the same infrastructure I'm building for paying clients — as a founding client, at a price that makes it easy to say yes, with no lock-in because I want this to be a 5-year relationship, not a signed contract.

The pitch isn't "let me save you money." The pitch is: an Alpha student built the best AI infrastructure for education that currently exists, and he's offering it to Alpha first.

"Alpha gave me the time and freedom to build something real. This is me building it for Alpha."

Section 10

Next Steps

This is a 30-minute conversation, not a 6-month procurement process. Here's how we move:

30-minute call — walk through any questions on the hardware, models, succession risk, student experience, or anything else
Site walk (1 hour) — Kavin identifies rack location, power, and cable routing at the Alpha campus
Agreement signed — one-page month-to-month service agreement; hardware order placed same day
4–5 weeks to live — 40 students with unlimited access to frontier AI, all on campus

If you want to see the stack running in production before committing, Kavin can arrange a visit to the Black Sheep Coffee deployment — the same hardware, same software, same managed service, currently live.

"Cloud AI is the ceiling.
This removes it."
— Unlimited capacity for $2,500/month. Month-to-month. Cancel anytime.

Remove the Ceiling.Own the Infrastructure.

The One-Paragraph Version

Cloud AI Is Rationed. Sovereign Removes the Ceiling.

Frontier Models. Not Available on Any Cloud.

Why Apple Silicon, Not NVIDIA

What Gets Installed

The Questions Joe Will Ask

Pricing — Two Options

From Agreement to Live in 4–5 Weeks

Built by One of Your Students

Next Steps

Remove the Ceiling.
Own the Infrastructure.