Section 01
The One-Paragraph Version
Alpha is spending ~$10,000 per student per year on cloud AI — and it's the right call. You're building the best AI-powered school in the world and that takes real investment. But cloud AI has a ceiling: every token is rationed, every query goes through OpenAI or Anthropic servers, and the frontier models Joe actually cares about — the ones beating Claude Opus 4.6 on published benchmarks — aren't available via any API at any price. This proposal is about removing that ceiling entirely. For $2,500/month (month-to-month, hardware included, cancel anytime), Alpha gets a private AI cluster on campus: unlimited capacity, student data that never leaves the building, and access to frontier open-source models that outperform cloud AI on the benchmarks that matter.
∞
Token limit per student
0
Bytes of student data leaving campus
2TB
Unified memory — runs models no cloud offers
M‑to‑M
Month-to-month. Cancel anytime.
Section 02
Cloud AI Is Rationed. Sovereign Removes the Ceiling.
The $10,000/student/year you're investing in AI tools is impressive — and it should be. Alpha's bet on AI-powered education is the right one. But cloud AI infrastructure has a fundamental design constraint: it's shared, metered, and rationed.
- Token limits throttle the best use cases. Deep research, multi-step reasoning, long-context projects — these are exactly what Alpha students should be doing. Cloud API limits make that expensive to scale.
- Peak hours mean degraded performance. When 40 students are working simultaneously, cloud providers slow responses. You pay premium pricing and get shared infrastructure.
- The frontier models aren't available. Kimi K2.5 Thinking — which outperforms Claude Opus 4.6 on AIME 2025 — isn't on any API. DeepSeek R1 is available in limited form but not at the quality or capacity of running it locally. The best models don't have an "enterprise API" option.
- Student data leaves campus. Every query goes through OpenAI or Anthropic infrastructure. For a school with FERPA obligations, that's a real exposure — and a reputational one.
Sovereign installs a private cluster on campus. Your students get uncapped access to frontier models. Nothing leaves the building. The ceiling is gone.
Section 03
Frontier Models. Not Available on Any Cloud.
The open-source model ecosystem caught up to — and in key areas, surpassed — cloud AI in 2025. These are the models running on the cluster:
88.4%
Kimi K2.5 Thinking
AIME 2025
vs
86.7%
Claude Opus 4.6
AIME 2025
Kimi K2.5 Thinking beats Claude Opus 4.6 on AIME 2025
Advanced math olympiad benchmark. Published results. Kimi K2.5 Thinking is not available via any cloud API — only on hardware you own. This is what "removing the ceiling" means.
| Model |
Size |
Best For |
vs. Cloud |
Available via API? |
| Kimi K2.5 Thinking |
1T MoE |
Math, reasoning, coding + multimodal vision |
Beats Claude Opus 4.6 on AIME 2025 (88.4% vs 86.7%) |
No |
| Llama 4 Maverick |
128B |
Fast general use, multimodal, daily student driver |
Frontier-class. Meta's flagship open model. |
No (local only at full quality) |
| DeepSeek R1 |
671B |
Reasoning, math, science, deep analysis |
Beats Claude Opus 4.6 on MATH-500, GPQA |
Not at this quality level |
| Qwen3.5-397B |
397B |
Research, writing, multilingual, humanities |
Competitive with Claude Opus 4.6 |
No |
| Claude Opus 4.6 / ChatGPT 5.4 |
Unknown |
Cloud reference |
Outperformed by models above |
Yes — at $X/token, rationed |
The cluster holds 2TB of unified memory across 4× Apple Mac Studio M5 Ultra. That's enough to run Kimi K2.5 Thinking at full FP16 quality — or hold multiple frontier models simultaneously so students can switch instantly. No other infrastructure option offers this at any price point.
Section 04
Why Apple Silicon, Not NVIDIA
NVIDIA GPUs are the right choice for hyperscale cloud training. They are not the right choice for a 40-student campus cluster. Here's why the M5 Ultra is the correct hardware for this deployment:
- Unified memory architecture. A single M5 Ultra holds 512GB of memory on the same die as the compute — meaning a 400B parameter model loads in seconds and runs without the bandwidth bottlenecks that plague GPU server arrays.
- 2TB across 4 machines. Four M5 Ultras in a distributed inference cluster via the exo framework pool into 2TB. That number doesn't exist in the GPU world at this price point or power envelope.
- Power and heat are manageable. NVIDIA data center GPUs require dedicated power circuits and specialized cooling. The M5 Ultra cluster draws a fraction of that — it fits in a standard server rack in any room with A/C.
- The models that matter run on Apple Silicon. The largest frontier open-source models — 400B to 1T parameters — were designed to run in high-memory environments. The M5 Ultra unified memory architecture is ideal for exactly this class of model.
Section 05
What Gets Installed
One rack. Turnkey. Sovereign handles everything from site survey to go-live.
🖥️
4× Mac Studio M5 Ultra
512GB unified memory each. 2TB pooled. Runs the world's largest open-source models at full quality via exo distributed inference.
💻
15× Mac Mini Workstations
Pre-configured and pointed at the cluster on day one. Open WebUI interface — looks like ChatGPT, zero learning curve for students.
🌐
Enterprise Networking
Ubiquiti UniFi Dream Machine SE + USW-Pro-24-PoE. AI cluster on isolated VLAN. No inbound ports. Tailscale for remote management.
🤖
Models Loaded & Live
Kimi K2.5 Thinking, Llama 4 Maverick, DeepSeek R1, Qwen3.5-397B — all loaded, tested, and validated before handoff.
🎓
Student & Faculty Onboarding
Live training session. Students access from any device on campus. Faculty get the same system — custom agents for teacher workflows available on request.
🛠️
Remote Managed Service
Kavin monitors via Tailscale. New models pushed monthly. 4-hour response SLA. The hardware runs autonomously — no on-site IT required.
Student Experience
Students open a browser tab. They see an interface that looks identical to ChatGPT — it's Open WebUI, an open-source frontend. They type. They get a response from a model that beats Claude Opus 4.6 on the math benchmark. They never know (or care) that the AI is running on a rack down the hall instead of in a data center in Virginia. Zero learning curve. Access from any device on campus.
Faculty & Admin
Same system. Teachers get the same access. Custom agents for faculty workflows — lesson planning, grading assistance, curriculum research — can be built and deployed to specific user groups. Admin can audit usage, set access levels, and export logs at any time without involving a vendor.
Section 06
The Questions Joe Will Ask
What happens if Kavin can't service this?
The hardware runs autonomously. Kavin connects remotely via Tailscale for monitoring and updates — there are no credentials or keys held by Sovereign that are required for the cluster to keep running. If Sovereign ATX disappeared tomorrow, the cluster would keep serving students indefinitely. The only thing that would change is you'd stop getting model updates pushed automatically. The hardware is yours (on the rental plan, it transfers to you; on the purchase plan, it's yours from day one). Sovereign connects to the cluster — the cluster doesn't depend on Sovereign to operate.
Why now instead of waiting for the M5 Ultra?
The M5 Ultra Mac Studio is what's in this proposal — it was announced in 2025 and is available now. If you're thinking about the next generation after M5 Ultra: the rental plan is designed for exactly this. When new hardware drops, Sovereign swaps it in. No reinstall, no disruption — students don't notice. The software stack, models, and access URLs stay the same. Starting now means 40 students get unlimited AI access starting now, not in 12 months.
Is the student experience actually good?
Yes. Open WebUI is polished, actively developed, and looks like what students expect — think ChatGPT with model selection. Students access it from any device on the school network via browser. No app to install. Model selection is handled in the interface — Kavin can set a default so students don't have to choose. The system handles 40 simultaneous users without throttling.
What's the Alpha story here?
An Alpha student built this. Kavin Lingham — currently enrolled at Alpha — founded Sovereign ATX, deployed a live production installation at Black Sheep Coffee in March 2026, and is offering Alpha School the same stack as a founding client. Alpha students built the infrastructure Alpha runs on. That's the story — and it's real.
What if we want to upgrade or change models?
New frontier models are pushed to the cluster within days of release. Old models are archived. The school decides what's available — Kavin doesn't hold a veto on the model roster. If a new open-source model drops and beats everything else on a specific task, it gets added. This is a major advantage over cloud AI: you're not waiting for OpenAI or Anthropic to release a new version. The open-source ecosystem ships continuously.
Section 07
Pricing — Two Options
Both options include the same hardware, the same models, the same installation, and the same ongoing service. The difference is capital structure.
Recommended
Option A — Rental
$2,500
/ month · month-to-month · hardware included
- Full cluster hardware included — no capital outlay
- All 4 Mac Studio M5 Ultras + 15 Mac Minis
- Enterprise networking hardware included
- Installation, setup, and onboarding included
- Ongoing managed service included
- Model updates included
- Cancel anytime — no contract, no lock-in
- Hardware upgrade path: swap in next-gen seamlessly
Option B — Purchase
$65K
upfront + $2,500 / month managed service
- Full hardware ownership from day one
- 4× Mac Studio M5 Ultra + 15× Mac Mini
- Enterprise networking (Ubiquiti UDM-SE + Pro switch)
- Full installation, setup, and onboarding
- 90-day support included in upfront price
- $2,500/month ongoing for monitoring, updates, support
- Hardware on Alpha's balance sheet (~$40K asset value)
- Best for: schools that want permanent ownership
The rental option is the right starting point. $2,500/month for unlimited AI capacity across 40 students — no token limits, no data leaving campus, frontier models not available via any cloud. Cancel anytime. If this doesn't change how Alpha students work within 90 days, cancel it. We're betting it does.
Section 08
From Agreement to Live in 4–5 Weeks
Day 0 — Agreement Signed
Hardware order placed same day. Site survey scheduled within 48 hours.
Week 2 — Networking Deployed
Ubiquiti network installed. AI cluster VLAN configured. Rack location prepped. Cable runs completed.
Week 3 — Hardware Arrives
Mac Studios and Mac Minis arrive. Kavin personally racks, cables, and powers all hardware.
Week 4 — Software & Models
exo framework configured, models loaded and benchmarked locally, Open WebUI deployed, all 15 workstations configured.
Week 5 — Go Live
Student and faculty training session. Documentation delivered. System is live. Kavin begins remote monitoring.
Ongoing — Remote Service
Kavin monitors via Tailscale. Monthly model updates. 4-hour response SLA. New frontier models added as they release.
Section 09
Built by One of Your Students
My name is Kavin Lingham. I'm enrolled at Alpha School. I've spent the last year building Sovereign ATX — a private AI infrastructure company. In March 2026, I deployed the first external production installation at Black Sheep Coffee, a multi-location Austin café group. That deployment is running live.
I wrote the infrastructure configs. I rack the hardware personally. I manage deployments remotely via Tailscale. I wrote the service agreements and BAA templates. The whole operation runs out of my home lab on a Mac Studio M1 Ultra that I use for R&D.
Alpha School is where I learned to build things that actually work. This proposal is me offering Alpha School the same infrastructure I'm building for paying clients — as a founding client, at a price that makes it easy to say yes, with no lock-in because I want this to be a 5-year relationship, not a signed contract.
The pitch isn't "let me save you money." The pitch is: an Alpha student built the best AI infrastructure for education that currently exists, and he's offering it to Alpha first.
"Alpha gave me the time and freedom to build something real. This is me building it for Alpha."
Section 10
Next Steps
This is a 30-minute conversation, not a 6-month procurement process. Here's how we move:
- 30-minute call — walk through any questions on the hardware, models, succession risk, student experience, or anything else
- Site walk (1 hour) — Kavin identifies rack location, power, and cable routing at the Alpha campus
- Agreement signed — one-page month-to-month service agreement; hardware order placed same day
- 4–5 weeks to live — 40 students with unlimited access to frontier AI, all on campus
If you want to see the stack running in production before committing, Kavin can arrange a visit to the Black Sheep Coffee deployment — the same hardware, same software, same managed service, currently live.
"Cloud AI is the ceiling.
This removes it."
— Unlimited capacity for $2,500/month. Month-to-month. Cancel anytime.