adlrocha.substack.com/p/adlrocha-in-a-quest-to-becoming-ai?triedRedirect=true
5 corrections found
No more flat subscriptions, from now on everyone has to pay for the tokens they use.
GitHub did switch Copilot to usage-based billing, but it did not eliminate flat subscription plans. The plans still exist and now include a monthly allotment of AI Credits; extra token-based charges are optional/conditional overages.
Full reasoning
GitHub's April 27, 2026 announcement says all Copilot plans remain in place and that each plan now includes a monthly allotment of GitHub AI Credits. It also says base plan pricing is not changing (for example, Copilot Pro remains $10/month and Pro+ remains $39/month). In other words, GitHub did not move to a pure pay-per-token model where everyone must pay for every token they use. Instead, it kept subscription plans and folded a monthly included credit balance into those plans, with additional usage available only after the included credits are exhausted and, for paid plans, if extra usage is allowed.
GitHub's billing docs say the same pattern existed even before the June 1 transition for premium requests: each plan included a monthly allowance, and extra usage was only billed if overages were enabled. So the statement that there are 'no more flat subscriptions' overstates the change.
2 sources
- GitHub Copilot is moving to usage-based billing - The GitHub Blog
"Instead of counting premium requests, every Copilot plan will include a monthly allotment of GitHub AI Credits, with the option for paid plans to purchase additional usage." The same post also says: "Base plan pricing is not changing. Copilot Pro remains $10/month, Pro+ remains $39/month..."
- GitHub Copilot premium requests - GitHub Docs
GitHub Docs states that "Each plan includes a fixed number of premium requests per user per month" and that if you exceed your allowance, extra usage is billed only if overages are enabled.
What do you get in exchange? 192GB of VRAM at 936 GB/s of aggregate bandwidth, the fastest throughput on this list for dense models.
936 GB/s is the bandwidth of one RTX 3090, not eight of them. An 8× RTX 3090 setup has about 7.49 TB/s of aggregate memory bandwidth, assuming you sum per-card bandwidth the same way the article sums it for other multi-GPU builds.
Full reasoning
NVIDIA's official Ampere documentation gives the GeForce RTX 3090 a memory bandwidth of 936 GB/sec per card. The article describes an 8× RTX 3090 configuration, so using the same 'aggregate bandwidth' logic applied elsewhere in the piece, the total is 8 × 936 GB/s = 7,488 GB/s (about 7.49 TB/s), not 936 GB/s.
That means the article is understating the 8-card system's aggregate bandwidth by a factor of eight. This matters because the same bandwidth figure is later used to compare the 8×3090 build against other multi-GPU and single-card systems.
1 source
- NVIDIA Ampere GA102 GPU Architecture Whitepaper
NVIDIA's whitepaper states that GDDR6X delivers "936 GB/sec of peak memory bandwidth in the GeForce RTX 3090" and lists "Memory Bandwidth (GB/sec) ... RTX 3090 FE 936 GB/sec."
One card is ~$7-10k and gives you 96GB of VRAM at roughly 1,700 GB/s, faster per-card bandwidth than the entire 8× 3090 build.
An RTX PRO 6000 Blackwell is not faster in memory bandwidth than an entire 8× RTX 3090 setup. NVIDIA lists the RTX PRO 6000 at about 1.8 TB/s, while eight RTX 3090s total about 7.49 TB/s if you aggregate per-card bandwidth.
Full reasoning
NVIDIA's RTX PRO 6000 Blackwell datasheet lists 96 GB of memory and 1.8 TB/s of memory bandwidth per card. NVIDIA's Ampere documentation lists the RTX 3090 at 936 GB/s per card. For an 8× RTX 3090 machine, that works out to 7,488 GB/s (about 7.49 TB/s) in aggregate.
So while a single RTX PRO 6000 Blackwell has higher per-card bandwidth than a single RTX 3090, it is not faster in bandwidth than an entire 8-card RTX 3090 build. The comparison in the article mixes up per-card bandwidth with system-wide aggregate bandwidth.
2 sources
- NVIDIA RTX PRO 6000 Blackwell Workstation Edition Datasheet
The datasheet lists the RTX PRO 6000 Blackwell Workstation Edition with "96 GB of GDDR7 memory" and "1.8 TB/s of memory bandwidth."
- NVIDIA Ampere GA102 GPU Architecture Whitepaper
NVIDIA states the GeForce RTX 3090 has "936 GB/sec of peak memory bandwidth" and lists "Memory Bandwidth (GB/sec) ... RTX 3090 FE 936 GB/sec."
By bandwidth it punches well above the 8×3090 build described above, at a fraction of the assembly headache.
Not by bandwidth: tinygrad's own tinybox red v2 page lists 2,560 GB/s aggregate GPU RAM bandwidth, while eight RTX 3090s would total about 7,488 GB/s if aggregated the same way.
Full reasoning
tinygrad's official tinybox page lists the tinybox red v2 at 2,560 GB/s of aggregate GPU RAM bandwidth. NVIDIA's official Ampere documentation lists the RTX 3090 at 936 GB/s per card. Using the same aggregation approach the article uses elsewhere, an 8× RTX 3090 build totals 7,488 GB/s (about 7.49 TB/s).
So the tinybox red v2 does not 'punch well above' an 8×3090 build on bandwidth. It may be easier to buy and assemble, and it may compare favorably on other dimensions, but on raw aggregated memory bandwidth the 8×3090 setup is substantially higher.
2 sources
- tinygrad.org tinybox page
tinygrad lists the tinybox red v2 with "GPU RAM bandwidth 2560 GB/s."
- NVIDIA Ampere GA102 GPU Architecture Whitepaper
NVIDIA states the GeForce RTX 3090 has "936 GB/sec of peak memory bandwidth" and lists "Memory Bandwidth (GB/sec) ... RTX 3090 FE 936 GB/sec."
Mac M3 Ultra 512GB 512 GB unified ~300 GB/s
The M3 Ultra bandwidth figure in the table is far too low. Apple lists M3 Ultra Mac Studio at 819 GB/s of unified memory bandwidth, not roughly 300 GB/s.
Full reasoning
Apple's official Mac Studio technical specifications list the M3 Ultra configuration at 819 GB/s memory bandwidth. Apple also describes the M3 Ultra Mac Studio as delivering over 800 GB/s of unified memory bandwidth in its newsroom announcement.
That means the table entry showing the M3 Ultra system at ~300 GB/s understates its memory bandwidth by well over a factor of two. The ~300 GB/s figure is not consistent with Apple's published specs for M3 Ultra.
2 sources
- Mac Studio - Technical Specifications - Apple
Apple lists the M3 Ultra Mac Studio as having "819GB/s memory bandwidth."
- Apple unveils new Mac Studio, the most powerful Mac ever - Apple
Apple says the M3 Ultra Mac Studio has "a high-bandwidth memory architecture that delivers over 800GB/s of unified memory bandwidth."