Editing Grokipedia, a first look

As a long-time editor and developer in the Wikipedia and Wikimedia space, I’m of course sceptical about what Grokipedia is trying to be, and whether it stands any chance of success. It may struggle to deliver the resilience, transparency, and community processes that keep projects like Wikipedia thriving, and in the early weeks the untouchable, AI-generated content was certainly not a sustainable approach going forward.

However, in the last week or so editing became an option, hidden behind Grok as a safeguard against abuse.

I thought I’d have a look at editing a few areas of content to see what the experience is like, and capture some of the good and bad points.

In no particular order…

Broken link formatting

A fix attempt

The Donald Trump article has some broken formatting, which looks like an incorrectly parsed or formatted Markdown link that is now showing as raw text in the HTML of the page. For posterity, I captured a copy of this version of the page on archive.ph, but here is a snapshot of how it appears.
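For readers who haven’t seen this kind of breakage before, here is a hypothetical illustration (not the actual text from the article) of how a malformed Markdown link ends up leaking into the rendered page:

```markdown
<!-- Well-formed: renders as a clickable link -->
[Donald Trump](https://example.com/donald-trump)

<!-- Malformed (missing closing parenthesis): many renderers fall back
     to passing the raw syntax straight through into the HTML -->
[Donald Trump](https://example.com/donald-trump
```

The link text and URL above are placeholders; the point is only the shape of the failure, where the bracketed source syntax appears verbatim to the reader instead of a link.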

Read more

Slop in, craft out?

Earlier today, I sent this absolutely perfectly crafted piece of slop into GitHub Copilot…

Right, but i want thje patche sot be / and /* always

And, as I already expected from using these LLM-based coding agents and assistants continually throughout their evolution, the resulting change was exactly what I wanted, despite the poor instructions.

Now, I’m sure there is actually some difference, and it likely depends on the relevance of the typoed areas, and how often such typos also appear in training data.

Why is this, you might ask?

Read more

AI Code assistant experience comparison (golang-kata-1)

If you’re reading this and thinking about trying an IDE-integrated coding agent, or thinking about switching, maybe stick around, have a read and watch some of the videos. There are at least 6 hours’ worth of experience wrapped up in this 20 minute read!

I’m watching a thread on the GitHub community forums, where people are discussing how GitHub Copilot has potentially gone slightly downhill. In some ways I agree, so I thought I’d spend a little more time looking at the alternatives, and how they behave.

This post compares 9 different setups, primarily looking at how each coding assistant presents itself within the VS Code IDE: how the default user interactions work, how tasks are broken down and presented to the user, and generally what the user experience is like between the different assistants.

I’ll try to flag up some other useful information along the way, such as time comparisons, amount of human interaction needed, and overall satisfaction with what the thing is doing, and if this all presents itself nicely in this post, I might find myself writing more in the future…

However, I will not be looking at cost, setup, resource usage or what’s happening with my data along the way…

Assistant and LLM combinations

| Assistant | Model | Main tasks | Tests | Second app |
| --- | --- | --- | --- | --- |
| Github Copilot | GPT 4o | ~ 5:00 | ~ 24:45 | ~ 32 |
| Github Copilot | GPT 4.1 | ~ 15:00 | ~ 17:40 | ~ 35 |
| Github Copilot | Claude Sonnet 4 | ~ 17:00 (inc tests) | ~ 17:00 | ~ 28 |
| Gemini Code Assistant | Gemini Something ? | ~ 11:20 | ~ 14:30 | ~ 25 |
| AmazonQ | Claude Sonnet 4 | ~ 7:20 | ~ 15:50 | ~ 28 |
| Roocode | GPT 4.1 (via Github Copilot) | ~ 5:30 | ~ 10:00 | ~ 18 |
| Roocode | Claude Sonnet 4 (via Anthropic) | ~ 15:30 | ~ 20:00 | ~ 37 |
| Claude Code | Claude Sonnet 4 | ~ 9:30 | ~ 17:40 | ~ 24 |
| Claude Code | Claude Opus 4 | ~ 10:00 | N/A | N/A |

I have set up this post, and the code problem, in such a way that I should be able to easily add more combinations and comparisons in the future, and directly compare their performance back to this post. Ideally, at some stage I’d try some other models via Ollama, and also some other pay-per-request LLM APIs…

Read more