Lexeme and MediaInfo, implementing EntityDocument
As we continue the journey, looking at Entity and EntityDocument within Wikibase, another useful thing to look at are the third and fourth widely used (at least within the Wikimedia space) entity types for Wikibase. Both of these entity types make use of the EntityDocument, with none of the old assumptions baked into the Entity…
Wikibase, from Entity to EntityDocument
The term document has already come up a few times while discussing what a Wikibase entity is, and if that should change (be that in name only, code or structures), including in my first post of this series. Looking at the very first definition of entity in the duck duck go search that I performed…
Wikibase: What is an entity?
I left the Wikidata and Wikibase teams roughly a year ago, and at the time there were some long and deep discussions going on inside the team trying to define what an entity was, and what should and should not be an entity. At the recent Hackathon in Tallinn, this topic resurfaced to me, as…
Creating properties with statements using Wikidata Integrator
Wikidata Integrator is a Python library that simplifies data integration from Wikidata (and other Wikibases). It is written in Python, is focused on Wikibase concepts (as opposed to some libraries which are MediaWiki focused) and has a user-friendly interface. I’m currently working on a demo Wikibase and decided to bring all of the data into…
Wikibase and reconciliation
Over the years I have created a few little side projects, as well as working on other folks’ Wikibases, and of course Wikidata. And the one thing that I still wish would work better out of the box is reconciliation. What is reconciliation In the context of Wikibase, reconciliation refers to the process of matching…
What is Wikibase Cloud (according to ChatGPT)
In this post, I’ll be asking ChatGPT a bunch of questions about Wikibase Cloud. mainly to see how accurate it manages to be, and how helpful the information is. Messages to Chat GPT will be in bold, things that are wrong, in red, and things that are right in green. Firstly, “What is Chat GPT?“…
Wikidata query service updater evolution
The Wikidata Query Service (WDQS) sits in front of Wikidata and provides access to query its data via a SPARQL API. The query service itself is built on top of Blazegraph, but in many regards is very similar to any other triple store that provides a SPARQL API. In the early days of the query…
Wikidata maxlag, via the ApiMaxLagInfo hook
Wikidata tinkers with the concept of maxlag that has existed in MediaWiki for some years in order to slow automated editing at times of lag in various systems. Here you will find a little introduction to MediaWiki maxlag, and the ways that Wikidata hooks into the value, altering it for its needs. As you can…
Small commits
There are many blog posts and articles out there about making small git commits. I’m sure most people (including me) bring up the same few topics around why small commits are good and why we should all probably be making smaller commits. In this post, I’ll look at some of the key topics from my…
Wikibase a history
I have had the pleasure of being part of the Wikibase journey one way or another since 2013 when I first joined Wikimedia Germany to work on Wikidata. That long-running relation to the project should put me in a fairly good position to give a high-level overview of the history, from both a technical and…