A first look at WikiCrowd
I have quite enjoyed the odd contribution to an app by Google called Crowdsource. You can find it either on the web, or also as an app.
Crowdsource allows people to throw data at Google in controlled ways to add to the massive pile of data that Google uses to improve its services and at the end of the day beat its competition.
It does this by providing a collection of micro contribution tasks in a marginally gamified way, similar to how Google Maps contributions get you Local Guide points etc. In Crowdsource you get a contribution count, a level, and a metric for agreements.
While I enjoy making the odd contribution when bored out of my mind and enjoy looking at the new challenges (currently at 2625 contributions), I always think that data like this should just be going out into the world under a free licence to benefit everyone.
So finally, introducing WikiCrowd, an interface, and soon to be app, that I developed over the new year period.
WikiCrowd Overview
WikiCrowd is hosted on toolforge and can be found at https://wikicrowd.toolforge.org/ (Source code on Github)
In order to contribute, you need some knowledge of the world, a Wikimedia account and that’s it!
I’m very aware there are an infinite number of things similar to this in the world, even within the Wikimedia space, but this is my take, with a few small existing differences, and some that are in the works.
The tasks that are presented are intended to be achievable by anyone, with very little context needed about what Wikipedia / Wikimedia is, how structured data works there, and what is actually happening behind the scenes.
There is currently a very heavy bias towards image depictions, as it turns out that Wikimedia Commons categories are a fairly good starting point for easy data sets, and people enjoy looking at images.
When selecting a category, you get presented with repeated questions, with a simple yes, no, maybe option.
Currently, if you hit “Yes”, and edit under your account will happen shortly after submission. “No” and “Maybe” are recorded, but for now nothing else will happen.
The idea here is that people like simple tasks, focusing on a single thing to identify in a picture.
Other views exist, such as proposing Wikidata statements from lead text of Wikipedia articles, which has been taken from the Wikidata Game by Magnus.
At the time of writing this post, 25 people have tried the tool out in the 5 days since it was put online. There are around 10k questions currently in the system (being expanded daily), and 5.3k of them are already answered leading to 3.7k edits on Wikimedia Commons and Wikidata.
The future
The easiest thing to see in the future is the continued expansion of the categories used for image depicts questions. If you have any ideas leave a comment on this post or reach out to me (perhaps on Twitter).
Really I want this to be a native phone app, which will bring offline access, faster access, and have it a little closer to folks fingertips. I needed a backend API before getting there, and accidentally created a preliminary UI in the process. I’m currently working on a phone app using flutter.dev.
Currently, all “Yes” responses immediately result in edits, but I want to introduce a concept of agreement. In most cases, this would be 2 or 3 yes responses = 1 edit, which should increase accuracy and avoid mistakes by fast fingers.
Likewise, all “No” or “Maybe” responses currently don’t get shown to anyone a second time. I’d like to change that for the same reason.
I feel that there is some value in the negative responses being public somehow. I’m sure when training machine learning models people would also like the negatives, but I’m not sure how I may expose those yet.
[…] looking working on my recent WikiCrowd project I ended up looking at the ontological tree of both Wikidata entities and Wikimedia Commons […]
I’m not sure where you want feedback. I’ve only played with the alias ones.
https://imgur.com/a/bSoPE6G
It’s easier with an image. Obviously bathurst isn’t an alias – it’s a maiden name, but we need the forenames too. The answer is No, but if I could edit the submission it could be Yes.
The real problem I have is that I cannot tell whether the wikidata entry is for the same Linda Marlowe. Is it always the wikidata item linked to the article? I’ve found myself googling to get the wikidata item. I ned that extra information to be sure.
Another one https://imgur.com/a/vOsRfPW
Which 4 is that? Do you want me to take on trust that the wikidata entry Q1929961is the foreigner album 4, rather than all the other possible things called 4?
Thanks for both comments
Would you re able to write up your thoughts as issues on https://github.com/addshore/wikicrowd ?
Then we can look at tweaking things moving forward
I’m just about to re populate the depicts questions on the tool too!
[…] January 2022 I published a new Wikimedia tool called […]
[…] addshore/wikicrowd (GitHub, Blog) […]