A watershed in the history of Wikipedia?

22 March 2007 by sage

Now may be as arbitrary a time as any other to identify as a watershed in the history of Wikipedia, but it seems like things are changing in a number of ways.

The most obviously change is in public perception and media coverage. Between the Middlebury College story, the Essjay story, and the Sinbad story, Wikipedia has been a constant presence in the headlines for several weeks running. With the possible exception of Essjay, none of these is even close to the significance of the Seigenthaler controversy, but the volume of related news and blog noise since late January (when the first of these stories emerged) has been as large or larger.

The remarkable thing about the Middlebury story is that it’s the only one like it; all the media stories have focused on it because no similar policies (beyond individual professors) have been enacted. The history department banned citations of Wikipedia, but actually endorsed Wikipedia as a starting point for research (and held an excellent recorded debate which highlighted the non-sensational reality of Middlebury situation, and includes an eloquent argument for the pedagogical value of Wikipedia). Meanwhile, many other professors have been interviewed by student newspapers (e.g., 1, 2, 3, 4, 5, 6, and many more) and professional news organizations, and written their own defenses of Wikipedia in venues like The Chronicle of Higher Education and The New Republic. The second Digital Campus podcast has a follow-up on the previous Wikipedia discussion I mentioned, with some commentary on a few of these stories; apparently, reporters are practically knocking these professors’ doors down requesting Wikipedia-related expert opinions. Professor have by and large become familiar enough with Wikipedia to respect its strengths and not project too many of the weaknesses they expect.

Other Wikipedian bloggers have covered the recent challenges to and discussions about Jimbo’s nebulous role in Wikipedia governance; see Joseph Reagle and Stephen Bain for more.

Another side to the watershed, which nobody is quite recognizing yet, relates to the limits of Wikipedia. The exponential phase of (English) Wikipedia’s growth (in terms of number of articles, and in terms of number of active users) is probably over. From 2003 to mid-2006, the number of articles had followed a very regular exponential pattern. Had exponential growth continued, it would have hit 2,000,000 a few weeks ago; it just passed 1,700,000 today. The average number of articles created per day since late December (around 1724) has actually been lower than the average number per day over the previous year (1823). This difference is only partly the result of the always slower holiday season. It seems that available unwritten encyclopedic topics is becoming a significant constraint.

The number of active users is harder to gauge, since Erik Zachte’s statistics page has not been updated for the English Wikipedia since mid-October. However, we can probably look to the German Wikipedia as a rough analog, since German Wikipedia seems to have a larger level of market saturation, when you account for the ratio of English-speakers to German-speakers (~8:1). The number of “active“(5 edits per month) and “very active” (100 edits per month) German Wikipedians seems to have plateaued in August 2006, at about 7500 and 1000 respectively. At that time, English Wikipedia had around 44,000 actives and 4500 very actives. If English Wikipedia’s active community has continued to expand towards cathcing up with German ratio-wise, we could expect the number of very active Wikipedians to max out around 8000 somewhere near the end of 2007. However, unlike English, German Wikipedia has had a near linear growth curve since early 2004 in terms of number of articles. I don’t know why article number has grown linearly while editor numbers were growing exponentially (until they plateaued), but it seems likely that because of topic saturation, English Wikipedia will plateau (or peak) in terms of editor:speaker ratio at a lower level than German. Consistent with the watershed thesis, my guess is that active community size is plateauing right now.

Of unknown but likely relevance to the watershed, two central Wikimedia employees announced their resignations today (apparently for unrelated reasons): Danny Wool and Brad Patrick. Both have implied that as independent Wikimedians they will, in the immediate to intermediate future, be bringing forward some constructive criticisms of the way the Wikimedia Foundation runs.

Other recent Wikipedia reading material:

Possibly related posts:

  1. Presentism and the history of science on Wikipedia
  2. History of science viewing stats on Wikipedia
  3. Narrative history vs. Insightful history, Time and Space
  4. History of science manifesto
  5. Back from History of Science Society meeting

Posted in Wikipedia | 2 Comments »

2 Responses to “A watershed in the history of Wikipedia?”

  1. David Gerard says:

    What’s the total of articles created each day, not articles surviving? I still see well over half the articles created each day being shot on sight.

    I think the reason for the increased media coverage is: we’re a top 10 website. We’re mainstream.

  2. Sage says:

    David, you estimate of “well over half” getting deleted looks accurate to me. My back-of-the-envelope calculation is as follows:

    Out of the first 100 entries in the deletion log, 58 were article deletions. Over 24 hours, there were approximately 4000 entries in the log. So if about 2400 articles are deleted each day, and the net of new articles is about 1800, then somewhere around 60% of new articles get deleted. The largest share of these are deleted under CSD A7, for not asserting notability/importance.

leave a comment

recent comments

  • Jana: It is perfect time to make some plans for the future and it’s time to be happy. I have read this post and if I could I desire...
  • Nihiltres: I’ve experienced an interesting counterexample. I was given a pirate copy of “Minecraft” by my younger...
  • sage: I’m not trying to justify copyright infringement, which certainly can harm creators in many cases. But it’s a lot more...
  • Jason Waggoner: Actually, they are both theft, strictly speaking. In the first scenario, you are technically robbing the copywrite owners...
  • Darryl: There’s no mention of what might be the one best feature of Vignette, that you can set it to save the original unprocessed...

Popular Posts

archives

categories

ragesoss dents

  • But when many anecdotes complement each other, and editors pull that out, you get something cool that no current review database can match. - January 28th, 2012 at 8:46 PM
  • As I imagine it, wiki-like curation would be an important element. Amazon reviews are useful not just for ave. stars, but anecdotes. - January 28th, 2012 at 8:42 PM
  • RT@evan Someday knowing the ins and outs of copyright will be like knowing the intricate rules of internal passports in Communist East ... - January 28th, 2012 at 8:10 PM
  • Federation makes a lot of sense... pre-existing communities centered on X (e.g., Android devices) just set up an instance and federate in. - January 28th, 2012 at 8:06 PM
  • Cool! My development skills are minimal, but I'm trying to pick up Django by building a rudimentary review site. - January 28th, 2012 at 8:04 PM
  • Why we need a free culture, community-run review site, independent of any merchant: http://ur1.ca/7x7cn Anybody want to start one with me? - January 28th, 2012 at 12:39 PM

follow me on Identi.ca
www.flickr.com
ragesoss' items go to Sage's photostream

free culture

history of science, etc.

miscellanea

ragesoss feeds