I have compiled the statistical information of the Indian language Wikipedias for the month of 2010 March. The PDF of the report is available at Indian language Wikipedias – Statistical report – 2010 March
Since the 2010 March dumps for some of Indian Language Wikipedias like Hindi, Marathi, Gujarathi, and so on are not yet available, the information (for some parameters) for those wikipedias are not included in this report. Hope the information will be available soon. But since it is already March 25, I do not want to wait until the information is available. I can include the updated information in the report for April month.
The data for this report is taken from the statistical analysis of all the WikiMedia wikis prepared and maintained by Erik Zachte (Website: http://infodisiac.com/). The statistics is available at http://stats.wikimedia.org/EN/Sitemap.htm.
The data for this report is collected on the last day of the month. That is, the statistical data for the month of 2010 January is collected at 2010 January 31 23:59 PM GMT.
The report is divided into two different sections.
- Statistical report of Wikipedias
- Localization status of Mediawiki software
Following are the different topics covered under each section.
1. Wikipedia Statistics
Article statistics
- Number of Articles
- Number of Edits
- Break up of edits
- Edits per article
- Average size of an article (bytes)
- Database size (in Mega Bytes)
- Percentage of articles with size greater than 500 bytes
- Percentage of articles with size greater than 2000 bytes (2 kilobytes)
User Statistics
- Number of active wikipedians
- Page views per month (All figures in Lakhs/month)
- MediaWiki Localization Statistics
2. Media Wiki Localization status (percentage)
The information of the following Indian language wikipedias is included in this report.
- Assamese (http://as.wikipedia.org)
- Bengali (http://bn.wikipedia.org)
- Bhojpuri (http://bh.wikipedia.org)
- Bishnupriya Manipuri (http://bpy.wikipedia.org)
- Burmese (http://my.wikipedia.org)
- Gujarathi (http://gu.wikipedia.org)
- Hindi (http://hi.wikipedia.org)
- Kannada (http://kn.wikipedia.org)
- Kashmiri (http://ks.wikipedia.org)
- Malayalam (http://ml.wikipedia.org)
- Marathi (http://mr.wikipedia.org)
- Nepali (http://ne.wikipedia.org)
- Nepal Bhasha/Newari (http://new.wikipedia.org)
- Odia (Oriya) (http://or.wikipedia.org)
- Pali (http://pi.wikipedia.org)
- Punjabi (http://pa.wikipedia.org)
- Sanskrit (http://sa.wikipedia.org)
- Sindhi (http://sd.wikipedia.org)
- Sinhala (http://si.wikipedia.org)
- Tamil (http://ta.wikipedia.org)
- Telugu (http://te.wikipedia.org)
- Urdu (http://ur.wikipedia.org)
Some of the above languages are not spoken in the present day India. Even though my focus is on Indian language wikipedias, I have included almost all the languages from Indian Sub continent. One of the main reason is that all the above languages belong to the same language family (either Aryan or Dravidian).
In the statistical report, the above languages are divided into 3 groups based on the number of articles in the respective Wikpedia. This division is done purely for comparing apples with apples
. There is no meaning in comparing a Wikipedia with 50,000 articles (for example, Hindi Wikipedia) with another wikipedia with number of articles less than 1000 articles (for example, Assamese Wikipedia). Following are the groups and the languages that fall under each group.
Group 1 (More than 10,000 articles)
- Nepal Bhasha/Newari
- Hindi
- Telugu
- Marathi
- Bishnupriya Manipuri
- Tamil
- Bengali
- Gujarathi
- Urdu
- Malayalam
Group 2 (More than 1,000 articles)
- Kannada
- Sanskrit
- Nepali
- Burmese
- Sinhala
- Pali
- Bhojpuri
- Punjabi
Group 3 (Less than 1,000 articles)
- Odia (Oriya)
- Kashmiri
- Sindhi
- Assamese
The PDF of the report is available at Indian language Wikipedias – Statistical report – 2010 March
The link to the statistical report (PDF) for the past months is provided below.
- Indian language Wikipedias – Statistical report – 2010 February
- Indian language Wikipedias – Statistical report – 2010 January
I hope this initiative will improve the interaction between different Indian Language Wikipedias and wikipedians. We (Malayalam Wikipedians – http://ml.wikipedia.org) are maintaining a similar comparison study of the major Indian Language wikipedias for the past two years. This study has helped us to understand the status of Malayalam Wikipedia as compared to other Indian Language Wikipedias. I hope this report will help other Indian language wikipedias also.

Its a great language. Just love it.