Exploratory Data Analysis

Understanding data from the Election Commission of India

The 2004 Lok Sabha Elections

Flag of India
The Lok Sabha is the lower house of Parliament of India (The Rajya Sabah is the upper house). Candidates are elected directly. The house is composed of proportionate numbers of representatives from each territory and state based on population. The Lok Sabha website states that "the total elective membership is distributed among the States in such a way that the ratio between the number of seats allotted to each State and the population of the State is, so far as practicable, the same for all States". This site also makes use of State Assembly Election data and turnout data in order to better understand how women have faired in past elections.
Indian women


Indian election ballot


Indian men lined up to vote

Background - Women’s Reservation Bill, India 2010

After over 14 years of attempts to pass a reservation quota for female electoral candidates, on Tuesday, March 9th 2010 India’s upper house of parliament, the Rajya Sabha, passed a bill to reserve 33% of local, state assembly and parliamentary seats. While it is still too early to understand how the bill will affect Indian voting patterns, it has nonetheless received praise from a wide array of political leaders, including Manmohan Singh, India’s Prime Minister and member of the Indian National Congress. The INC’s typical opponent, the BJP has also lauded the nascent law. The passage of such a bill is not a surprising move for the Indian government, which holds political reservations for other groups considered to be disadvantaged, including Scheduled Castes (SC), Scheduled Tribes (ST) and in the tiny state of Sikkim, the Bhutia Lepcha ethnic group.

The passage of this bill has met with wide political opposition from parties such as the BSP, that claim further quotas within the reservation should be allotted for SC, ST and muslim women. The BSP, a party formed in 1984 to represent the interests of OBCs (other backwards castes, scheduled tribes, scheduled castes), refused to vote on the bill, walking out on parliament. Parties representing similar interests such as the JD(U), RJD, the SP (Samajwadi Party) and Shiv Sena have all voiced opposition to the bill for similar reasons. However, these parties are the minority in the Rajya Sabha with a total of only 24 votes.

Opposition can also be found on blogs, opinion pieces and news polls. Many of these pieces cite that the reservation could be misused for political purposes, that uplift of women should occur at the grassroots level, and most simply, that women already hold places of considerable power in Indian government. In essence, there are an innumerable number of positions that give good reason to both support and oppose the bill. The question I attempt to explore using prior electoral data, is how parties are arriving at their decisions to support the 33% reservation using visualizations that speak to the natural state of the data.

This project uses R and Hadley Wickham’s ggplot2 to create visualizations utilizing a method called grammar of graphics (GoG). GoG, popularized by Leland Wilkinson proposes that the rules which govern display of data in graphical format are both mathematical and aesthetic. GoG incorporates elements of Object Oriented Design and allows us to present complex multivariate data in new and sophisticated ways. With this, a few warnings are in order.

Current Indian electoral data is incredibly hard to come by. The election commission is slow to release data from recent elections. For instance, data from the 2008 general elections is non existent. A page organized for the 2004 elections contains a considerable amount of easy to use Excel files. However, these files are obviously sub queries from a much larger database. Lastly, the lack of continuous data here (especially time) makes developing wide varieties of visualizations a particularly tricky issue.

Videos