Open Justice India
About
Open Justice - India is a project to open up law datasets in India. This includes, but is not limited to:
- Court Judgements and Orders, across all levels (Supreme Court, High Courts, District Courts) in both PDF and text formats
- Structured Data about cases
- Causes Lists, and Hearing data, about what cases are heard when.
- Indic Language support.
Open Justice aims to publish such data across all court levels continually on a cadence - akin to how Common Crawl does a monthly release.
Join Us
- Join the mailing list and introduce yourselves.
- We have a Signal Group for quick communications and announcements.
- We host a community call over Zoom once every 2 weeks. Subscribe to the mailing list or Signal group to get notified about the next one.
Our Projects
ecourts
Python library to help scrape Indian Court Orders from the ecourts website. Published on PyPi. Under active development.
Also See
This is not the first attempt at opening up Indian Legal data, there's a bunch of such efforts already. This is a collaborative project, and we seek to build on the existing body of work by these many institutions.
- India Justice Report
- The India Justice Report is a national periodic reporting that attempts to measure the capacity of four pillars of the justice system - the police, the prison system, the judiciary and legal aid—in each state, against its own declared standards or benchmarks.
The raw data behind the India Justice Report is not published.
- Justice Hub by CivicDataLab, build via the Agami Data for Justice Challenge 2019
- An open source platform for data related to the Indian legal and justice system. Crowdsourced from a community of law and data researchers, practitioners and enthusiasts.
- Judicial Dataset by Development Data Labs
- A public dataset describing 81 million case records from the Indian e-Courts platform. It covers India's lower judiciary -- all courts including and under the jurisdiction of District and Sessions courts. Built for a paper on evidence of Judicial Bias, for which 5 Million of these cases were used.
- DAKSH High Court Data
- Case and hearing records for 23 out of 25 High Courts across
India scraped through the eCourts app. A subset of data from six High Courts over a
10-year period was published. They've written about their preparation
note and how they tagged Criminal/Civil
cases.
Note: The dataset is behind a form submission, which fails so this is inaccessible currently.
- Indian Legal Documents Corpus for CJPE
- The ILDC dataset is a corpus of 35k Indian Supreme Court cases annotated with original court decisions. Made up primarily by a India Kanoon scrape. Available for Research and non-commercial purposes by request.
- Judicial Data Collaborative
- JDC aims to drive and sustain advocacy on the quality and limitations of Indian judicial data and engage the judicial data community to enable cross-learning among various projects. In particular, they maintain a listing of various Judicial Data Projects in India and a Judicial Data Wiki for definitions and taxonomies.
Why
Explainer coming soon...