About This Project

An educational resource for understanding how changes to training data affect AI model behavior.

Work in Progress

What is This?

Data Counterfactuals is an interactive explainer and research reference designed to help people understand the relationship between training data and model outcomes. It proposes a unifying framework that connects several research areas that are often studied separately:

Data valuation — How much is each data point worth?
Influence functions — Which training examples affect a prediction?
Scaling laws — How does more data improve performance?
Data poisoning — How can bad data harm a model?
Collective action — What leverage do data creators have?

The core insight is that all of these questions are asking variations of the same thing: "What would happen if we trained on different data?" (Just to clarify: the term "data counterfactual" and the grid metaphor are pedagogical tools we developed to make these connections clearer—they are not established terminology in the literature.)

Current Status

This project is under active development. The interactive Grid explorer works, but many features are still being refined. The paper collections are curated but not comprehensive — they represent a starting point for exploring these research areas, not a complete literature review.

Contributions, corrections, and suggestions are welcome.

Join the Discussion

Have questions, feedback, or suggestions? Join the conversation:

GitHub Discussions — Threaded conversations with markdown support, reactions, and notifications.
WhiteWind — Long-form posts on the ATProto/Bluesky network.

You can also open issues or pull requests directly on the GitHub repository.

Who Made This?

This project is part of a broader effort to make AI/ML concepts more accessible. It's open source and welcomes contributions.

View the source on GitHub

Related Projects

Shared References