White Paper

The final report of the Public Interest Corpus project is available as a stable, citable version, here: The Public Interest Corpus: A Framework for Implementation. Because we want to encourage additional feedback, we’re also sharing a version here that allows you to comment. 

The report is the product of more than a year of work supported by the Mellon Foundation, in which we asked how research libraries can make books data available for AI training and computational research in ways that serve the public interest, rather than reinforcing the existing concentration of access to texts among a small number of well-resourced commercial actors.

This report and the Public Interest Corpus project as a whole benefited from the input of a large number of researchers, authors, librarians, technologists, publishers and lawyers.

This work is licensed under a CC-BY 4.0 license. https://creativecommons.org/licenses/by/4.0/deed.en
© 2026 Authors Alliance & Northeastern University