This repository contains supporting datasets and analysis code for several of our papers evaluating the use of artificial intelligence to enhance electronic textbooks at scale. These projects include automatic question generation (AQG) as well as other generative AI–based features such as text simplification. All datasets are drawn from real student interactions in the VitalSource Bookshelf ereader platform.
Our earliest research focused on AQG as a method for adding formative practice to textbooks. Millions of automatically generated questions have been added to thousands of textbooks in Bookshelf as part of a free study feature called CoachMe. CoachMe is based on the Doer Effect, the learning science principle that students who practice as they read achieve better learning outcomes than those who only read. Our efforts have since expanded beyond AQG to include other generative AI–based interventions that support student learning and engagement. All of our published research papers can be found on our research site.
The datasets available are:
Unless otherwise noted, our datasets are available under the Creative Commons Attribution 4.0 International License.
If you have questions, please feel free to email [email protected].