This could also be a bias in the google books dataset. As I recall, the older google books dataset relied heavily on library preservation. It's probable that library curation would filter books for utility, longevity, and educational purpose. These filters would certainly penalize informal books.