Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The 'genres' emerge out of the clusters. The fact that you can pick out 'genres' from this plot is an example of semi-supervised learning.


OK but there's no clue what they are until you start poking around. It's not something that's useful without some substantial investment of time and exploration. If that's the goal, fine but it's not how I would present a "favorite books" list.


Making 'genres' and classifying things is imprecise and requires experts in subject matter and library science to get it right. The "genre" labels here emerge out of the data itself.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: