Natural language processing for the social sciences and humanities