From the Crawford Lab: We have developed a set of text-mining algorithms to extract education and occupation, both important variables that describe socioeconomic status (SES), from electronic health records. The development and evaluation of the algorithm is described in PMC5147499, and the exclusion, jobs, and prefix lists developed for this algorithm can be found here. Detailed usage of the package can be found on the github site.