All posts filed under “code

R datasets for historians by Lincoln Mullen

Lincoln Mullen has authored (and contributed to the ROpenSci project) datasets useful for historians learning to work with R: https://github.com/ropensci/historydata These sample data sets are intended for historians learning R. They include population, institutional, religious, military, and prosopographical data suitable…

MALLET

You can retrieve MALLET here, although in the future you should probably get it directly from David Mimno’s github repository. MALLET can be used for more than just topic modeling, so do explore the documentation at the UMass page (the first…

The Topic Modeling Tool GUI

Available in a Google Code repository, the “Topic Modelling Tool” (TMT) provides quick and easy topic model generation and navigation. Compared to MALLET, the TMT is easy to install and use. We have a section on this in our book. For…

R

R is an extremely versatile language. Download it from here and then use the lovely R Studio to make it more useful for you. Now, one of the tasks we put R to in our book was to topic model materials….

Scripts for the Stanford Topic Modeling Toolbox

We use two example scripts from the Stanford Topic Modeling Toolbox website. In case something happens to them, we re-host them here. Here is http://nlp.stanford.edu/software/tmt/tmt-0.3/examples/example-2-lda-learn.scala. // Stanford TMT Example 2 – Learning an LDA model // http://nlp.stanford.edu/software/tmt/0.3/ // tells Scala where…