is a web-based text reading and analysis environment. It is a scholarly project that is designed to facilitate reading and interpretive practices for digital humanities students and scholars as well as for the general public. Voyant Tools is an open-source project and the code is available through GitHub.
supports large-scale computational analysis of the works in the HathiTrust Digital Library to facilitate non-profit and educational research. HTRC Analytics includes:
is a system which locally harvests/caches content you specify. It then transforms the content into plain text, performs sets of natural language processing & text mining against the text, saves the results in a number of formats, reduces the whole to a cross-platform database file, queries the database thus summarizing the collection, zips the results of the entire process into a single file, and makes the file available to you for further investigation.
is an open source, Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text. Some knowledge of Java code needed.
is an integrated development environment (IDE) for R. It includes a console, syntax-highlighting editor that supports direct code execution, as well as tools for plotting, history, debugging and workspace management. RStudio Desktop, open source edition, is free and can be downloaded to your computer. R has a strong online support community, but the learning curve is steep for beginning programmers.