Abbot is a tool for undertaking large-scale conversion of XML document collections in order to make them interoperable with one another. In particular, Abbot can make one or more collections conform to a designated schema (including a schema used to define one of the collections).
By default, Abbot converts documents into TEI Analytics, a TEI subset designed for text analysis applications.
ArcGIS is a suite of software that comprises Desktop GIS, Server GIS, Mobile GIS, and Online GIS. ArcGIS is a platform for building a complete geographic information system (GIS) that lets you easily create, edit, and analyse geographic knowledge on the desktop; publish data, maps, globes and models to a GIS server and/or share them online; and use them on the desktop, on the Web, or in the field.
CorpusSearch 2 allows users to construct and search syntactically annotated corpora, including finding and counting lexical and syntactic patterns, correcting systemic errors, and coding linguistic features.
EXMARaLDA (Extensible Markup Language for Discourse Annotation) is a system of concepts, data formats and tools for the computer-assisted transcription and annotation of spoken language, and for the construction and analysis of spoken language corpora.
Graphviz is open source software for graph visualization, representing structural information as diagrams of abstract graphs and networks. The package includes web and interactive graphical interfaces, and auxiliary tools, libraries, and language bindings.
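As a rough illustration of the "abstract graph" input that Graphviz lays out, the sketch below builds a small graph description in the DOT language using plain Python. The helper name `to_dot` and the sample node labels are hypothetical, chosen only for this example; the code does not call Graphviz itself and simply produces text that its `dot` program can read.

```python
def escape(label):
    """Escape double quotes so the label is safe inside a quoted DOT identifier."""
    return str(label).replace('"', '\\"')


def to_dot(edges, name="G", directed=True):
    """Return DOT source for a list of (source, target) edge pairs.

    Hypothetical helper for illustration; not part of Graphviz.
    """
    keyword = "digraph" if directed else "graph"
    arrow = "->" if directed else "--"
    lines = [f"{keyword} {name} {{"]
    for source, target in edges:
        lines.append(f'    "{escape(source)}" {arrow} "{escape(target)}";')
    lines.append("}")
    return "\n".join(lines)


# Sample edges are invented for the example.
dot_source = to_dot([("source XML", "converter"), ("converter", "target schema")])
print(dot_source)
```

If the output is saved to a file such as `graph.dot`, running `dot -Tpng graph.dot -o graph.png` should render it as an image, assuming Graphviz is installed.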
Heritrix is a web crawler used by the Internet Archive; after initial configuration on a Linux machine, it provides a web-based user interface. Also used by the Library of Congress, Heritrix captures metadata in the Web ARChive (WARC) format.
ImageJ is a Java image processing program that can display, edit, and analyze images, and perform simple transformations like scaling and rotating images. It can run either as an online applet or as a downloadable application on any computer with Java 1.4 or later.
Juxta is an open-source cross-platform tool for comparing and collating multiple witnesses to a single textual work. The software allows users to set any of the witnesses as the base text, to add or remove witness texts, to switch the base text at will, and to annotate Juxta-revealed comparisons and save the results.
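The idea of collating a witness against a base text can be sketched in a few lines of Python. This is a minimal word-level comparison built on the standard library's `difflib`, not Juxta's own algorithm; the function name `collate` and the sample sentences are assumptions made for the example.

```python
import difflib


def collate(base, witness):
    """List word-level differences between a base text and one witness.

    Hypothetical helper: returns (operation, base words, witness words)
    tuples for every span where the two texts disagree.
    """
    base_words = base.split()
    witness_words = witness.split()
    matcher = difflib.SequenceMatcher(a=base_words, b=witness_words, autojunk=False)
    variants = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":  # keep only replace, delete and insert spans
            variants.append(
                (tag, " ".join(base_words[i1:i2]), " ".join(witness_words[j1:j2]))
            )
    return variants


# Invented sample readings, with the first treated as the base text.
base_text = "the quick brown fox jumps"
witness_text = "the quick red fox leaps"
for variant in collate(base_text, witness_text):
    print(variant)
```

Switching the base text, as the description above mentions, amounts to swapping the two arguments; a real collation tool would also handle punctuation, spelling normalisation and more than two witnesses.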
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
MorphAdorner is a Java command-line program that acts as a pipeline manager for processes performing morphological adornment of words in a text. Currently, MorphAdorner provides methods for adorning text with standard spellings, parts of speech and lemmata. MorphAdorner also provides facilities for tokenizing text, recognizing sentence boundaries, and extracting names and places.
Perl is a high-level, general-purpose, interpreted, dynamic programming language. Originally developed for text manipulation, it is now used for a wide range of tasks, including graphics programming, system administration, network programming, applications that require database access, and CGI programming on the Web.