TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

TAPoR 2.5 is scheduled for decommissioning.
Please visit TAPoR 3

Popular Tools
User Recommended Tools
Random Tools

Voyant Links

Links finds collocates for words and displays links between them using a force directed graph. It shows term frequencies in proximity to keyword. It is a visualization and shows a web of terms.
Voyant Links
Voyant Links

TextArc

TextArc is a free visualization tool that represents an entire text on a single page. It has elements of an index, concordance and summary all in one place, encouraging the viewer to use its juxtapositions to uncover meaning. The web-based applet is ...
TextArc
TextArc

SEASR: OpenNLP Entities To Protovis Network Graph

SEASR's OpenNLP Entities to Protovis Network Graph is a free tool for extracting entities within a specified sentence distance within a text. The OpenNLP system is used for entity extraction, and their relationships are represented in a link node network ...
SEASR: OpenNLP Entities To Protovis Network Graph
SEASR: OpenNLP Entities To Protovis Network Graph

Voyant Cirrus

Cirrus is a visualization tool that displays a word cloud relating to the frequency of words appearing in one or more documents. One can click on any word appearing in the cloud to obtain detailed information about its relativity.
Voyant Cirrus
Voyant Cirrus

TUSTEP

TUSTEP (Tubingen System of Text Processing Tools) is a free, open source, widely-used toolbox for text processing. It is aimed at scholarly audiences, can work with texts in both latin and non-latin scripts, and is primarily designed for humanites applications. ...
TUSTEP
TUSTEP

Google Ngram Viewer

The Google Ngram Viewer is a free, web-based tool affiliated with Google Books. It displays a graph showing how a set of user-supplied phrases have occurred in a corpus of books over a selection of years. The tool includes functions for part-of-speech ...
Google Ngram Viewer
Google Ngram Viewer

DocuBurst

DocuBurst is a free web-based visualization tool for exploring the contents of a text.  Visitors can upload their own text or view those provided by others. DocuBurst presents an interactive chart called a ‘radial sunburst’ diagram which organizes ...
DocuBurst
DocuBurst

Wordle

Wordle is an online toy for generating word clouds using the text you provide.  Text can be submitted by providing an URL or by pasting raw text into an input.  The most frequent words from the text are then used as the source for the resulting visualization, ...
Wordle
Wordle

Concordle

Concordle is a free, web based word cloud and concordance tool built in Javascript. It describes itself as the "not so pretty cousin of Wordle" and first debuted in 2006. Users can paste text into the provided box and generate a word cloud, concordance ...
Concordle
Concordle

TextArc

TextArc is a free visualization tool that represents an entire text on a single page. It has elements of an index, concordance and summary all in one place, encouraging the viewer to use its juxtapositions to uncover meaning. The web-based applet is ...
TextArc
TextArc

R

R is an open source programing language designed for statistical analysis and parallel computing. R began its life as a research project at the University of Aukland, but has since expanded to become a collaborativly run open source project run by the ...
R
R

Voyant Corpus Term Frequencies

Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.
Voyant Corpus Term Frequencies
Voyant Corpus Term Frequencies

Wordle

Wordle is an online toy for generating word clouds using the text you provide.  Text can be submitted by providing an URL or by pasting raw text into an input.  The most frequent words from the text are then used as the source for the resulting visualization, ...
Wordle
Wordle

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...
BookLamp
BookLamp

Tokenize - XML (TAPoR)

This tool splits an XML document at specified points into 'tokens' - words, lines, sentences, paragraphs or characters. The user can specify characters, patterns, or tags upon which to separate tokens, and choose to have the results listed separator ...
Tokenize - XML (TAPoR)
Tokenize - XML (TAPoR)

Paper Machines

Paper Machines is a topic modelling and visualization tool available as a plugin for Zotero. It analyzes Zotero bibliographic collections based on a selection of text mining processes, and enables users to export a variety of visualizations, such as ...
Paper Machines
Paper Machines

TextGrid

TextGrid is a virtual research environment for text-based humanities scholarship. It offers a variety of tools and services for collaboratively creating, analyzing, editing and publishing texts. The TextGrid environment is split into two components, ...
TextGrid
TextGrid

STASEL (Stylistic Treatment at the Sentence Level)

STASEL (Stylistic Treatment at the Sentence Level) is a historically important program designed to analyze text. It was intended for language and style instruction, but had research-based applications.
STASEL (Stylistic Treatment at the Sentence Level)
STASEL (Stylistic Treatment at the Sentence Level)

List Words - HTML (TAPoRware)

This tool lists words in an HTML document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and plain text versions ...
List Words - HTML (TAPoRware)
List Words - HTML (TAPoRware)

List Words - HTML (TAPoRware)

This tool lists words in an HTML document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and plain text versions ...
List Words - HTML (TAPoRware)
List Words - HTML (TAPoRware)

DfR Browser

The DfR Browser is a free, open source visualization interface for exploring aggragates of articles from the JSTOR database. It uses topic modelling, co-occurrances and document metadata to provide multiple views on the corpus of interest based on topic ...
DfR Browser
DfR Browser
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Annotation Canadian Comparator English (language) European French (language) German Historic Java Metadata Multilingual Natural language processing Social media
All Tags: