TAPoR 2.0

Discover Research Tools for Textual Study

  • Browse Tools by Type or Tag
  • Search and Use Tools
  • Read and Create Tool Reviews
  • Contribute and Advertise Tools

Popular Tools
User Recommended Tools
Random Tools

Voyant Links

Links finds collocates for words and displays the links between them in a force-directed graph. It shows the frequencies of terms that occur in proximity to a keyword, visualized as a web of terms.
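To illustrate the idea of collocates, here is a minimal Python sketch that counts the words occurring within a few tokens of a keyword. It is not Voyant's implementation: the function name `collocates`, the `window` parameter and the simple tokenizer are assumptions made for this example. The resulting counts could serve as edge weights in a graph of terms.

```python
import re
from collections import Counter

def collocates(text, keyword, window=2):
    """Count words that appear within `window` tokens of `keyword` (hypothetical helper)."""
    # Naive tokenizer: lowercase runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok != keyword:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                counts[tokens[j]] += 1
    return counts

sample = "The white whale swam. The white whale dived. A whale is white."
edges = collocates(sample, "whale", window=1)
for term, weight in edges.most_common():
    print(term, weight)
```

Each (keyword, term, weight) triple can be treated as one link in the web of terms; drawing the force-directed layout itself is left to a graph library.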

TextArc

TextArc is a free visualization tool that represents an entire text on a single page. It has elements of an index, concordance and summary all in one place, encouraging the viewer to use its juxtapositions to uncover meaning. The web-based applet is ...

Weka (Waikato Environment for Knowledge Analysis)

Weka (Waikato Environment for Knowledge Analysis) is a free Java-based data mining workbench of machine learning algorithms, offered by the Machine Learning Group of the University of Waikato. It includes tools for data pre-processing, classification, ...

List Words - HTML (TAPoRware)

This tool lists words in an HTML document, either uploaded by the user or from a web address. List Words works with relatively small texts of under a megabyte in size. It is part of the TAPoRware collection of tools; there are XML and plain text versions ...

R

R is an open source programming language designed for statistical analysis and parallel computing. R began its life as a research project at the University of Auckland, but has since expanded to become a collaboratively run open source project run by ...

Tropes

Tropes is a legacy commercial text analysis tool now available for free. It is designed for natural language processing and semantic classification, including chronological analysis of sequential pieces of text and summarization. It includes a graphical ...

DocuBurst

DocuBurst is a free web-based visualization tool for exploring the contents of a text.  Visitors can upload their own text or view those provided by others. DocuBurst presents an interactive chart called a ‘radial sunburst’ diagram which organizes ...

BookLamp

BookLamp, part of the Book Genome Project, is a tool and a resource for finding books. It offers an alternative to social recommendation engines reliant on author popularity by treating its books as equal regardless of number of copies sold. BookLamp's ...

Tesseract OCR

Tesseract is a free raw OCR engine originally developed by HP Labs and now maintained by Google. It works with the Leptonica Image Processing Library, and is capable of reading a variety of image formats. It can convert images to text in over 40 languages. ...

Wordle

Wordle is an online toy for generating word clouds using the text you provide. Text can be submitted by providing a URL or by pasting raw text into an input. The most frequent words from the text are then used as the source for the resulting visualization, ...

Digitate

Digitate is a free, open source application for making notes and annotations directly on an image of a cultural artifact such as a manuscript or a painting. Images can also be grouped into projects, saved for later or exported. This application is only ...

TextGrid

TextGrid is a virtual research environment for text-based humanities scholarship. It offers a variety of tools and services for collaboratively creating, analyzing, editing and publishing texts. The TextGrid environment is split into two components, ...

Profiler Plus

Profiler Plus is a commercial general-purpose text analysis program based on natural language processing. It supports multiple languages and can output in text, XML and CSV.

Voyant Cirrus

Cirrus is a visualization tool that displays a word cloud relating to the frequency of words appearing in one or more documents. One can click on any word appearing in the cloud to obtain detailed information about its relativity.

Extract Text From HTML - Beta (TAPoRware)

This tool extracts texts from user-specified HTML tags, elements and attributes. There is no XML counterpart at present.
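As a rough illustration of tag-based extraction, the sketch below pulls the text found inside a chosen HTML element using Python's standard `html.parser` module. It is not the TAPoRware code: the class `TagTextExtractor` and the function `extract_text` are hypothetical names, and the sketch selects by tag name only, not by attribute.

```python
from html.parser import HTMLParser

class TagTextExtractor(HTMLParser):
    """Collect the text inside every occurrence of one tag (hypothetical helper)."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag.lower()
        self.depth = 0      # how many target tags are currently open
        self.chunks = []    # one string per outermost target element

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            if self.depth == 0:
                self.chunks.append("")
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        # Keep text only while inside the target element (nested markup included).
        if self.depth > 0:
            self.chunks[-1] += data

def extract_text(html, tag):
    parser = TagTextExtractor(tag)
    parser.feed(html)
    parser.close()
    return [chunk.strip() for chunk in parser.chunks]

page = "<html><body><h1>Title</h1><p>First <b>bold</b> para.</p><p>Second.</p></body></html>"
paragraphs = extract_text(page, "p")
print(paragraphs)
```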

Voyant Corpus Term Frequencies

Corpus Term Frequencies shows overall word frequencies for the entire corpus as well as information about how word frequencies are spread out over documents within the corpus. Hover over column headers and buttons for more information.
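The kind of table described here can be approximated in a few lines of Python. This is only a sketch under simple assumptions (a naive tokenizer, documents held as plain strings); the function name `corpus_term_frequencies` is made up for the example and does not reflect how Voyant computes its figures.

```python
import re
from collections import Counter

def corpus_term_frequencies(documents):
    """Return, for each term, its corpus total and its count in each document."""
    per_doc = [Counter(re.findall(r"[a-z']+", doc.lower())) for doc in documents]
    total = Counter()
    for counts in per_doc:
        total.update(counts)
    return {
        term: {"total": n, "per_document": [counts[term] for counts in per_doc]}
        for term, n in total.items()
    }

docs = ["the cat sat", "the cat and the dog", "a dog barked"]
table = corpus_term_frequencies(docs)
for term in sorted(table, key=lambda t: -table[t]["total"]):
    print(term, table[term]["total"], table[term]["per_document"])
```

The `per_document` list shows how a term's frequency is spread across the corpus, which is the distribution the tool reports alongside the overall totals.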

CollateX

CollateX is a free Java library for collating texts offered by Interedition, designed to be the successor to Peter Robinson's Collate. It uses a component-oriented architecture to enable users to mix and match components according to their needs. Though ...
View tools by tag:
1960s 1970s 1980s 1990s 2000s 2010s American Canadian Comparator English English (language) French (language) German German (language) Historic Java Javascript Metadata Multilingual Natural language processing