Discursis is a tool for analyzing text-based natural language with a focus on sequential analysis. It is particularly desgined to work with texts that have an internal temporal structure, such as a transcribed conversation. For each text, it generates a statistics-based internal language model and applies tagging to each temporal unit detected. The model and tagging are then used as the basis for an interactive visualization which permits examination at a various levels of granularity, ranging from a whole-text to a unit-by-unit level; it can also show topic usage patterns within the text. Users must purchase a license; both student and academic trial licenses are available.
Umigon is a free, web-based and open-source tool for sentiment analysis of tweets. From a person's Twitter handle, Umigon retrieves that account's tweets and processes it for sentiment with accounting for factual statements (ex: "I hate war" will be classified has negative, and "war in Syria" will be classified as neutral). Tweets can also be pasted manually in the entry box provided. Beyond sentiment analysis, Umigon can identify characteristics such as whether the tweet contains a question, whether it contains possible promotional/commercial subject matter, or temporal indicators based on tense. Users are encouraged to report inaccurately identified sentiments via the button provided next to each tweet. Results can be exported in Excel or CSV format for futher analysis.
The developer of this application can be reached on Twitter.
CheckText is a free, web-based text analysis tool. Users can paste in text, upload it from their files, or import content from a web page. For each text, CheckText generates statistics such as word count, syllable count or number of complex words, provides a reading ease and level breakdown, and graphs the most frequent words. The tool also offers a uniqueness check aimed at detecting plagiarism, and generates a PDF report containing the full text with the statistical and readability data.
Alt.Text is a free, working prototype application for exploring a text on both an outline and content level via a graphical user interface. It breaks down texts into components such as sections, passages or documents, and permits users to leverage these components to create outlines and break off sections. Alt.Text includes three interfaces: the Document Editor (define sections, create outlines), the Section Editor (define passages of text within a given section), and the Outline Viewer (choose which passages to include in an outline section). Alt.Text is under active development, and is available for download in both Windows and OS X versions.
Paper Machines is a topic modelling and visualization tool available as a plugin for Zotero. It analyzes Zotero bibliographic collections based on a selection of text mining processes, and enables users to export a variety of visualizations, such as word clouds, phrase nets or heat maps, from the results.
CATMA (Computer Aided Textual Markup and Analysis) is a free, open source markup and analysis tool from the University of Hamburg's Department of Languages, Literature and Media. It incorporates three interactive modules, a tagger enabling textual markup and markup editing, an analyzer incorporating a query language and predefined functions, and a query builder that allows users to construct queries from combinations of pre-defined questions while allowing for manual modification for more specific questions. It also interfaces with the Voyant toolset. As of version 4.1, CATMA is a web application with collaborative work functions, and improvements to its user interface, queries and corpus analysis capacity.
|All (482)||Reviewed (249)|
|New (10)||Popular (400)|
|Annotation (18)||Bibliographic (4)|
|Collaboration (10)||Comparison (22)|
|Concording (66)||Editing (23)|
|Miscellaneous (219)||Natural language processing (1)|
|Network analysis (0)||Programming language (20)|
|Publishing (3)||Rdf (0)|
|Search (131)||Sentiment analysis (0)|
|Sequence analysis (0)||Social media analysis (0)|
|Statistical (149)||Text cleaning (17)|
|Text gathering (47)||Visualization (160)|