Text Analysis is computational approach to studying digital texts which can be performed using a software tool or a programming language to analyze a corpus of text to identify meaningful patterns and key trends instead of physically reading through them.Below are few exciting things you could do with text mining!
Voyant is an online web-based tool for reading and analyzing your digital texts.Use it to perform lexical analysis including the study of frequency and distribution data and creating word clouds; in particular.
Access Voyant :Voyant Tools
Getting started with Voyant : Support Guide
2. Google Books Ngram Viewer
Uses all digitized e-books in Google Books to create charts for searched words and phrases.
Access Google Books Ngram Viewer : Ngram Viewer
3. Wordle and Concordle
Word Clouds are a popular way of visualizing how important words are in a collection of texts. Wordle and Concordle help create customizable word clouds based on the user's text data.
Access Concordle: Concordle
Open source software that allows users to clean their datasets prior to use.
Access Open Refine : Open Refine
5.Hathi Trust Research Center Analytics -
HathiTrust Research Center (HTRC) enables computational analysis of works in the HathiTrust Digital Library (HTDL) to facilitate non-profit research and educational uses of the collection. HTRC, which is co-located at Indiana University and the University of Illinois at Urbana-Champaign, engages in research and development for computational text analysis of massive digital libraries.
Access Hathi Trust : Hathi Trust Analytics
Teaching materials: HTRC
HathiTrust + Bookworm: For visualizing trends in language overtime : Link HTRC + Bookworm
|Tools||Description||Access the software|
|KNIME||KNIME, the Konstanz Information Miner, is a free and open-source text analytics, reporting and integration platform. Also available in UAlbany Library Computing Sites||Download|
|MALLET||Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text||Download|
|Python + NLTK||Open source programming language widely used for manipulating and analyzing text data||Download and Install|
|R Studio||Open source analysis software that rely on community driven packages to mine data. Also available in UAlbany Library Computing Sites||Download|
|WEKA||Weka contains a collection of visualization tools and algorithms for text analysis and predictive modeling||Download|
|Rapid Miner||An open source software useful to extract insight from unstructured text content||Download|