Introduction to Text Analysis with NLTK

Posted by Michelle A. McSweeney (Johnson) on March 3, 2016 in Announcements, Events

Introduction to Text Analysis with the Natural Language Toolkit for Python

Thursday, March 10th, 6:30-8:30pm

The Natural Language Toolkit (NLTK) is a Python library that allows researchers to work with text-based data, such as literary works or social media corpora.

In this session, you will learn how to use Python to analyze large amounts of text to find word frequencies, collocations, and other patterns invisible from a non-computational perspective. You will learn how to compute word counts, find collocations, build frequency distributions, and turn any .txt document into a corpus NLTK can work with.
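
To give a flavor of what word counts and word-pair counts look like in practice, here is a minimal sketch. It is not the workshop's actual material: the sample sentence, the regex tokenizer, and the variable names are all assumptions made for illustration. It relies on NLTK's FreqDist (a subclass of Python's collections.Counter) and its bigrams helper, and falls back to plain Python equivalents if NLTK is not installed yet.

```python
import re
from collections import Counter

try:
    # FreqDist subclasses collections.Counter and adds text-analysis helpers.
    from nltk import FreqDist, bigrams
except ImportError:
    # Fallback so the sketch still runs before NLTK is installed.
    # Only Counter-style methods are used below, so Counter is a fair stand-in.
    FreqDist = Counter

    def bigrams(tokens):
        """Yield adjacent word pairs, like nltk.bigrams."""
        tokens = list(tokens)
        return zip(tokens, tokens[1:])

# Hypothetical sample text. With a real document you would read a .txt file,
# for example: raw = open("my_text.txt", encoding="utf-8").read()
raw = "The cat sat on the mat. The cat saw the dog, and the dog saw the cat."

# A deliberately simple tokenizer (an assumption); NLTK ships richer ones.
tokens = re.findall(r"[a-z']+", raw.lower())

# Word counts: how often each word occurs.
word_freq = FreqDist(tokens)

# Adjacent word-pair counts, a rough first step toward finding collocations.
pair_freq = FreqDist(bigrams(tokens))

print(word_freq.most_common(3))
print(pair_freq.most_common(2))
```

Raw pair counts are only a starting point. On a real corpus you would typically filter out very common words and rank pairs with an association measure rather than by frequency alone.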

You will get the most out of this session if you have some familiarity with the Python programming language. If you have never used Python before, please start with an Introduction to Programming with Python Workshop or come to the Python Users’ Group (Tuesdays 1-3pm).

We will be using IPython Notebook and NLTK. The easiest way to get both is to download Anaconda, which includes them. Please install Anaconda BEFORE attending the workshop.

Workshop Leaders: Michelle McSweeney and Hannah Aizenman

Please contact gc.digitalfellows@gmail.com with questions.

Photo Credit: Chris Blakeley