Summary and Setup

This lesson is about the fundamentals of Natural Language Processing (NLP) in Python.

Before joining this course, participants should have:

  • foundational knowledge of Python
  • foundational knowledge of Git and GitHub

Lesson objectives


  • Fundamentals of NLP: Introduce terminology, basic concepts and possible applications of NLP.
  • Data acquisition and Pre-processing: Preprocessing techniques such as tokenization, stemming, lemmatization, and removing stop words
  • Text analysis and feature extraction: Extract features from text, including TF-IDF and word embeddings

FIXME: Setup instructions live in this document. Please specify the tools and the data sets the Learner needs to have installed.

Data Sets


Download the data zip file and unzip it to your Desktop

Software Setup


Details

Setup for different systems can be presented in dropdown menus via a solution tag. They will join to this discussion block, so you can give a general overview of the software used in this lesson here and fill out the individual operating systems (and potentially add more, e.g. online setup) in the solutions blocks.

Use PuTTY

Use Terminal.app

Use Terminal