Summary and Setup
This lesson is about the fundamentals of Natural Language Processing (NLP) in Python.
Before joining this course, participants should have:
- foundational knowledge of Python
- foundational knowledge of Git and GitHub
Lesson objectives
- Fundamentals of NLP: Introduce the terminology, basic concepts, and possible applications of NLP.
- Data acquisition and preprocessing: Apply preprocessing techniques such as tokenization, stemming, lemmatization, and stop-word removal.
- Text analysis and feature extraction: Extract features from text using TF-IDF and word embeddings.
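To give a flavour of the preprocessing and feature-extraction objectives, here is a minimal sketch that uses only the Python standard library. It covers tokenization, stop-word removal, and one common TF-IDF variant (term frequency times `log(N / df)`). It does not cover stemming, lemmatization, or word embeddings. The stop-word list, the example sentences, and the function names `tokenize`, `preprocess`, and `tf_idf` are assumptions made for illustration and are not part of the lesson material.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"the", "a", "is", "on", "and", "of"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def preprocess(text):
    """Tokenize the text and drop stop words."""
    return [tok for tok in tokenize(text) if tok not in STOP_WORDS]


def tf_idf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    tokenized = [preprocess(doc) for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: the number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Made-up example sentences, for illustration only.
docs = [
    "The cat sat on the mat.",
    "The dog sat on the log.",
    "The cat chased the dog.",
]
scores = tf_idf(docs)
print(preprocess(docs[0]))
print(scores[0])
```

In this toy corpus, "mat" appears in only one document, so it receives a higher weight in the first document than "cat" or "sat", which each appear in two. Established libraries typically apply additional smoothing and normalization, so their numbers will differ from this sketch.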
Data Sets
Download the data zip file and unzip it to your Desktop.
Software Setup
- Windows: Use PuTTY
- macOS: Use Terminal.app
- Linux: Use Terminal