Summary and Schedule
This lesson is about the fundamentals of Natural Language Processing (NLP) in Python.
Before joining this course, participants should have:
- foundational knowledge of Python
- foundational knowledge of Git and GitHub
Lesson objectives
- Fundamentals of NLP: Introduce terminology, basic concepts and possible applications of NLP.
- Data acquisition and Pre-processing: Preprocessing techniques such as tokenization, stemming, lemmatization, and removing stop words
- Text analysis and feature extraction: Extract features from text, including TF-IDF and word embeddings
Setup Instructions | Download files required for the lesson | |
Duration: 00h 00m | 1. Welcome |
Who is this lesson for? What will be covered in this lesson? :::::: |
Duration: 00h 05m | 2. Episode 1: Introducing NLP |
What is natural language processing (NLP)? Why is it important to learn about NLP? What are some classic tasks associated with NLP? :::::: |
Duration: 00h 17m | 3. Episode 2: Preprocessing |
What different types of preprocessing steps are there? Why we need preprocessing? What are the consequences of applying data preprocessing on our text? |
Duration: 00h 27m | 4. Episode 3: Word embeddings |
What are word embeddings? What properties word embeddings have? What is a word2vec model? Can we inspect word embeddings? (Optional) How do we train a word2vec model? |
Duration: 00h 37m | Finish |
The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.
FIXME: Setup instructions live in this document. Please specify the tools and the data sets the Learner needs to have installed.
Data Sets
Download the data zip file and unzip it to your Desktop
Software Setup
Details
Setup for different systems can be presented in dropdown menus via a
solution
tag. They will join to this discussion block, so
you can give a general overview of the software used in this lesson here
and fill out the individual operating systems (and potentially add more,
e.g. online setup) in the solutions blocks.
Use PuTTY
Use Terminal.app
Use Terminal