So many tweets and news articles and unstructured text surrounds us. How do we make sense of all of these? Natural language processing or NLP can help. NLP refers to algorithms that process, understand and generate aspects of natural language either in text or in spoken voice. In this episode we will cover some of the common techniques in NLP to help get started in this exciting field!
We cover several tasks in a NLP pipeline:
1. Tokenization and punctuation removal
2. Stemming and Lemmatization
3. One hot vectors
4. Word embeddings including Word2Vec and Glove
5. Recurrent Neural Networks and LSTMs
6. tf and tf-idf approaches - when to use word embeddings, when to use tf / tf-idf approaches?
7. Generating text using encoder-decoder or sequence to sequence models
Podden och tillhörande omslagsbild på den här sidan tillhör Sanket Gupta. Innehållet i podden är skapat av Sanket Gupta och inte av, eller tillsammans med, Poddtoppen.