Feature Engineering Tutorial Series 4: Linear Model Assumptions

Linear models make the following assumptions over the independent variables X, used to predict Y: There is a linear relationship between X and the outcome Y The independent variables X are normally distributed There is no or little co-linearity among the independent variables Homoscedasticity (homogeneity of variance) Examples of linear Read more…

Feature Engineering Series Tutorial 3: Rare Labels

Labels that occur rarely Categorical variables are those whose values are selected from a group of categories, also called labels. Different labels appear in the dataset with different frequencies. Some categories appear more frequently in the dataset, whereas some other categories appear only in a few number of observations. For Read more…

Word Embedding and NLP with TF2.0 and Keras on Twitter Sentiment Data

Word Embedding and Sentiment Analysis What is Word Embedding? Natural Language Processing(NLP) refers to computer systems designed to understand human language. Human language, like English or Hindi consists of words and sentences, and NLP attempts to extract information from these sentences. Machine learning and deep learning algorithms only take numeric Read more…

Amazon and IMDB Review Sentiment Classification using SpaCy

Sentiment Classification using SpaCy What is NLP? Natural Language Processing (NLP) is the field of Artificial Intelligence concerned with the processing and understanding of human language. Since its inception during the 1950s, machine understanding of language has played a pivotal role in translation, topic modeling, document indexing, information retrieval, and Read more…

2D CNN in TensorFlow 2.0 on CIFAR-10 – Object Recognition in Images

What is CNN This Notebook demonstrates training a simple Convolutional Neural Network (CNN) to classify CIFAR images. Convolutional Neural Networks (ConvNets or CNNs) are a category of Neural Networks that have proven very effective in areas such as image recognition and classification. Unlike traditional multilayer perceptron architectures, it uses two operations Read more…

Real-Time Sentiment Analysis of a Phone Call Using NLTK and TextBlob in Python

Speech to text conversion and real-time sentiment analysis In this project we are going to analyse the sentiment of the call. We are first going to convert the speech to text and the analyse the sentiment using TextBlob. TextBlob is a Python library for processing textual data. It provides a simple API Read more…

NLP: End to End Text Processing for Beginners

Complete Text Processing for Beginners Everything we express (either verbally or in written) carries huge amounts of information. The topic we choose, our tone, our selection of words, everything adds some type of information that can be interpreted and value can be extracted from it. In theory, we can understand Read more…