Fake News Content Analysis and Detection

Performed Text Mining, Data Wrangling, Exploratory Data Analysis (EDA), Fake vs Real news content analysis on Buzzfeed news dataset. Identified fake news sources, compared title length and discriminatory words by statistical tests and built a predictive models to detect fake news.

Amazon Customer Reviews analysis using Apache Spark

Analyzed 7 million reviews and ratings given by the customers over 20 years. Utilized Big data technologies for data analysis and visualization.

Political Ideology Animated Persona

Scraped Democrats and Republican tweets using Twitter API, processed and cleaned tweets to build an animated political persona. Trained a bidirectional LSTM model using GloVe Embeddings that predicts the political ideology interest scores from given text.