Analyzing & Visualizing Tweets using Python, PySpark, Kafka, ZooKeeper and Tableau

Handling humongous streaming data and providing real-time analysis of it is revolutionary. Python combined with PySpark makes this possible with ease. Here, we have the advantage of getting data through the API that Twitter provides. The Twitter API lets us get the real-time data, process it using RDD, pass…


Important for manipulating data and preprocessing

Pandas is a must-learn library for data analysis, which makes it all the more important to learn the tricks to master it. It is composed of many useful utilities for working with huge dataframes quickly and easily. …


Preprocessing data by looping through a dataframe

Pandas is one of the most used libraries for preprocessing data. Why? Pandas is simple, easy and filled with lots of utilities to tweak data. Preprocessing always involves loading the data, analyzing it and transforming it. This library has all the qualities to use data for…


Collections, itertools and set theory make code simple, efficient and fast

This blog is on advanced features of Python that can definitely help you write efficient and Pythonic code. They will not only reduce the lines of code but also simplify it, which makes it easier to debug and more reliable. Python has a lot of features and…


Enhancing performance requires knowing the facts: where the gaps are and how much time is taken

Python is considered a new-generation language. Naturally, developers are learning, developing and writing lots of code in Python. This always brings the need to improve the performance of your code and to debug it. Debugging is made easy by the tools and existing libraries of Python…


Pythonic way to simplify complex things in a logical way

Python is an in-demand programming language. It is easy to learn and apply, and it becomes easier still if you already know another programming language. The depth of Python allows you to use it for scripting, UI application development, and machine learning. Another drawback of having knowledge of other programming languages…


Group by to imitate SQL

Pandas is one of the wonderful libraries to master on the path to data analysis and machine learning. It is essential to learn because of its unique capabilities and existing features. In the last blog, we discussed the initial usage of pandas and its basic applications. Now, we are good to move…


Manipulating dataframes to preprocess data

Pandas is one of the most used libraries for reading data from any kind of source and converting it into dataframes for preprocessing. Being an open source library, it has a lot inside and is quite flexible, with multiple ways to do the same thing efficiently. This library is capable of handling huge…


Important and best algorithm derived from Bayes Theorem

Naive Bayes is one of the simplest algorithms and a must-learn for understanding the basics of machine learning. It works well provided a few conditions or assumptions hold. Few algorithms can beat it for being logical, simple and easy to use.

Let us talk about Bayesian…


Supervised, Unsupervised and Reinforcement

Artificial intelligence, machine learning and neural networks are a few of the buzzwords in today’s world. Everybody knows about them or wants to know about them. This is the trend that will rule technology in the next decade. We can understand…

Laxman Singh

Machine Learning Engineer | Data Science | MTECH NUS, Singapore
