Three common dirty data problems and how to fix them with Pandas

Data Science is a process that involves defining your problem, collecting data, cleaning data, and training and deploying machine learning models. Training machine learning models is what draws many people to start the journey, and I guess it is the cool part. However, many Data Scientists in industry will argue that training a model is [...]

Data Science Getting Started

This is a list of online resources for getting started with Data Science. I only list resources that I have used, that were recommended by a lecturer, or that I have briefly walked through to fill some knowledge gaps. Courses for absolute beginners: Coursera: Data Science Specialization; LinkedIn Learning: Data Science foundations; Udemy: Complete Python [...]

My Journey To Data Science

Data Science is a new and very broad field. To keep things simple, the heart of Data Science lies in applying complex algorithms to data. This is done to discover underlying relationships between variables in order to predict a certain outcome. The primary goal is to enable “intelligent” machine-driven decision making. [...]