Data preprocessing

alt text
Machine Learning is vast area. I am trying to learn little bit of it within my limited time and knowledge. First attempt is to preprocessing the data. All the codes are trying to write in most popular programming language python. because it is really simple to read and understand to others.

Import Libraries

First step is importing the required libraries

  1. NumPy - is a Library which contains Mathematical functions.
  2. Pandas - library used to import and manage the data sets.

Import Data Sets

Next step is Importing Data sets. Data sets are generally available in csv format. Each line of CSV file is a data record. Read local CSV file as a dataframe using read_csv method of pandas library. Then separate Matrix and Vector of independent and dependent variables from the dataframe.

Share