Data slicing or indexing in python on datasets.

6 min readFeb 10, 2020

Hey guys this part is a very basic and important part. Before performing any action on the dataset we should know some rules of indexing and how indexing performs in Python.

Here we are using the Anaconda tool to perform some action on dataset which is a .csv file. By using data slicing we can perform data operations on limited data from datasets.

Import library to load a dataset

When we are working with .csv file we always use pandas library to import the dataset into our Spyder.

Import dataset into Spyder

To import dataset as .csv file we have pd.read_csv(‘filename’). Initially we are assigning this dataset to a variable which is of type DataFrame.

Above is our dataset which contains some random dataset. We are performing all the operations of data slicing on an above dataset.

Assign the whole dataset to a new variable

There are three-way we can do this as follows.

1: Using ‘=’

2: Using .copy() method

3: Using [:]

Before going on we should know how to check the memory address of the dataset in the Python. So we use hex(id()) to get the address of the dataset in Python.

#1: Here we directly use ‘=’ to perform assign operation as follows.

But the fun fact is in the result.

We see some amazing results we see the addresses are the same. So dataset1 and dataset are pointing to same memory location.

#2: We use here the .copy() method to copy all data from the dataset variable to another variable.

and again we see some amazing results.

After performing the assign operation using .copy() method we can see the addresses are different. So dataset and datatset2 are not sharing same memory reference.

#3: This is a more common way to assign a dataset variable to another variable.

and off course the result.

Here we see some interesting results .copy() and [:] addresses and not the same. So if we want to create a different dataset variable but not wasting some memory at that time just use the first method that is ‘=’. Because in other methods we can see the memory reference is changing.

Row operations

We saw how to copy or assign the whole dataset to another dataset variable. Now its time to perform some slicing on rows of the dataset.

To print a particular column from data we use the following methods.

You may things like this at first like I did but we get an error if we run this line of the script. I know you don’t believe me so see the below result.