{"id":1103,"date":"2022-07-19T09:16:36","date_gmt":"2022-07-06T14:41:40","guid":{"rendered":"https:\/\/www.silicloud.com\/blog\/uncategorized\/a-tutorial-on-the-python-pandas-module\/"},"modified":"2024-03-06T13:57:53","modified_gmt":"2024-03-06T13:57:53","slug":"a-tutorial-on-the-python-pandas-module","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/","title":{"rendered":"A tutorial on the Python Pandas module."},"content":{"rendered":"<h2>The module named Python Pandas.<\/h2>\n<ul class=\"post-ul\">\n<li>Pandas is an open source library in Python. It provides ready to use high-performance data structures and data analysis tools.<\/li>\n<li>Pandas module runs on top of NumPy and it is popularly used for data science and data analytics.<\/li>\n<li><a href=\"https:\/\/numpy.org\/\">NumPy<\/a> is a low-level data structure that supports multi-dimensional arrays and a wide range of mathematical array operations. Pandas has a higher-level interface. It also provides streamlined alignment of tabular data and powerful time series functionality.<\/li>\n<li>DataFrame is the key data structure in Pandas. It allows us to store and manipulate tabular data as a 2-D data structure.<\/li>\n<li>Pandas provides a rich feature-set on the DataFrame. For example, data alignment, data statistics, slicing, grouping, merging, concatenating data, etc.<\/li>\n<\/ul>\n<hr \/>\n<h2>Installing Pandas and Getting Started with it.<\/h2>\n<p>To install the Pandas module, it is necessary to have Python 2.7 or a higher version. If you are utilizing conda, the module can be installed by using the command provided below.<\/p>\n<pre class=\"post-pre\"><code>conda install pandas\r\n<\/code><\/pre>\n<p>If you are utilizing PIP, execute the given command to install the pandas module.<\/p>\n<pre class=\"post-pre\"><code>pip3.7 install pandas\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/8-0.png\" alt=\"Python Install Pandas Module\" \/><\/div>\n<p>To incorporate Pandas and NumPy into your Python script, include the following code snippet:<\/p>\n<pre class=\"post-pre\"><code>import pandas as pd\r\nimport numpy as np\r\n<\/code><\/pre>\n<p>Since Pandas relies on the NumPy library, it is essential to import this dependency.<\/p>\n<hr \/>\n<h2>Data structures offered in the Pandas module.<\/h2>\n<p>Pandas module offers three data structures, which are as stated below:<\/p>\n<ul class=\"post-ul\">\n<li>Series: It is a 1-D size-immutable array like structure having homogeneous data.<\/li>\n<li>DataFrames: It is a 2-D size-mutable tabular structure with heterogeneously typed columns.<\/li>\n<li>Panel: It is a 3-D, size-mutable array.<\/li>\n<\/ul>\n<hr \/>\n<h2>A DataFrame from the Pandas library.<\/h2>\n<p>The DataFrame, which is the most crucial and commonly utilized data structure, serves as a standardized method for data storage. The data in a DataFrame is organized in rows and columns similar to an SQL table or a spreadsheet database. To create a DataFrame object, we have the option to manually input the data or import it from various file types such as CSV, TSV, Excel, or an SQL table. The following constructor can be used for this purpose.<\/p>\n<pre class=\"post-pre\"><code>pandas.DataFrame(data, index, columns, dtype, copy)\r\n<\/code><\/pre>\n<p>Here is a brief explanation of the parameters.<\/p>\n<ul class=\"post-ul\">\n<li>data &#8211; create a DataFrame object from the input data. It can be list, dict, series, Numpy ndarrays or even, any other DataFrame.<\/li>\n<li>index &#8211; has the row labels<\/li>\n<li>columns &#8211; used to create column labels<\/li>\n<li>dtype &#8211; used to specify the data type of each column, optional parameter<\/li>\n<li>copy &#8211; used for copying data, if any<\/li>\n<\/ul>\n<p>You can create a DataFrame in various ways, such as by using dictionaries or a list of dictionaries. It can also be created from a list of tuples, CSV, Excel file, and so on. To illustrate, let&#8217;s execute a basic code to generate a DataFrame from a list of dictionaries.<\/p>\n<pre class=\"post-pre\"><code>import pandas as pd\r\nimport numpy as np\r\ndf = pd.DataFrame({\r\n    \"State\": ['Andhra Pradesh', 'Maharashtra', 'Karnataka', 'Kerala', 'Tamil Nadu'],\r\n    \"Capital\": ['Hyderabad', 'Mumbai', 'Bengaluru', 'Trivandrum', 'Chennai'],\r\n    \"Literacy %\": [89, 77, 82, 97,85],\r\n    \"Avg High Temp(c)\": [33, 30, 29, 31, 32 ]\r\n})\r\nprint(df)\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/24-0.png\" alt=\"Python pandas Dataframe\" \/><\/div>\n<hr \/>\n<h2>Transferring information from a CSV file to a DataFrame.<\/h2>\n<p>We have the option to import a CSV file to create a DataFrame. A CSV file is a text file where each line represents a data record. The values within the record are separated by commas. Pandas offers a convenient function called read_csv() to read the CSV file&#8217;s contents and transform them into a DataFrame. For instance, we can have a file named &#8216;cities.csv&#8217; with information about Indian cities. This CSV file is located in the same directory as our Python scripts. We can import this file by using the following command:<\/p>\n<pre class=\"post-pre\"><code>import pandas as pd\r\ndata =  pd.read_csv('cities.csv')\r\nprint(data)\r\n<\/code><\/pre>\n<p>Our objective is to load and analyze data in order to make informed decisions. Therefore, we can employ any suitable technique for data loading. For this tutorial, we are manually inputting the data into the DataFrame.<\/p>\n<hr \/>\n<h2>Examining data in the DataFrame.<\/h2>\n<p>When we run the DataFrame by its name, we can see the complete table. When dealing with real-time datasets that contain thousands of rows, it is necessary to examine data from extensive volumes of datasets for analysis. Pandas offers various helpful functions to inspect specific data. To extract the first n rows, we can use df.head(n), while df.tail(n) allows us to print the last n rows. To illustrate, the code provided below will display the first 2 rows and the last 1 row of the DataFrame.<\/p>\n<pre class=\"post-pre\"><code>print(df.head(2))\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/34-0.png\" alt=\"Head\" \/><\/div>\n<pre class=\"post-pre\"><code>print(df.tail(1))\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/36-4.png\" alt=\"Values\" \/><\/div>\n<hr \/>\n<h3>1. Obtaining statistical summaries of data.<\/h3>\n<p>To obtain a statistical summary (which includes count, mean, standard deviation, minimum, maximum, etc.) of the data, we can utilize the df.describe() function. Now, let&#8217;s implement this function to present the statistical summary of the &#8220;Literacy %&#8221; column. To achieve this, we can include the following snippet of code.<\/p>\n<pre class=\"post-pre\"><code>print(df['Literacy %'].describe())\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/41-0.png\" alt=\"Describe\" \/><\/div>\n<hr \/>\n<h3>2. Arranging data entries in a specific order.<\/h3>\n<p>Using the df.sort_values() function, we have the ability to arrange records based on any column. As an illustration, let&#8217;s arrange the column &#8220;Literacy %&#8221; in a descending order.<\/p>\n<pre class=\"post-pre\"><code>print(df.sort_values('Literacy %', ascending=False))\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/46-0.png\" alt=\"Sort\" \/><\/div>\n<hr \/>\n<h3>3. Cutting through documents<\/h3>\n<p>To extract data from a specific column, you can utilize the column name. For instance, when extracting the &#8216;Capital&#8217; column, we can use:<\/p>\n<pre class=\"post-pre\"><code>df['Capital']\r\n<\/code><\/pre>\n<p>or (Only need one alternative)<\/p>\n<pre class=\"post-pre\"><code>(df.Capital)\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/53-0.png\" alt=\"Slice Capitals\" \/><\/div>\n<pre class=\"post-pre\"><code>print(df[['State', 'Capital']])\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/55-0.png\" alt=\"Slicemultiplecol\" \/><\/div>\n<pre class=\"post-pre\"><code>df[0:3]\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/57-0.png\" alt=\"Slicerows\" \/><\/div>\n<hr \/>\n<h3>4. Data filtration<\/h3>\n<p>You can also apply column value filters. For instance, the code below filters the columns that have a literacy percentage of over 90%.<\/p>\n<pre class=\"post-pre\"><code>print(df[df['Literacy %']&gt;90])\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/62-0.png\" alt=\"Filter Lit\" \/><\/div>\n<pre class=\"post-pre\"><code>print(df[df['State'].isin(['Karnataka', 'Tamil Nadu'])])\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/64-0.png\" alt=\"Filter\" \/><\/div>\n<hr \/>\n<h3>Change the name of the column.<\/h3>\n<p>You can utilize the df.rename() function to change the name of a column. This function requires the old column name and the desired new column name as inputs. In this case, we can rename the column &#8216;Literacy%&#8217; to &#8216;Literacy percentage&#8217;.<\/p>\n<pre class=\"post-pre\"><code>df.rename(columns = {'Literacy %':'Literacy percentage'}, inplace=True)\r\nprint(df.head())\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/69-0.png\" alt=\"Rename\" \/><\/div>\n<hr \/>\n<h3>6. Managing and organizing data efficiently<\/h3>\n<p>Native paraphrase:<br \/>\nData Science includes manipulating data so that it can be utilized effectively by algorithms. Data Wrangling refers to the process of manipulating data, such as combining, categorizing, and joining. The Pandas library offers helpful functions like merge(), groupby(), and concat() to assist in Data Wrangling tasks. To gain a clearer understanding, we will create two DataFrames and demonstrate the functionalities of Data Wrangling.<\/p>\n<pre class=\"post-pre\"><code>import pandas as pd\r\n\r\nd = {  \r\n    'Employee_id': ['1', '2', '3', '4', '5'],\r\n    'Employee_name': ['Akshar', 'Jones', 'Kate', 'Mike', 'Tina']\r\n}\r\ndf1 = pd.DataFrame(d, columns=['Employee_id', 'Employee_name'])  \r\nprint(df1)\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/74-0.png\" alt=\"Wrangling 1\" \/><\/div>\n<pre class=\"post-pre\"><code>import pandas as pd\r\n\r\ndata = {  \r\n    'Employee_id': ['4', '5', '6', '7', '8'],\r\n    'Employee_name': ['Meera', 'Tia', 'Varsha', 'Williams', 'Ziva']\r\n}\r\ndf2 = pd.DataFrame(data, columns=['Employee_id', 'Employee_name'])  \r\nprint(df2)\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/76-0.png\" alt=\"Wrangling 2\" \/><\/div>\n<hr \/>\n<h4>a. Combining<\/h4>\n<p>Let&#8217;s merge the two DataFrames we made by using the merge() function and matching the values of &#8216;Employee_id&#8217;.<\/p>\n<pre class=\"post-pre\"><code>print(pd.merge(df1, df2, on='Employee_id'))\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/81-0.png\" alt=\"Merging\" \/><\/div>\n<hr \/>\n<h4>b. Categorizing<\/h4>\n<p>Grouping refers to the act of organizing data into distinct categories. As an illustration, in the given instance, the field labeled &#8220;Employee_Name&#8221; contains the name &#8220;Meera&#8221; twice. Therefore, we can group this data by the column titled &#8220;Employee_name.&#8221;<\/p>\n<pre class=\"post-pre\"><code>import pandas as pd\r\nimport numpy as np\r\n\r\ndata = {\r\n    'Employee_id': ['4', '5', '6', '7', '8'],\r\n    'Employee_name': ['Meera', 'Meera', 'Varsha', 'Williams', 'Ziva']\r\n}\r\ndf2 = pd.DataFrame(data)\r\n\r\ngroup = df2.groupby('Employee_name')\r\nprint(group.get_group('Meera'))\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/86-0.png\" alt=\"Grouping\" \/><\/div>\n<hr \/>\n<h4>c. Combining together<\/h4>\n<p>The process of concatenating data involves combining one set of data with another. Pandas offers a function called concat() specifically for merging DataFrames. To illustrate, we can concatenate the DataFrames df1 and df2 using this function.<\/p>\n<pre class=\"post-pre\"><code>print(pd.concat([df1, df2]))\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/91-0.png\" alt=\"Concatenation\" \/><\/div>\n<hr \/>\n<h2>One option for paraphrasing the given sentence could be: &#8220;Construct a DataFrame by supplying a dictionary consisting of Series.&#8221;<\/h2>\n<p>We can make a Series by utilizing the pd.Series() function and supplying it with an array. Now, let&#8217;s create a basic Series like this:<\/p>\n<pre class=\"post-pre\"><code>series_sample = pd.Series([100, 200, 300, 400])\r\nprint(series_sample)\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/96-0.png\" alt=\"Series Sample\" \/><\/div>\n<pre class=\"post-pre\"><code>d = {'Matches played' : pd.Series([400, 300, 200], index=['Sachin', 'Kohli', 'Raina']),\r\n'Position' : pd.Series([1, 2, 3, 4], index=['Sachin', 'Kohli', 'Raina', 'Dravid'])}\r\ndf = pd.DataFrame(d)\r\nprint(df)\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/98-0.png\" alt=\"Df Dictseries\" \/><\/div>\n<hr \/>\n<h3>Choosing, including, and removing columns<\/h3>\n<p>We have the option to choose a particular column from the DataFrame. To illustrate, if we want to show solely the initial column, we can modify the preceding code as:<\/p>\n<pre class=\"post-pre\"><code>d = {'Matches played' : pd.Series([400, 300, 200], index=['Sachin', 'Kohli', 'Raina']),\r\n 'Position' : pd.Series([1, 2, 3, 4], index=['Sachin', 'Kohli', 'Raina', 'Dravid'])}\r\ndf = pd.DataFrame(d)\r\nprint(df['Matches played'])\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/103-0.png\" alt=\"Col Selection\" \/><\/div>\n<pre class=\"post-pre\"><code>d = {'Matches played' : pd.Series([400, 300, 200], index=['Sachin', 'Kohli', 'Raina']),\r\n 'Position' : pd.Series([1, 2, 3, 4], index=['Sachin', 'Kohli', 'Raina', 'Dravid'])}\r\ndf = pd.DataFrame(d)\r\ndf['Runrate']=pd.Series([80, 70, 60, 50], index=['Sachin', 'Kohli', 'Raina', 'Dravid'])\r\nprint(df)\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/105-0.png\" alt=\"Col Addition\" \/><\/div>\n<pre class=\"post-pre\"><code>del df['Matches played']\r\n<\/code><\/pre>\n<p>alternatively<\/p>\n<pre class=\"post-pre\"><code>df.pop('Matches played')\r\n<\/code><\/pre>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/109-0.png\" alt=\"Delcolum\" \/><\/div>\n<hr \/>\n<h2>In summary, to conclude<\/h2>\n<p>During this tutorial, we briefly explored the Python Pandas library, conducted practical exercises to demonstrate the capabilities of Pandas in the field of data science, and discussed the various data structures available in the Python library. Source: Official Pandas Website.<\/p>\n<p>&nbsp;<\/p>\n<p>more python tutorials<\/p>\n<p><a class=\"LinkSuggestion__Link-sc-1gewdgc-4 cLBplk\" href=\"https:\/\/www.silicloud.com\/blog\/python-breakpoint-rewrite-this-utilizing-pure-natural-language-please-provide-one-alternative-introduce-a-stopping-point-in-python-using-the-breakpoint-function\/\" target=\"_blank\" rel=\"noopener\">breakpoint function in Python<span class=\"sc-gswNZR eASTkv\">(Opens in a new browser tab)<\/span><\/a><\/p>\n<p><a class=\"LinkSuggestion__Link-sc-1gewdgc-4 cLBplk\" href=\"https:\/\/www.silicloud.com\/blog\/set-in-python\/\" target=\"_blank\" rel=\"noopener\">Set in Python<span class=\"sc-gswNZR eASTkv\">(Opens in a new browser tab)<\/span><\/a><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The module named Python Pandas. Pandas is an open source library in Python. It provides ready to use high-performance data structures and data analysis tools. Pandas module runs on top of NumPy and it is popularly used for data science and data analytics. NumPy is a low-level data structure that supports multi-dimensional arrays and a [&hellip;]<\/p>\n","protected":false},"author":7,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_import_markdown_pro_load_document_selector":0,"_import_markdown_pro_submit_text_textarea":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-1103","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>A tutorial on the Python Pandas module. - Blog - Silicon Cloud<\/title>\n<meta name=\"description\" content=\"Pandas is an open source library in Python. It provides ready to use high-performance data structures and data analysis tools.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"A tutorial on the Python Pandas module.\" \/>\n<meta property=\"og:description\" content=\"Pandas is an open source library in Python. It provides ready to use high-performance data structures and data analysis tools.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/SiliCloudGlobal\/\" \/>\n<meta property=\"article:published_time\" content=\"2022-07-06T14:41:40+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-06T13:57:53+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/8-0.png\" \/>\n<meta name=\"author\" content=\"Sophia Anderson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@SiliCloudGlobal\" \/>\n<meta name=\"twitter:site\" content=\"@SiliCloudGlobal\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Sophia Anderson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"10 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/\"},\"author\":{\"name\":\"Sophia Anderson\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/19a24313de9c988db3d69226b4a40a30\"},\"headline\":\"A tutorial on the Python Pandas module.\",\"datePublished\":\"2022-07-06T14:41:40+00:00\",\"dateModified\":\"2024-03-06T13:57:53+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/\"},\"wordCount\":1207,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\"},\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/\",\"url\":\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/\",\"name\":\"A tutorial on the Python Pandas module. - Blog - Silicon Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#website\"},\"datePublished\":\"2022-07-06T14:41:40+00:00\",\"dateModified\":\"2024-03-06T13:57:53+00:00\",\"description\":\"Pandas is an open source library in Python. It provides ready to use high-performance data structures and data analysis tools.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.silicloud.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"A tutorial on the Python Pandas module.\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/blog\/\",\"name\":\"Silicon Cloud Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\"},\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#organization\",\"name\":\"Silicon Cloud Blog\",\"url\":\"https:\/\/www.silicloud.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png\",\"contentUrl\":\"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png\",\"width\":1024,\"height\":1024,\"caption\":\"Silicon Cloud Blog\"},\"image\":{\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/SiliCloudGlobal\/\",\"https:\/\/twitter.com\/SiliCloudGlobal\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/19a24313de9c988db3d69226b4a40a30\",\"name\":\"Sophia Anderson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/c726c09aa40e37115fb5c62d0c3ed62c16ca255d3763e2e3ae83a70ddf8c2175?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/c726c09aa40e37115fb5c62d0c3ed62c16ca255d3763e2e3ae83a70ddf8c2175?s=96&d=mm&r=g\",\"caption\":\"Sophia Anderson\"},\"url\":\"https:\/\/www.silicloud.com\/blog\/author\/sophiaanderson\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"A tutorial on the Python Pandas module. - Blog - Silicon Cloud","description":"Pandas is an open source library in Python. It provides ready to use high-performance data structures and data analysis tools.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/","og_locale":"en_US","og_type":"article","og_title":"A tutorial on the Python Pandas module.","og_description":"Pandas is an open source library in Python. It provides ready to use high-performance data structures and data analysis tools.","og_url":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/","og_site_name":"Blog - Silicon Cloud","article_publisher":"https:\/\/www.facebook.com\/SiliCloudGlobal\/","article_published_time":"2022-07-06T14:41:40+00:00","article_modified_time":"2024-03-06T13:57:53+00:00","og_image":[{"url":"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/655dced1cdcf9b6757a00edb\/8-0.png"}],"author":"Sophia Anderson","twitter_card":"summary_large_image","twitter_creator":"@SiliCloudGlobal","twitter_site":"@SiliCloudGlobal","twitter_misc":{"Written by":"Sophia Anderson","Est. reading time":"10 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/#article","isPartOf":{"@id":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/"},"author":{"name":"Sophia Anderson","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/19a24313de9c988db3d69226b4a40a30"},"headline":"A tutorial on the Python Pandas module.","datePublished":"2022-07-06T14:41:40+00:00","dateModified":"2024-03-06T13:57:53+00:00","mainEntityOfPage":{"@id":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/"},"wordCount":1207,"commentCount":0,"publisher":{"@id":"https:\/\/www.silicloud.com\/blog\/#organization"},"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/","url":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/","name":"A tutorial on the Python Pandas module. - Blog - Silicon Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/blog\/#website"},"datePublished":"2022-07-06T14:41:40+00:00","dateModified":"2024-03-06T13:57:53+00:00","description":"Pandas is an open source library in Python. It provides ready to use high-performance data structures and data analysis tools.","breadcrumb":{"@id":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.silicloud.com\/blog\/a-tutorial-on-the-python-pandas-module\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.silicloud.com\/blog\/"},{"@type":"ListItem","position":2,"name":"A tutorial on the Python Pandas module."}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/blog\/#website","url":"https:\/\/www.silicloud.com\/blog\/","name":"Silicon Cloud Blog","description":"","publisher":{"@id":"https:\/\/www.silicloud.com\/blog\/#organization"},"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.silicloud.com\/blog\/#organization","name":"Silicon Cloud Blog","url":"https:\/\/www.silicloud.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png","contentUrl":"https:\/\/www.silicloud.com\/blog\/wp-content\/uploads\/2023\/11\/EN-SILICON-Full.png","width":1024,"height":1024,"caption":"Silicon Cloud Blog"},"image":{"@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/SiliCloudGlobal\/","https:\/\/twitter.com\/SiliCloudGlobal"]},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/19a24313de9c988db3d69226b4a40a30","name":"Sophia Anderson","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.silicloud.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/c726c09aa40e37115fb5c62d0c3ed62c16ca255d3763e2e3ae83a70ddf8c2175?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/c726c09aa40e37115fb5c62d0c3ed62c16ca255d3763e2e3ae83a70ddf8c2175?s=96&d=mm&r=g","caption":"Sophia Anderson"},"url":"https:\/\/www.silicloud.com\/blog\/author\/sophiaanderson\/"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/1103","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/comments?post=1103"}],"version-history":[{"count":2,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/1103\/revisions"}],"predecessor-version":[{"id":1678,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/posts\/1103\/revisions\/1678"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/media?parent=1103"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/categories?post=1103"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/blog\/wp-json\/wp\/v2\/tags?post=1103"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}