{ "cells": [ { "cell_type": "markdown", "metadata": { "id": "mO08F9Bh2fU6" }, "source": [ "# Building a DataFrame from dictionaries and loading data into a DataFrame\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "aU4aSvHu56Hw" }, "outputs": [], "source": [ "import numpy as np\n", "import pandas as pd\n", "import os" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "NO59NdYu2W1z" }, "outputs": [], "source": [ "x = np.linspace(0, 10, 10)\n", "y = np.linspace(0, 10, 10)\n", "\n", "d = {\"x\": x, \"y\": y}" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "hYctW3-G6OzD" }, "outputs": [], "source": [ "df = pd.DataFrame(d)" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "YCtY7mjr6QTA", "outputId": "26f818aa-f02e-4fc4-b65c-e7978ff76f26" }, "outputs": [ { "data": { "text/plain": [ "0 0.000000\n", "1 1.111111\n", "2 2.222222\n", "3 3.333333\n", "4 4.444444\n", "5 5.555556\n", "6 6.666667\n", "7 7.777778\n", "8 8.888889\n", "9 10.000000\n", "Name: y, dtype: float64" ] }, "execution_count": 25, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.y" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "ZWDT-7Z16khQ" }, "outputs": [], "source": [ "path = os.getcwd()\n", "# https://www.kaggle.com/gpreda/covid-world-vaccination-progress?select=country_vaccinations\n", "path = \"https://github.com/hernansalinas/Curso_aprendizaje_estadistico/blob/main/datasets/sesion_01b_country_vaccinations.xlsx?raw=true\"\n", "df = pd.read_excel(path)\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 400 }, "id": "dVSqlOC7-key", "outputId": "4d3c75ba-4f00-4895-8421-4a70c1ef36b0" }, "outputs": [ { "data": { 
"text/html": [ "
<div>\n", "<table border=\"1\" class=\"dataframe\">\n", "<thead>\n",
"<tr><th></th><th>country</th><th>iso_code</th><th>date</th><th>total_vaccinations</th><th>people_vaccinated</th><th>people_fully_vaccinated</th><th>daily_vaccinations_raw</th><th>daily_vaccinations</th><th>total_vaccinations_per_hundred</th><th>people_vaccinated_per_hundred</th><th>people_fully_vaccinated_per_hundred</th><th>daily_vaccinations_per_million</th><th>vaccines</th><th>source_name</th><th>source_website</th></tr>\n",
"</thead>\n", "<tbody>\n",
"<tr><th>0</th><td>Afghanistan</td><td>AFG</td><td>2021-02-22</td><td>0.0</td><td>0.0</td><td>NaN</td><td>NaN</td><td>NaN</td><td>0.0</td><td>0.0</td><td>NaN</td><td>NaN</td><td>Oxford/AstraZeneca</td><td>Government of Afghanistan</td><td>http://www.xinhuanet.com/english/asiapacific/2...</td></tr>\n",
"<tr><th>1</th><td>Afghanistan</td><td>AFG</td><td>2021-02-23</td><td>NaN</td><td>NaN</td><td>NaN</td><td>NaN</td><td>1367.0</td><td>NaN</td><td>NaN</td><td>NaN</td><td>35.0</td><td>Oxford/AstraZeneca</td><td>Government of Afghanistan</td><td>http://www.xinhuanet.com/english/asiapacific/2...</td></tr>\n",
"<tr><th>2</th><td>Afghanistan</td><td>AFG</td><td>2021-02-24</td><td>NaN</td><td>NaN</td><td>NaN</td><td>NaN</td><td>1367.0</td><td>NaN</td><td>NaN</td><td>NaN</td><td>35.0</td><td>Oxford/AstraZeneca</td><td>Government of Afghanistan</td><td>http://www.xinhuanet.com/english/asiapacific/2...</td></tr>\n",
"<tr><th>3</th><td>Afghanistan</td><td>AFG</td><td>2021-02-25</td><td>NaN</td><td>NaN</td><td>NaN</td><td>NaN</td><td>1367.0</td><td>NaN</td><td>NaN</td><td>NaN</td><td>35.0</td><td>Oxford/AstraZeneca</td><td>Government of Afghanistan</td><td>http://www.xinhuanet.com/english/asiapacific/2...</td></tr>\n",
"<tr><th>4</th><td>Afghanistan</td><td>AFG</td><td>2021-02-26</td><td>NaN</td><td>NaN</td><td>NaN</td><td>NaN</td><td>1367.0</td><td>NaN</td><td>NaN</td><td>NaN</td><td>35.0</td><td>Oxford/AstraZeneca</td><td>Government of Afghanistan</td><td>http://www.xinhuanet.com/english/asiapacific/2...</td></tr>\n",
"</tbody>\n", "</table>\n", "</div>
" ], "text/plain": [ " country ... source_website\n", "0 Afghanistan ... http://www.xinhuanet.com/english/asiapacific/2...\n", "1 Afghanistan ... http://www.xinhuanet.com/english/asiapacific/2...\n", "2 Afghanistan ... http://www.xinhuanet.com/english/asiapacific/2...\n", "3 Afghanistan ... http://www.xinhuanet.com/english/asiapacific/2...\n", "4 Afghanistan ... http://www.xinhuanet.com/english/asiapacific/2...\n", "\n", "[5 rows x 15 columns]" ] }, "execution_count": 28, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.head()" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 441 }, "id": "juFHDrhT-m5j", "outputId": "8bb5fb97-c6f6-4152-b4bb-cce6fccbe56a" }, "outputs": [ { "data": { "text/html": [ "
<div>\n", "<table border=\"1\" class=\"dataframe\">\n", "<thead>\n",
"<tr><th></th><th>Temperature (K)</th><th>Luminosity(L/Lo)</th><th>Radius(R/Ro)</th><th>Absolute magnitude(Mv)</th><th>Star type</th><th>Star color</th><th>Spectral Class</th></tr>\n",
"</thead>\n", "<tbody>\n",
"<tr><th>0</th><td>3068</td><td>0.002400</td><td>0.1700</td><td>16.12</td><td>0</td><td>Red</td><td>M</td></tr>\n",
"<tr><th>1</th><td>3042</td><td>0.000500</td><td>0.1542</td><td>16.60</td><td>0</td><td>Red</td><td>M</td></tr>\n",
"<tr><th>2</th><td>2600</td><td>0.000300</td><td>0.1020</td><td>18.70</td><td>0</td><td>Red</td><td>M</td></tr>\n",
"<tr><th>3</th><td>2800</td><td>0.000200</td><td>0.1600</td><td>16.65</td><td>0</td><td>Red</td><td>M</td></tr>\n",
"<tr><th>4</th><td>1939</td><td>0.000138</td><td>0.1030</td><td>20.06</td><td>0</td><td>Red</td><td>M</td></tr>\n",
"<tr><th>...</th><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td><td>...</td></tr>\n",
"<tr><th>235</th><td>38940</td><td>374830.000000</td><td>1356.0000</td><td>-9.93</td><td>5</td><td>Blue</td><td>O</td></tr>\n",
"<tr><th>236</th><td>30839</td><td>834042.000000</td><td>1194.0000</td><td>-10.63</td><td>5</td><td>Blue</td><td>O</td></tr>\n",
"<tr><th>237</th><td>8829</td><td>537493.000000</td><td>1423.0000</td><td>-10.73</td><td>5</td><td>White</td><td>A</td></tr>\n",
"<tr><th>238</th><td>9235</td><td>404940.000000</td><td>1112.0000</td><td>-11.23</td><td>5</td><td>White</td><td>A</td></tr>\n",
"<tr><th>239</th><td>37882</td><td>294903.000000</td><td>1783.0000</td><td>-7.80</td><td>5</td><td>Blue</td><td>O</td></tr>\n",
"</tbody>\n", "</table>\n", "<p>240 rows × 7 columns</p>\n", "</div>" ], "text/plain": [ " Temperature (K) Luminosity(L/Lo) ... Star color Spectral Class\n", "0 3068 0.002400 ... Red M\n", "1 3042 0.000500 ... Red M\n", "2 2600 0.000300 ... Red M\n", "3 2800 0.000200 ... Red M\n", "4 1939 0.000138 ... Red M\n", ".. ... ... ... ... ...\n", "235 38940 374830.000000 ... Blue O\n", "236 30839 834042.000000 ... Blue O\n", "237 8829 537493.000000 ... White A\n", "238 9235 404940.000000 ... White A\n", "239 37882 294903.000000 ... Blue O\n", "\n", "[240 rows x 7 columns]" ] }, "execution_count": 29, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# https://www.kaggle.com/sakhawat18/asteroid-dataset\n", "# https://ssd.jpl.nasa.gov/tools/sbdb_query.html\n", "\n", "path_git = \"https://raw.githubusercontent.com/hernansalinas/Curso_aprendizaje_estadistico/main/datasets/sesion_01b_dataset.csv\"\n", "df = pd.read_csv(path_git)\n", "df" ] }, { "cell_type": "markdown", "metadata": { "id": "KsZoK3jYAzSv" }, "source": [ "Reading a dataset stored in Google Drive" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "Q0LnL4yC-tlB", "outputId": "16b64b2b-f879-44b7-ea33-5abe0015930c" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " location date vaccine total_vaccinations\n", "0 Austria 2021-01-08 Johnson&Johnson 0\n", "1 Austria 2021-01-08 Moderna 0\n", "2 Austria 2021-01-08 Oxford/AstraZeneca 0\n", "3 Austria 2021-01-08 Pfizer/BioNTech 31284\n", "4 Austria 2021-01-15 Johnson&Johnson 0\n", "... ... ... ... ...\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673\n", "21893 European Union 2021-11-20 Pfizer/BioNTech 438725101\n", "21894 European Union 2021-11-20 Sinopharm/Beijing 2157500\n", "21895 European Union 2021-11-20 Sinovac 9\n", "21896 European Union 2021-11-20 Sputnik V 1845062\n", "\n", "[21897 rows x 4 columns]" ] }, "execution_count": 30, "metadata": {}, "output_type": "execute_result" } ], "source": [ "url = \"https://docs.google.com/spreadsheets/d/e/2PACX-1vSHCOR8_Ha6TvBQwIcpjvJ0bzHYel1S8DXl4NHnMhVvdbibrgL_SP6rffuESpaJvPwLuUizXblQtHox/pub?output=csv\"\n", "df = pd.read_csv(url)\n", "df" ] }, { "cell_type": "markdown", "metadata": { "id": "m4_dXXZQH_hY" }, "source": [ "Setting the date column as the index" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "WNs47UiYAVwZ" }, "outputs": [], "source": [ "df = pd.read_csv(url, index_col=\"date\")" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 455 }, "id": "lOHKEiSDAZja", "outputId": "405ccf7f-e783-4e9d-d257-5dcc816707e3" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " location vaccine total_vaccinations\n", "date \n", "2021-01-08 Austria Johnson&Johnson 0\n", "2021-01-08 Austria Moderna 0\n", "2021-01-08 Austria Oxford/AstraZeneca 0\n", "2021-01-08 Austria Pfizer/BioNTech 31284\n", "2021-01-15 Austria Johnson&Johnson 0\n", "... ... ... ...\n", "2021-11-20 European Union Oxford/AstraZeneca 67212673\n", "2021-11-20 European Union Pfizer/BioNTech 438725101\n", "2021-11-20 European Union Sinopharm/Beijing 2157500\n", "2021-11-20 European Union Sinovac 9\n", "2021-11-20 European Union Sputnik V 1845062\n", "\n", "[21897 rows x 3 columns]" ] }, "execution_count": 33, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df" ] }, { "cell_type": "markdown", "metadata": { "id": "lCikUuRTIOYO" }, "source": [ "Renaming columns" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 455 }, "id": "jkWwczhTCK2R", "outputId": "c7b67cd6-968e-48b5-e186-2bc236c801bb" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " Location Vaccine Total Vaccinations\n", "date \n", "2021-01-08 Austria Johnson&Johnson 0\n", "2021-01-08 Austria Moderna 0\n", "2021-01-08 Austria Oxford/AstraZeneca 0\n", "2021-01-08 Austria Pfizer/BioNTech 31284\n", "2021-01-15 Austria Johnson&Johnson 0\n", "... ... ... ...\n", "2021-11-20 European Union Oxford/AstraZeneca 67212673\n", "2021-11-20 European Union Pfizer/BioNTech 438725101\n", "2021-11-20 European Union Sinopharm/Beijing 2157500\n", "2021-11-20 European Union Sinovac 9\n", "2021-11-20 European Union Sputnik V 1845062\n", "\n", "[21897 rows x 3 columns]" ] }, "execution_count": 34, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# rename() returns a new DataFrame; inplace=True would modify df instead\n", "df1 = df.rename(columns={\"location\": \"Location\",\n", " \"vaccine\": \" Vaccine \",\n", " \"total_vaccinations\": \"Total Vaccinations\"}).copy()\n", "df1\n", "# Note the spaces around Vaccine" ] }, { "cell_type": "markdown", "metadata": { "id": "ofqn2YCJIYJU" }, "source": [ "Converting all column names to lowercase" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 455 }, "id": "ZdEO9FzpCavn", "outputId": "f4477ec4-6bc5-489a-e310-02b81717e520" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " location vaccine total vaccinations\n", "date \n", "2021-01-08 Austria Johnson&Johnson 0\n", "2021-01-08 Austria Moderna 0\n", "2021-01-08 Austria Oxford/AstraZeneca 0\n", "2021-01-08 Austria Pfizer/BioNTech 31284\n", "2021-01-15 Austria Johnson&Johnson 0\n", "... ... ... ...\n", "2021-11-20 European Union Oxford/AstraZeneca 67212673\n", "2021-11-20 European Union Pfizer/BioNTech 438725101\n", "2021-11-20 European Union Sinopharm/Beijing 2157500\n", "2021-11-20 European Union Sinovac 9\n", "2021-11-20 European Union Sputnik V 1845062\n", "\n", "[21897 rows x 3 columns]" ] }, "execution_count": 36, "metadata": {}, "output_type": "execute_result" } ], "source": [ "new_df = df1.rename(mapper=str.lower, axis=\"columns\")\n", "new_df\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "7PDI3y862Iyr", "outputId": "fbcdcf03-bee9-4be4-b85a-af43649c970b" }, "outputs": [ { "data": { "text/plain": [ "Index(['location', ' vaccine ', 'total vaccinations'], dtype='object')" ] }, "execution_count": 37, "metadata": {}, "output_type": "execute_result" } ], "source": [ "new_df.columns" ] }, { "cell_type": "markdown", "metadata": { "id": "YMkz2Sg4IcG2" }, "source": [ "Stripping leading and trailing spaces from all column names" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 455 }, "id": "Em4Z5QgbDIbF", "outputId": "379a0197-f70e-47a8-cf7f-b6dcf40cbfbb" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " location vaccine total vaccinations\n", "date \n", "2021-01-08 Austria Johnson&Johnson 0\n", "2021-01-08 Austria Moderna 0\n", "2021-01-08 Austria Oxford/AstraZeneca 0\n", "2021-01-08 Austria Pfizer/BioNTech 31284\n", "2021-01-15 Austria Johnson&Johnson 0\n", "... ... ... ...\n", "2021-11-20 European Union Oxford/AstraZeneca 67212673\n", "2021-11-20 European Union Pfizer/BioNTech 438725101\n", "2021-11-20 European Union Sinopharm/Beijing 2157500\n", "2021-11-20 European Union Sinovac 9\n", "2021-11-20 European Union Sputnik V 1845062\n", "\n", "[21897 rows x 3 columns]" ] }, "execution_count": 38, "metadata": {}, "output_type": "execute_result" } ], "source": [ "new_df = new_df.rename(mapper=str.strip, axis=\"columns\")\n", "new_df" ] }, { "cell_type": "markdown", "metadata": { "id": "m7Lz7n1wIiYE" }, "source": [ "Resetting the index to integer values" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "PEQ4snAjIhHf", "outputId": "bd26360d-5dda-4e3d-f89f-5b35e4db7c36" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " date location vaccine total vaccinations\n", "0 2021-01-08 Austria Johnson&Johnson 0\n", "1 2021-01-08 Austria Moderna 0\n", "2 2021-01-08 Austria Oxford/AstraZeneca 0\n", "3 2021-01-08 Austria Pfizer/BioNTech 31284\n", "4 2021-01-15 Austria Johnson&Johnson 0\n", "... ... ... ... ...\n", "21892 2021-11-20 European Union Oxford/AstraZeneca 67212673\n", "21893 2021-11-20 European Union Pfizer/BioNTech 438725101\n", "21894 2021-11-20 European Union Sinopharm/Beijing 2157500\n", "21895 2021-11-20 European Union Sinovac 9\n", "21896 2021-11-20 European Union Sputnik V 1845062\n", "\n", "[21897 rows x 4 columns]" ] }, "execution_count": 39, "metadata": {}, "output_type": "execute_result" } ], "source": [ "new_df = new_df.reset_index()\n", "new_df" ] }, { "cell_type": "markdown", "metadata": { "id": "8365UsknEldi" }, "source": [ "Another way to make the same change" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "0hQJDv2SDZcU", "outputId": "5f98198c-c878-44eb-a3f5-f34c9819c82f" }, "outputs": [ { "data": { "text/plain": [ "Index(['Location', ' Vaccine ', 'Total Vaccinations'], dtype='object')" ] }, "execution_count": 40, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df1.columns" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "3ZqnXBAUEiRs" }, "outputs": [], "source": [ "cols = [c.lower().strip() for c in df1.columns]" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "5AQRngv0E1yO" }, "outputs": [], "source": [ "df1.columns = cols" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "HOPBbPaZE2WE", "outputId": "1436c37c-bf1a-4c56-cfde-106a916fdee5" }, "outputs": [ { "data": { "text/plain": [ "Index(['location', 'vaccine', 'total vaccinations'], dtype='object')" ] }, "execution_count": 43, "metadata": {}, "output_type": "execute_result" } 
], "source": [ "df1.columns" ] }, { "cell_type": "markdown", "metadata": { "id": "t7TBxazlNLyS" }, "source": [ "![img](https://github.com/hernansalinas/Curso_aprendizaje_estadistico/blob/main/Sesiones/imagenes/codeCase.png?raw=true \"CodeCase\")" ] }, { "cell_type": "markdown", "metadata": { "id": "g3lsrvjRF4KB" }, "source": [ "# Pascal Case notation" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "NejJMDLPFmWl", "outputId": "884d4aa3-4b12-4709-8ebc-4e27b4979abb" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " location date vaccine total_vaccinations\n", "0 Austria 2021-01-08 Johnson&Johnson 0\n", "1 Austria 2021-01-08 Moderna 0\n", "2 Austria 2021-01-08 Oxford/AstraZeneca 0\n", "3 Austria 2021-01-08 Pfizer/BioNTech 31284\n", "4 Austria 2021-01-15 Johnson&Johnson 0\n", "... ... ... ... ...\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673\n", "21893 European Union 2021-11-20 Pfizer/BioNTech 438725101\n", "21894 European Union 2021-11-20 Sinopharm/Beijing 2157500\n", "21895 European Union 2021-11-20 Sinovac 9\n", "21896 European Union 2021-11-20 Sputnik V 1845062\n", "\n", "[21897 rows x 4 columns]" ] }, "execution_count": 113, "metadata": {}, "output_type": "execute_result" } ], "source": [ "#https://www.kaggle.com/saliblue/country-vaccinations-by-manufacturer\n", "url = \"https://docs.google.com/spreadsheets/d/e/2PACX-1vSHCOR8_Ha6TvBQwIcpjvJ0bzHYel1S8DXl4NHnMhVvdbibrgL_SP6rffuESpaJvPwLuUizXblQtHox/pub?output=csv\"\n", "df = pd.read_csv(url)\n", "df\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "9gw7BDqnNddr", "outputId": "e2e5d35c-569d-4f3d-934c-51bf3da4e27e" }, "outputs": [ { "data": { "text/plain": [ "['Hello', 'World']" ] }, "execution_count": 114, "metadata": {}, "output_type": "execute_result" } ], "source": [ "#df.columns\n", "a = \"hello world \"\n", "col = [c.capitalize() for c in a.split()]\n", "col" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "zCQO2fULOIBo", "outputId": "1e325dfc-a4b8-485c-f3b1-60fdad93e215" }, "outputs": [ { "data": { "text/plain": [ "Index(['location', 'date', 'vaccine', 'total_vaccinations'], dtype='object')" ] }, "execution_count": 115, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.columns" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "bAs9cXaTF5Io" }, "outputs": [], "source": [ "df.columns = 
[c.replace(\"_\", \" \") for c in df.columns]" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "eiVCDVMgGMjn", "outputId": "60430311-1b22-4ad6-edd9-ad68d95fa87c" }, "outputs": [ { "data": { "text/plain": [ "Index(['location', 'date', 'vaccine', 'total vaccinations'], dtype='object')" ] }, "execution_count": 117, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.columns" ] }, { "cell_type": "markdown", "metadata": { "id": "-wOxELRVR5Pa" }, "source": [ "Step by step toward a more compact expression: a PascalCase example" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "Ly3_hoPjRyn5", "outputId": "7c8054fc-b159-4014-aaca-f8c9a14acc33" }, "outputs": [ { "data": { "text/plain": [ "['location', 'date', 'vaccine', 'total vaccinations']" ] }, "execution_count": 118, "metadata": {}, "output_type": "execute_result" } ], "source": [ "a = [ cols for cols in df.columns ]\n", "a" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "zyOnE5G0SC6g", "outputId": "fc37b0b0-755e-4ff9-ec1b-9818e3caba52" }, "outputs": [ { "data": { "text/plain": [ "[['location'], ['date'], ['vaccine'], ['total', 'vaccinations']]" ] }, "execution_count": 119, "metadata": {}, "output_type": "execute_result" } ], "source": [ "a = [ [c for c in cols.split()] for cols in df.columns ]\n", "a" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "tm6Aj82QSgQD", "outputId": "aaa64fc8-2639-422d-d114-cf058a791426" }, "outputs": [ { "data": { "text/plain": [ "[['Location'], ['Date'], ['Vaccine'], ['Total', 'Vaccinations']]" ] }, "execution_count": 120, "metadata": {}, "output_type": "execute_result" } ], "source": [ "a = [ [c.capitalize() for c in cols.split()] for cols in df.columns ]\n", "a" ] }, { 
"cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 36 }, "id": "tYD6MXJWS2pJ", "outputId": "72c5d0b8-e6ed-4721-8715-3062576054ab" }, "outputs": [ { "data": { "application/vnd.google.colaboratory.intrinsic+json": { "type": "string" }, "text/plain": [ "'adfads Bsdfadf'" ] }, "execution_count": 121, "metadata": {}, "output_type": "execute_result" } ], "source": [ "a = [\"adfads\",\"Bsdfadf\"]\n", "\" \".join(a)" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "T7N-XoT6O8zM", "outputId": "be610b8d-2ede-4b04-b457-5dd53a8259d8" }, "outputs": [ { "data": { "text/plain": [ "['Location', 'Date', 'Vaccine', 'TotalVaccinations']" ] }, "execution_count": 122, "metadata": {}, "output_type": "execute_result" } ], "source": [ "a =[ \"\".join([c.capitalize() for c in cols.split()]) for cols in df.columns ]\n", "a" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "CBza5uw9RILW" }, "outputs": [], "source": [ "cols=a" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "IpnKQBt4Rmzh", "outputId": "20c27b87-30a9-4efb-d0e1-4384cdd3112a" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " Location Date Vaccine TotalVaccinations\n", "0 Austria 2021-01-08 Johnson&Johnson 0\n", "1 Austria 2021-01-08 Moderna 0\n", "2 Austria 2021-01-08 Oxford/AstraZeneca 0\n", "3 Austria 2021-01-08 Pfizer/BioNTech 31284\n", "4 Austria 2021-01-15 Johnson&Johnson 0\n", "... ... ... ... ...\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673\n", "21893 European Union 2021-11-20 Pfizer/BioNTech 438725101\n", "21894 European Union 2021-11-20 Sinopharm/Beijing 2157500\n", "21895 European Union 2021-11-20 Sinovac 9\n", "21896 European Union 2021-11-20 Sputnik V 1845062\n", "\n", "[21897 rows x 4 columns]" ] }, "execution_count": 124, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.columns = cols\n", "df" ] }, { "cell_type": "markdown", "metadata": { "id": "5RUXQQ5NO3Mq" }, "source": [ "# Boolean masks on columns" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "3bH8btGwGvH9" }, "outputs": [], "source": [ "tf = df[\"TotalVaccinations\"] > 2157500" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "7NrcMjBCHaKY", "outputId": "ab956500-189c-406d-afd8-d9cccf774651" }, "outputs": [ { "data": { "text/html": [ "
\n", "
" ], "text/plain": [ " Location Date Vaccine TotalVaccinations\n", "67 Austria 2021-04-30 Pfizer/BioNTech 2273457\n", "71 Austria 2021-05-07 Pfizer/BioNTech 2604643\n", "75 Austria 2021-05-14 Pfizer/BioNTech 2904840\n", "79 Austria 2021-05-21 Pfizer/BioNTech 3283752\n", "83 Austria 2021-05-28 Pfizer/BioNTech 3620298\n", "... ... ... ... ...\n", "21886 European Union 2021-11-19 Pfizer/BioNTech 438577477\n", "21890 European Union 2021-11-20 Johnson&Johnson 16950779\n", "21891 European Union 2021-11-20 Moderna 61206560\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673\n", "21893 European Union 2021-11-20 Pfizer/BioNTech 438725101\n", "\n", "[7626 rows x 4 columns]" ] }, "execution_count": 126, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df[tf]  # Mask: yields a new DataFrame with a different number of rows" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "MzAhfVwqHeAD", "outputId": "9e374f72-6621-4fe8-c4f3-ced91972228c" }, "outputs": [ { "data": { "text/html": [ "
\n", "
" ], "text/plain": [ " Location Date Vaccine TotalVaccinations\n", "0 NaN NaN NaN NaN\n", "1 NaN NaN NaN NaN\n", "2 NaN NaN NaN NaN\n", "3 NaN NaN NaN NaN\n", "4 NaN NaN NaN NaN\n", "... ... ... ... ...\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673.0\n", "21893 European Union 2021-11-20 Pfizer/BioNTech 438725101.0\n", "21894 NaN NaN NaN NaN\n", "21895 NaN NaN NaN NaN\n", "21896 NaN NaN NaN NaN\n", "\n", "[21897 rows x 4 columns]" ] }, "execution_count": 127, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.where(tf)  # Sets every column to NaN in the rows where the condition is not met" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "TkMPe3KqHxFY", "outputId": "33091a15-7b3c-4350-b07c-2a24d7feee04" }, "outputs": [ { "data": { "text/html": [ "
\n", "
" ], "text/plain": [ " Location Date Vaccine TotalVaccinations\n", "67 Austria 2021-04-30 Pfizer/BioNTech 2273457.0\n", "71 Austria 2021-05-07 Pfizer/BioNTech 2604643.0\n", "75 Austria 2021-05-14 Pfizer/BioNTech 2904840.0\n", "79 Austria 2021-05-21 Pfizer/BioNTech 3283752.0\n", "83 Austria 2021-05-28 Pfizer/BioNTech 3620298.0\n", "... ... ... ... ...\n", "21886 European Union 2021-11-19 Pfizer/BioNTech 438577477.0\n", "21890 European Union 2021-11-20 Johnson&Johnson 16950779.0\n", "21891 European Union 2021-11-20 Moderna 61206560.0\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673.0\n", "21893 European Union 2021-11-20 Pfizer/BioNTech 438725101.0\n", "\n", "[7626 rows x 4 columns]" ] }, "execution_count": 128, "metadata": {}, "output_type": "execute_result" } ], "source": [ "n_df = df.where(tf).copy()\n", "n_df.dropna()  # Recovers the DataFrame produced by the mask, filtered like an array (the numeric column is now float)" ] }, { "cell_type": "markdown", "metadata": { "id": "hyy88nqqKDhH" }, "source": [ "Returning to the mask" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "YnDQTZ6yJrUC" }, "outputs": [], "source": [ "df = df[tf]" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "f1b7l4CSK7AL", "outputId": "dad15279-5a4d-405e-aa25-b4dd2b98454d" }, "outputs": [ { "data": { "text/html": [ "
\n", "
" ], "text/plain": [ " Location Date Vaccine TotalVaccinations\n", "67 Austria 2021-04-30 Pfizer/BioNTech 2273457\n", "71 Austria 2021-05-07 Pfizer/BioNTech 2604643\n", "75 Austria 2021-05-14 Pfizer/BioNTech 2904840\n", "79 Austria 2021-05-21 Pfizer/BioNTech 3283752\n", "83 Austria 2021-05-28 Pfizer/BioNTech 3620298\n", "... ... ... ... ...\n", "21886 European Union 2021-11-19 Pfizer/BioNTech 438577477\n", "21890 European Union 2021-11-20 Johnson&Johnson 16950779\n", "21891 European Union 2021-11-20 Moderna 61206560\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673\n", "21893 European Union 2021-11-20 Pfizer/BioNTech 438725101\n", "\n", "[7626 rows x 4 columns]" ] }, "execution_count": 130, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df" ] }, { "cell_type": "markdown", "metadata": { "id": "2HT7gAqaL_yO" }, "source": [ "Comparing values within the same column" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "kShE-lRWKrGJ", "outputId": "d20f72d6-1245-49d1-cf6d-1f94aa009917" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " Location Date Vaccine TotalVaccinations\n", "71 Austria 2021-05-07 Pfizer/BioNTech 2604643\n", "75 Austria 2021-05-14 Pfizer/BioNTech 2904840\n", "79 Austria 2021-05-21 Pfizer/BioNTech 3283752\n", "83 Austria 2021-05-28 Pfizer/BioNTech 3620298\n", "87 Austria 2021-06-04 Pfizer/BioNTech 4047114\n", "... ... ... ... ...\n", "21876 European Union 2021-11-18 Johnson&Johnson 16867390\n", "21877 European Union 2021-11-18 Moderna 60957149\n", "21883 European Union 2021-11-19 Johnson&Johnson 16948949\n", "21884 European Union 2021-11-19 Moderna 61165719\n", "21890 European Union 2021-11-20 Johnson&Johnson 16950779\n", "\n", "[6348 rows x 4 columns]" ] }, "execution_count": 133, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df[ (df[\"TotalVaccinations\"]>2273457) & (df[\"TotalVaccinations\"]<61206560 ) ]" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "Syh9VQC-5stx", "outputId": "5ab8cf09-8fef-4071-ef5b-cc17d4e69986" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " index Location Date Vaccine TotalVaccinations\n", "0 67 Austria 2021-04-30 Pfizer/BioNTech 2273457\n", "1 71 Austria 2021-05-07 Pfizer/BioNTech 2604643\n", "2 75 Austria 2021-05-14 Pfizer/BioNTech 2904840\n", "3 79 Austria 2021-05-21 Pfizer/BioNTech 3283752\n", "4 83 Austria 2021-05-28 Pfizer/BioNTech 3620298\n", "... ... ... ... ... ...\n", "7621 21886 European Union 2021-11-19 Pfizer/BioNTech 438577477\n", "7622 21890 European Union 2021-11-20 Johnson&Johnson 16950779\n", "7623 21891 European Union 2021-11-20 Moderna 61206560\n", "7624 21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673\n", "7625 21893 European Union 2021-11-20 Pfizer/BioNTech 438725101\n", "\n", "[7626 rows x 5 columns]" ] }, "execution_count": 135, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.reset_index()" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "qFO5yCaMCViS", "outputId": "03ca3317-18e1-405d-a612-dccf9d4c2f47" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " Location Date Vaccine TotalVaccinations\n", "0 Austria 2021-04-30 Pfizer/BioNTech 2273457\n", "1 Austria 2021-05-07 Pfizer/BioNTech 2604643\n", "2 Austria 2021-05-14 Pfizer/BioNTech 2904840\n", "3 Austria 2021-05-21 Pfizer/BioNTech 3283752\n", "4 Austria 2021-05-28 Pfizer/BioNTech 3620298\n", "... ... ... ... ...\n", "7621 European Union 2021-11-19 Pfizer/BioNTech 438577477\n", "7622 European Union 2021-11-20 Johnson&Johnson 16950779\n", "7623 European Union 2021-11-20 Moderna 61206560\n", "7624 European Union 2021-11-20 Oxford/AstraZeneca 67212673\n", "7625 European Union 2021-11-20 Pfizer/BioNTech 438725101\n", "\n", "[7626 rows x 4 columns]" ] }, "execution_count": 138, "metadata": {}, "output_type": "execute_result" } ], "source": [ "q=df.set_index(\"Location\")\n", "q.reset_index()" ] }, { "cell_type": "markdown", "metadata": { "id": "LzRnzvzlDXoW" }, "source": [ "unique()" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "bZ-WpOTcCVk2", "outputId": "5432bdbe-a283-4bbe-bbe9-a435a4b2ce8e" }, "outputs": [ { "data": { "text/plain": [ "array(['Austria', 'Belgium', 'Chile', 'Croatia', 'Czechia', 'Denmark',\n", " 'Ecuador', 'Finland', 'France', 'Germany', 'Hong Kong', 'Hungary',\n", " 'Ireland', 'Italy', 'Japan', 'Lithuania', 'Netherlands', 'Norway',\n", " 'Poland', 'Portugal', 'Romania', 'Slovakia', 'South Korea',\n", " 'Spain', 'Sweden', 'Switzerland', 'Ukraine', 'United States',\n", " 'Uruguay', 'European Union'], dtype=object)" ] }, "execution_count": 139, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.Location.unique()" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "IuIjpTIvCVnW", "outputId": "efd33359-dac3-4468-b5d8-3d6ba5480cde" }, "outputs": [ { "data": { "text/plain": [ "array(['Pfizer/BioNTech', 'Oxford/AstraZeneca', 'Sinovac', 'Moderna',\n", " 
'Johnson&Johnson'], dtype=object)" ] }, "execution_count": 140, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.Vaccine.unique()" ] }, { "cell_type": "markdown", "metadata": { "id": "ArthcctTDkPi" }, "source": [ "Defining columns" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "A9ygXhyXDi0m", "outputId": "0bb09ffe-1242-4b42-cae6-98731a7db37b" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " Location Date Vaccine\n", "67 Austria 2021-04-30 Pfizer/BioNTech\n", "71 Austria 2021-05-07 Pfizer/BioNTech\n", "75 Austria 2021-05-14 Pfizer/BioNTech\n", "79 Austria 2021-05-21 Pfizer/BioNTech\n", "83 Austria 2021-05-28 Pfizer/BioNTech\n", "... ... ... ...\n", "21886 European Union 2021-11-19 Pfizer/BioNTech\n", "21890 European Union 2021-11-20 Johnson&Johnson\n", "21891 European Union 2021-11-20 Moderna\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca\n", "21893 European Union 2021-11-20 Pfizer/BioNTech\n", "\n", "[7626 rows x 3 columns]" ] }, "execution_count": 142, "metadata": {}, "output_type": "execute_result" } ], "source": [ "cols=['Location', 'Date', 'Vaccine']\n", "df[cols]" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 455 }, "id": "HdoB5slgCVqO", "outputId": "dafd0f0e-fdfe-4811-a944-f95687310644" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " Date TotalVaccinations\n", "Location Vaccine \n", "Austria Pfizer/BioNTech 2021-04-30 2273457\n", " Pfizer/BioNTech 2021-05-07 2604643\n", " Pfizer/BioNTech 2021-05-14 2904840\n", " Pfizer/BioNTech 2021-05-21 3283752\n", " Pfizer/BioNTech 2021-05-28 3620298\n", "... ... ...\n", "European Union Pfizer/BioNTech 2021-11-19 438577477\n", " Johnson&Johnson 2021-11-20 16950779\n", " Moderna 2021-11-20 61206560\n", " Oxford/AstraZeneca 2021-11-20 67212673\n", " Pfizer/BioNTech 2021-11-20 438725101\n", "\n", "[7626 rows x 2 columns]" ] }, "execution_count": 149, "metadata": {}, "output_type": "execute_result" } ], "source": [ "g=df.set_index(['Location','Vaccine'])\n", "g" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 1000 }, "id": "ly4BOasaCVtG", "outputId": "95975871-61bd-4139-b739-22d2304dfa5e" }, "outputs": [ { "data": { "text/html": [ "
\n", "
" ], "text/plain": [ " Date TotalVaccinations\n", "Vaccine \n", "Pfizer/BioNTech 2021-04-30 2273457\n", "Pfizer/BioNTech 2021-05-07 2604643\n", "Pfizer/BioNTech 2021-05-14 2904840\n", "Pfizer/BioNTech 2021-05-21 3283752\n", "Pfizer/BioNTech 2021-05-28 3620298\n", "Pfizer/BioNTech 2021-06-04 4047114\n", "Pfizer/BioNTech 2021-06-11 4487274\n", "Pfizer/BioNTech 2021-06-18 4944088\n", "Pfizer/BioNTech 2021-06-25 5391151\n", "Pfizer/BioNTech 2021-07-02 5784489\n", "Pfizer/BioNTech 2021-07-09 6116071\n", "Pfizer/BioNTech 2021-07-16 6392275\n", "Pfizer/BioNTech 2021-07-23 6644089\n", "Pfizer/BioNTech 2021-07-30 6856749\n", "Pfizer/BioNTech 2021-08-06 7035563\n", "Pfizer/BioNTech 2021-08-13 7161389\n", "Pfizer/BioNTech 2021-08-20 7255573\n", "Pfizer/BioNTech 2021-08-27 7314987\n", "Pfizer/BioNTech 2021-09-03 7367498\n", "Pfizer/BioNTech 2021-09-10 7423706\n", "Pfizer/BioNTech 2021-09-17 7495432\n", "Pfizer/BioNTech 2021-09-24 7570056\n", "Pfizer/BioNTech 2021-10-01 7652494\n", "Pfizer/BioNTech 2021-10-08 7765350\n", "Pfizer/BioNTech 2021-10-15 7873668\n", "Pfizer/BioNTech 2021-10-22 8000912\n", "Pfizer/BioNTech 2021-10-29 8117682\n", "Pfizer/BioNTech 2021-11-05 8336536\n", "Pfizer/BioNTech 2021-11-12 8758674\n", "Pfizer/BioNTech 2021-11-19 9284152" ] }, "execution_count": 160, "metadata": {}, "output_type": "execute_result" } ], "source": [ "g.loc[\"Austria\"]" ] }, { "cell_type": "markdown", "metadata": { "id": "YweapkaM5b8D" }, "source": [ "The groupby operation" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "jzrVlOVK3_zF", "outputId": "150d5b7d-f8bc-4570-8140-5263576eebbd" }, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 81, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.groupby([\"Location\"])" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 1000 }, "id": "lll9mMpY4D0R", 
"outputId": "1e8aa034-e338-44dc-e8b3-f175b952a09e" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " Date Vaccine TotalVaccinations\n", "Location \n", "Austria 30 30 30\n", "Belgium 51 51 51\n", "Chile 476 476 476\n", "Croatia 18 18 18\n", "Czechia 212 212 212\n", "Denmark 27 27 27\n", "Ecuador 287 287 287\n", "European Union 1019 1019 1019\n", "Finland 26 26 26\n", "France 720 720 720\n", "Germany 867 867 867\n", "Hong Kong 260 260 260\n", "Hungary 31 31 31\n", "Ireland 25 25 25\n", "Italy 711 711 711\n", "Japan 135 135 135\n", "Lithuania 3 3 3\n", "Netherlands 57 57 57\n", "Norway 25 25 25\n", "Poland 98 98 98\n", "Portugal 45 45 45\n", "Romania 240 240 240\n", "Slovakia 23 23 23\n", "South Korea 472 472 472\n", "Spain 100 100 100\n", "Sweden 31 31 31\n", "Switzerland 358 358 358\n", "Ukraine 264 264 264\n", "United States 841 841 841\n", "Uruguay 174 174 174" ] }, "execution_count": 85, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.groupby([\"Location\"]).count()\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 237 }, "id": "KPhcLn2V4iWx", "outputId": "d21eea83-09b6-4226-a093-6d89e6205fc4" }, "outputs": [ { "data": { "text/html": [ "
" ], "text/plain": [ " Location Date TotalVaccinations\n", "Vaccine \n", "Johnson&Johnson 552 552 552\n", "Moderna 1499 1499 1499\n", "Oxford/AstraZeneca 1383 1383 1383\n", "Pfizer/BioNTech 3410 3410 3410\n", "Sinovac 782 782 782" ] }, "execution_count": 86, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.groupby([\"Vaccine\"]).count()\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "FQvezD1R6Ctj" }, "outputs": [], "source": [ "part_df = df[df.Location==\"Austria\"].reset_index()\n" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "AT900oDKEzv7" }, "outputs": [], "source": [ "mask = df.Location.isnull()" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 423 }, "id": "CoLsYq6SEz6s", "outputId": "11b98c8d-f54e-4f1b-b371-0693481b3e68" }, "outputs": [ { "data": { "text/html": [ "
\n", "
" ], "text/plain": [ " Location Date Vaccine TotalVaccinations\n", "67 Austria 2021-04-30 Pfizer/BioNTech 2273457\n", "71 Austria 2021-05-07 Pfizer/BioNTech 2604643\n", "75 Austria 2021-05-14 Pfizer/BioNTech 2904840\n", "79 Austria 2021-05-21 Pfizer/BioNTech 3283752\n", "83 Austria 2021-05-28 Pfizer/BioNTech 3620298\n", "... ... ... ... ...\n", "21886 European Union 2021-11-19 Pfizer/BioNTech 438577477\n", "21890 European Union 2021-11-20 Johnson&Johnson 16950779\n", "21891 European Union 2021-11-20 Moderna 61206560\n", "21892 European Union 2021-11-20 Oxford/AstraZeneca 67212673\n", "21893 European Union 2021-11-20 Pfizer/BioNTech 438725101\n", "\n", "[7626 rows x 4 columns]" ] }, "execution_count": 165, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.fillna(0)" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "JGTR5OK1Ez-J" }, "outputs": [], "source": [] }, { "cell_type": "markdown", "metadata": { "id": "UrIgqDWQZGik" }, "source": [] }, { "cell_type": "markdown", "metadata": { "id": "lYFVJTpNZGlc" }, "source": [] }, { "cell_type": "markdown", "metadata": { "id": "3jKw7WwEZGpZ" }, "source": [ "Time series \n", "https://raw.githubusercontent.com/jbrownlee/Datasets/master/daily-min-temperatures.csv" ] }, { "cell_type": "markdown", "metadata": { "id": "L4rLg5bQ9d2D" }, "source": [ "Some repositories and pages of interest\n", "\n", "\n", "1. https://www.nature.com/sdata/policies/repositories\n", "\n", "2. https://paperswithcode.com/\n", "\n", "3. https://towardsdatascience.com/31-datasets-for-your-next-data-science-project-6ef9a6f8cac6\n", "\n", "4. https://www.data.gov/\n", "\n", "5. https://archive.ics.uci.edu/ml/index.php\n", "\n", "6. https://data.world/datasets/geodata\n", "\n", "7. https://matmatch.com/advanced-search?categories=ceramic\n", "\n", "8. https://github.com/sedaoturak/data-resources-for-materials-science\n", "\n", "\n", "9. 
https://guides.library.cmu.edu/machine-learning/datasets" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "v2mDZbXR83m4" }, "outputs": [], "source": [] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "urTeXCN5830I" }, "outputs": [], "source": [] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "UzDnFEBc832-" }, "outputs": [], "source": [] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "rW0o-sUK836Y" }, "outputs": [], "source": [] } ], "metadata": { "colab": { "name": "Sesion_01b_pandas.ipynb", "provenance": [] }, "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.8.5" } }, "nbformat": 4, "nbformat_minor": 1 }