List: AWS | Curated by abhinaya rajaram

Sep 20, 2022
22 stories
AWS
In
Towards Data Science
by
Patrick Brus
MLOps: Bring your Models into Production using the CloudContinuous Deployment Pipeline of your Deep Learning Model to the Cloud using AWS, S3 and CloudFormation
Sep 19, 2022
1
Sep 19, 2022
1
AltexSoft Inc
What is Data Pipeline: Components, Types, and Use CasesLet’s say you run a large online bookstore. It’s open 24/7. Users may place orders and pay for them literally every minute or second. That…
Aug 13, 2020
Aug 13, 2020
Subhash Burramsetty
AWS Data Wrangler — Simplifying Pandas integration with AWS data related servicesEnterprise organisations are utilising cloud services to build data lakes, warehouses, automated ETL pipelines…
Sep 11, 2020
Sep 11, 2020
In
Towards Data Science
by
Antonio Cachuan
A gentle introduction to Apache Arrow with Apache Spark and PandasThis time I am going to try to explain how can we use Apache Arrow in conjunction with Apache Spark and Python. First, let me share some…
Jan 29, 2019
4
Jan 29, 2019
4
Katia SEBIH
How to automate data extractions using AWSPART I: Building the infrastructure
Jan 3, 2022
Jan 3, 2022
In
Dev Genius
by
Rohit Kumar Prajapati
Basic ETL using PysparkIn this post, we will perform ETL operations using PySpark.
Sep 2, 2022
3
Sep 2, 2022
3
In
Python in Plain English
by
Mahbub Zaman
How To Create a Docker Image From a ContainerQuickly run python scripts inside a Docker image
Sep 11, 2021
1
Sep 11, 2021
1
Shivam Shrivastava
Load data into Redshift from S3Ever wondered how Zomato knows what’s you favorite restaurant? Or how Myntra knows what dresses you like? They do this using your…
Apr 2, 2022
Apr 2, 2022
In
Innovation-res
by
George Bakas
Setup Custom AWS lambda (λ) function dependencies using Docker containersLet’s face it… When it comes to running serverless applications, AWS Lambda (λ) service is one of the most easy to develop and deploy. With…
Jun 22, 2022
Jun 22, 2022
Jaynab Khatun
How to read multiple CSV file from S3 location using Lambda functionSometime we need to read data from S3 location. The data can be store in multiple file in S3. We can read data from multiple file like CSV…
Jun 27, 2022
Jun 27, 2022
In
Towards Dev
by
Lloyd Matereke
Deploying a Simple CI/CD Pipeline using AWS CDK (Python)Introduction
Apr 5, 2022
1
Apr 5, 2022
1
Subham Kumar Sahoo
Deploy AWS lambda function from containerDue to 250 MB limit on lambda package code, we will be using Docker container image to deploy lambda function to AWS with 10 GB size limit.
Jun 5, 2022
Jun 5, 2022
In
Dev Genius
by
Haq Nawaz
How to install Apache Airflow on Docker with a custom image?Using Apache Airflow, Docker
Sep 5, 2022
Sep 5, 2022
In
Towards Dev
by
Ruchi Sharma
Parsing JSON dataset using PandasPhoto by Gabriel Heinzer on Unsplash
Mar 26, 2022
Mar 26, 2022
In
Python in Plain English
by
Antonio Soto
How to Read API Data Using PythonA beginner’s guide on reading API data using Python.
Aug 18, 2022
Aug 18, 2022
Abdul Rafee Wahab
How-to: Create an ETL Job using AWS Glue Studio, S3, & AthenaBackground
Apr 25, 2022
1
Apr 25, 2022
1
Joan Ngugi
Understanding AWS Glue for ETLIn the big data world, the biggest problem for many companies might be getting insights from data before it’s outdated. If you need to…
Apr 21, 2022
2
Apr 21, 2022
2
Sicong Zhao
How to build a data pipeline on AWS — Part 1Recently I have been working on daoitright.xyz, a data center of DAOs with a focus on quantifying the fairness of the DAO governance. To…
May 6, 2022
May 6, 2022
In
Towards Dev
by
Bilal Mussa
Ingesting data from Google Cloud Storage into Python and Pushing into BigQuery using GCP (functions…Recently I had to load data from Google Cloud Storage into BigQuery and automate the process so that it will run each day at 8am.
Dec 1, 2021
2
Dec 1, 2021
2
In
Dev Genius
by
SHUBHAM KAUSHIK
Creating Data Lake using AWS S3, Glue, and AthenaThis article will store a large amount of data in the AWS S3 bucket and use AWS glue to store the metadata for this data. And then…
May 15, 2021
May 15, 2021