Pinned Loading
-
autopredict
autopredict Publicauto-predict is a python library to automate building of prediciton models,feature selection and some more handy utilities used in building prediction models.
Python 2
-
Airflow-Batch-Pipeline-S3-Redshift
Airflow-Batch-Pipeline-S3-Redshift PublicData pipeline that has been designed to automate the parsing of user search patterns from logs of a property hosting website. The pipeline extracts data from S3 , massages the data and puts the fin…
Jupyter Notebook
-
Airflow-Monitoring-ElasticSearch-LogStash-Kibana
Airflow-Monitoring-ElasticSearch-LogStash-Kibana PublicIn this project we look to set up Airflow monitoring using ElasticSearch-LogStash-Kibana (ELK stack). We will set up the stack using docker images.
-
Data-Streaming-Using-Aws-Kinesis-Lambda-S3
Data-Streaming-Using-Aws-Kinesis-Lambda-S3 PublicA production grade data pipeline that streams real time data of an online retailer website into AWS S3 data lake. The pipeline uses AWS Kinesis to receive real time streaming data and a lambda cons…
Python
-
Forex-Currency-Processing-Airflow-Hdfs-Hive-Spark
Forex-Currency-Processing-Airflow-Hdfs-Hive-Spark PublicWe build a Forex-currency rates pipeline to get currency rates from an external API and load the data into HDFS from where we use pyspark job to massage the data and insert it into a Hive table. Th…
Shell 1
-
genai-prod-catalog-enrichment
genai-prod-catalog-enrichment PublicGen AI tool for product catalog extraction
Python
If the problem persists, check the GitHub status page or contact support.